12 Human Secreted Proteins


Field of the Invention 

This invention relates to newly identified polynucleotides and the polypeptides 
encoded by these polynucleotides, uses of such polynucleotides and polypeptides, and 
their production. 

Background of the Invention 

Unlike bacteria, which exist as a single compartment surrounded by a
membrane, human and other eukaryotic cells are subdivided by membranes into many
functionally distinct compartments. Each membrane-bounded compartment, or 
organelle, contains different proteins essential for the function of the organelle. The cell 
uses "sorting signals," which are amino acid motifs located within the protein, to target 
proteins to particular cellular organelles. 

One type of sorting signal, called a signal sequence, a signal peptide, or a leader 
sequence, directs a class of proteins to an organelle called the endoplasmic reticulum 
(ER). The ER separates the membrane-bound proteins from all other types of proteins.
Once localized to the ER, both groups of proteins can be further directed to another 
organelle called the Golgi apparatus. Here, the Golgi distributes the proteins to vesicles, 
including secretory vesicles, the cell membrane, lysosomes, and the other organelles. 

Proteins targeted to the ER by a signal sequence can be released into the 
extracellular space as a secreted protein. For example, vesicles containing secreted
proteins can fuse with the cell membrane and release their contents into the extracellular
space - a process called exocytosis. Exocytosis can occur constitutively or after receipt 
of a triggering signal. In the latter case, the proteins are stored in secretory vesicles (or 
secretory granules) until exocytosis is triggered. Similarly, proteins residing on the cell 
membrane can also be secreted into the extracellular space by proteolytic cleavage of a
"linker" holding the protein to the membrane. 

Despite the great progress made in recent years, only a small number of genes 
encoding human secreted proteins have been identified. These secreted proteins include 
the commercially valuable human insulin, interferon, Factor VIII, human growth 
hormone, tissue plasminogen activator, and erythropoietin. Thus, in light of the
pervasive role of secreted proteins in human physiology, a need exists for identifying and
characterizing novel human secreted proteins and the genes that encode them. This 
knowledge will allow one to detect, to treat, and to prevent medical disorders by using 
secreted proteins or the genes that encode them. 

Summary of the Invention 

The present invention relates to novel polynucleotides and the encoded 
polypeptides. Moreover, the present invention relates to vectors, host cells, antibodies, 
and recombinant and synthetic methods for producing the polypeptides and
polynucleotides. Also provided are diagnostic methods for detecting disorders and
conditions related to the polypeptides and polynucleotides, and therapeutic methods for 
treating such disorders and conditions. The invention further relates to screening 
methods for identifying binding partners of the polypeptides.

Detailed Description

Definitions 

The following definitions are provided to facilitate understanding of certain terms 
used throughout this specification. 

In the present invention, "isolated" refers to material removed from its original 
environment (e.g., the natural environment if it is naturally occurring), and thus is altered 
"by the hand of man" from its natural state. For example, an isolated polynucleotide 
could be part of a vector or a composition of matter, or could be contained within a cell, 
and still be "isolated" because that vector, composition of matter, or particular cell is not 
the original environment of the polynucleotide. The term "isolated" does not refer to 
genomic or cDNA libraries, whole cell total or mRNA preparations, genomic DNA 
preparations (including those separated by electrophoresis and transferred onto blots), 
sheared whole cell genomic DNA preparations or other compositions where the art 
demonstrates no distinguishing features of the polynucleotide/sequences of the present 
invention. 

In the present invention, a "secreted" protein refers to those proteins capable of 
being directed to the ER, secretory vesicles, or the extracellular space as a result of a 
signal sequence, as well as those proteins released into the extracellular space without 
necessarily containing a signal sequence. If the secreted protein is released into the 
extracellular space, the secreted protein can undergo extracellular processing to produce 
a "mature" protein. Release into the extracellular space can occur by many mechanisms, 
including exocytosis and proteolytic cleavage. 

In specific embodiments, the polynucleotides of the invention are at least 15, at 
least 30, at least 50, at least 100, at least 125, at least 500, or at least 1000 contiguous
nucleotides but are less than or equal to 300 kb, 200 kb, 100 kb, 50 kb, 15 kb, 10 kb, 7.5
kb, 5 kb, 2.5 kb, 2.0 kb, or 1 kb, in length. In a further embodiment, polynucleotides of 
the invention comprise a portion of the coding sequences, as disclosed herein, but do not 
comprise all or a portion of any intron. In another embodiment, the polynucleotides 
comprising coding sequences do not contain coding sequences of a genomic flanking 
gene (i.e., 5' or 3' to the gene of interest in the genome). In other embodiments, the 
polynucleotides of the invention do not contain the coding sequence of more than 1000, 
500, 250, 100, 50, 25, 20, 15, 10, 5, 4, 3, 2, or 1 genomic flanking gene(s). 

As used herein, a "polynucleotide" refers to a molecule having a nucleic acid 
sequence contained in SEQ ID NO:X or the cDNA contained within the clone deposited 
with the ATCC. For example, the polynucleotide can contain the nucleotide sequence of 
the full length cDNA sequence, including the 5' and 3' untranslated sequences, the coding 
region, with or without the signal sequence, the secreted protein coding region, as well as 
fragments, epitopes, domains, and variants of the nucleic acid sequence. Moreover, as 
used herein, a "polypeptide" refers to a molecule having the translated amino acid 
sequence generated from the polynucleotide as broadly defined. 

In the present invention, the full length sequence identified as SEQ ID NO:X was
often generated by overlapping sequences contained in multiple clones (contig analysis). 
A representative clone containing all or most of the sequence for SEQ ID NO:X was 
deposited with the American Type Culture Collection ("ATCC"). As shown in Table 
XIII, each clone is identified by a cDNA Clone ID (Identifier) and the ATCC Deposit 
Number. The ATCC is located at 10801 University Boulevard, Manassas, Virginia 
20110-2209, USA. The ATCC deposit was made pursuant to the terms of the Budapest
Treaty on the international recognition of the deposit of microorganisms for purposes of 
patent procedure. 

A "polynucleotide" of the present invention also includes those polynucleotides 
capable of hybridizing, under stringent hybridization conditions, to sequences contained
in SEQ ID NO:X, the complement thereof, or the cDNA within the clone deposited with
the ATCC. "Stringent hybridization conditions" refers to an overnight incubation at
42°C in a solution comprising 50% formamide, 5x SSC (750 mM NaCl, 75 mM
trisodium citrate), 50 mM sodium phosphate (pH 7.6), 5x Denhardt's solution, 10%
dextran sulfate, and 20 µg/ml denatured, sheared salmon sperm DNA, followed by
washing the filters in 0.1x SSC at about 65°C.

Also contemplated are nucleic acid molecules that hybridize to the 
polynucleotides of the present invention at lower stringency hybridization conditions. 
Changes in the stringency of hybridization and signal detection are primarily
accomplished through the manipulation of formamide concentration (lower percentages
of formamide result in lowered stringency), salt conditions, or temperature. For
example, lower stringency conditions include an overnight incubation at 37°C in a
solution comprising 6X SSPE (20X SSPE = 3 M NaCl; 0.2 M NaH2PO4; 0.02 M EDTA,
pH 7.4), 0.5% SDS, 30% formamide, and 100 µg/ml salmon sperm blocking DNA, followed
by washes at 50°C with 1X SSPE, 0.1% SDS. In addition, to achieve even lower
stringency, washes performed following stringent hybridization can be done at higher
salt concentrations (e.g., 5X SSC).
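
The buffer strengths quoted above are simple dilutions of concentrated stocks. The following Python sketch works out the component concentrations for the strengths mentioned in the text. The 20X SSPE stock is taken from the text; the 20x SSC stock (3 M NaCl, 0.3 M trisodium citrate) is an assumption back-calculated from the stated 5x SSC composition. The function name is hypothetical and chosen only for illustration.

```python
# Stock compositions in mM for a 20x concentrate.
# SSPE values are quoted in the text; SSC values are an assumption,
# back-calculated from "5x SSC (750 mM NaCl, 75 mM trisodium citrate)".
STOCK_20X_MM = {
    "SSC": {"NaCl": 3000.0, "trisodium citrate": 300.0},
    "SSPE": {"NaCl": 3000.0, "NaH2PO4": 200.0, "EDTA": 20.0},
}


def working_concentrations(buffer, fold):
    """Return component concentrations (mM) of a `fold`-x working solution."""
    stock = STOCK_20X_MM[buffer]
    # Rounding only suppresses floating-point noise in the printed values.
    return {name: round(mm * fold / 20.0, 6) for name, mm in stock.items()}


for buffer, fold in [("SSC", 5), ("SSC", 0.1), ("SSPE", 6), ("SSPE", 1)]:
    print(f"{fold}x {buffer}: {working_concentrations(buffer, fold)}")
```

Under these assumptions the 0.1x SSC stringent wash contains roughly fifty-fold less salt than the 5x SSC hybridization solution, which is consistent with the statement that washing at higher salt concentrations lowers stringency.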

Note that variations in the above conditions may be accomplished through the 
inclusion and/or substitution of alternate blocking reagents used to suppress background 
in hybridization experiments. Typical blocking reagents include Denhardt's reagent, 
BLOTTO, heparin, denatured salmon sperm DNA, and commercially available 
proprietary formulations. The inclusion of specific blocking reagents may require 
modification of the hybridization conditions described above, due to problems with 
compatibility. 

Of course, a polynucleotide which hybridizes only to polyA+ sequences (such as 
any 3' terminal polyA+ tract of a cDNA shown in the sequence listing), or to a
complementary stretch of T (or U) residues, would not be included in the definition of
"polynucleotide," since such a polynucleotide would hybridize to any nucleic acid 
molecule containing a poly (A) stretch or the complement thereof (e.g., practically any 
double-stranded cDNA clone generated using oligo dT as a primer). 
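
As a rough illustration of this exclusion, the Python sketch below flags a candidate probe sequence that consists of nothing but A residues, or nothing but T or U residues. It only checks sequence composition and does not model hybridization; the function name is hypothetical.

```python
def is_homopolymer_tract_only(seq):
    """Return True if `seq` is purely poly(A), purely poly(T), or purely poly(U)."""
    residues = set(seq.upper())
    # An empty sequence is not treated as a tract.
    return residues in ({"A"}, {"T"}, {"U"})


for probe in ["AAAAAAAAAAAA", "TTTTTTTT", "AAAACGTAAAA"]:
    print(probe, is_homopolymer_tract_only(probe))
```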

The polynucleotide of the present invention can be composed of any 
polyribonucleotide or polydeoxyribonucleotide, which may be unmodified RNA or DNA
or modified RNA or DNA. For example, polynucleotides can be composed of single-
and double-stranded DNA, DNA that is a mixture of single- and double-stranded regions,
single- and double-stranded RNA, and RNA that is a mixture of single- and
double-stranded regions, hybrid molecules comprising DNA and RNA that may be
single-stranded or, more typically, double-stranded or a mixture of single- and double-stranded
regions. In addition, the polynucleotide can be composed of triple-stranded regions
comprising RNA or DNA or both RNA and DNA. A polynucleotide may also contain 
one or more modified bases or DNA or RNA backbones modified for stability or for 
other reasons. "Modified" bases include, for example, tritylated bases and unusual bases 
such as inosine. A variety of modifications can be made to DNA and RNA; thus,
"polynucleotide" embraces chemically, enzymatically, or metabolically modified forms.

The polypeptide of the present invention can be composed of amino acids joined 
to each other by peptide bonds or modified peptide bonds, i.e., peptide isosteres, and may 
contain amino acids other than the 20 gene-encoded amino acids. The polypeptides may 
be modified by either natural processes, such as posttranslational processing, or by 
chemical modification techniques which are well known in the art. Such modifications 
are well described in basic texts and in more detailed monographs, as well as in a 
voluminous research literature. Modifications can occur anywhere in a polypeptide, 
including the peptide backbone, the amino acid side-chains and the amino or carboxyl 
termini. It will be appreciated that the same type of modification may be present in the 
same or varying degrees at several sites in a given polypeptide. Also, a given
polypeptide may contain many types of modifications. Polypeptides may be branched,
for example, as a result of ubiquitination, and they may be cyclic, with or without
branching. Cyclic, branched, and branched cyclic polypeptides may result from
posttranslational natural processes or may be made by synthetic methods. Modifications
include acetylation, acylation, ADP-ribosylation, amidation, covalent attachment of
flavin, covalent attachment of a heme moiety, covalent attachment of a nucleotide or
nucleotide derivative, covalent attachment of a lipid or lipid derivative, covalent
attachment of phosphatidylinositol, cross-linking, cyclization, disulfide bond formation,
demethylation, formation of covalent cross-links, formation of cysteine, formation of
pyroglutamate, formylation, gamma-carboxylation, glycosylation, GPI anchor formation, 
hydroxylation, iodination, methylation, myristoylation, oxidation, pegylation, proteolytic 
processing, phosphorylation, prenylation, racemization, selenoylation, sulfation, transfer- 
RNA mediated addition of amino acids to proteins such as arginylation, and 
ubiquitination. (See, for instance, PROTEINS - STRUCTURE AND MOLECULAR 
PROPERTIES, 2nd Ed., T. E. Creighton, W. H. Freeman and Company, New York 
(1993); POSTTRANSLATIONAL COVALENT MODIFICATION OF PROTEINS, B. 
C. Johnson, Ed., Academic Press, New York, pgs. 1-12 (1983); Seifter et al., Meth 
Enzymol 182:626-646 (1990); Rattan et al., Ann NY Acad Sci 663:48-62 (1992).) 

"SEQ ID NO:X" refers to a polynucleotide sequence while "SEQ ID NO:Y" 
refers to a polypeptide sequence, both sequences identified by an integer specified in 
Table XIII. 

"A polypeptide having biological activity" refers to polypeptides exhibiting 
activity similar, but not necessarily identical to, an activity of a polypeptide of the 
present invention, including mature forms, as measured in a particular biological assay, 
with or without dose dependency. In the case where dose dependency does exist, it need 
not be identical to that of the polypeptide, but rather substantially similar to the
dose-dependence in a given activity as compared to the polypeptide of the present invention
(i.e., the candidate polypeptide will exhibit greater activity or not more than about
25-fold less and, preferably, not more than about tenfold less activity, and most preferably,
not more than about three-fold less activity relative to the polypeptide of the present
invention).
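
The fold-difference tiers in this definition can be sketched as a small Python helper. The text qualifies each cut-off with "about", so the hard thresholds used here are a simplifying assumption; the function name and the tier labels are hypothetical.

```python
def activity_tier(candidate_activity, reference_activity):
    """Classify a candidate by how many fold less active it is than the reference."""
    if candidate_activity <= 0 or reference_activity <= 0:
        raise ValueError("activities must be positive")
    # fold_less < 1 means the candidate is more active than the reference.
    fold_less = reference_activity / candidate_activity
    if fold_less <= 3:
        return "most preferred"
    if fold_less <= 10:
        return "preferred"
    if fold_less <= 25:
        return "within definition"
    return "outside definition"


# Activities are in arbitrary units from the same assay.
for candidate in (200, 100, 20, 5, 1):
    print(candidate, activity_tier(candidate, 100))
```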

Polynucleotides and Polypeptides of the Invention 
FEATURES OF PROTEIN ENCODED BY GENE NO: 1 

The translation product of this gene shares sequence homology with a protein 
from Xenopus laevis that is described as upregulated in response to thyroid hormone in 
tadpoles, and is thought to be important in the tail resorption process during Xenopus 
laevis metamorphosis (See Proc. Natl. Acad. Sci. USA (1996 Mar. 5); 93(5):1924-9,
which is herein incorporated by reference). In addition, the translation product of this gene
shares sequence homology with a recently described group of proteins, called hedgehog
interacting proteins (HIPs) (See International Publication No. WO98/12326, which is
herein incorporated by reference). These proteins bind to hedgehog polypeptides such as
Shh and Dhh with high affinity (Kd approx. 1 nM). HIPs exhibit spatially and temporally
restricted expression domains indicative of important roles in hedgehog-mediated
induction. They regulate differentiation of neuronal cells, survival of
differentiated neuronal cells, proliferation of chondrocytes, proliferation of testicular
germ line cells and/or expression of patched or hedgehog genes. The biological activity
of this polypeptide is assayed by techniques known in the art, otherwise disclosed herein
and as described in International Publication No. WO98/12326, which is herein
incorporated by reference. 

Preferred polypeptides of the invention comprise the following amino acid 
sequence:
MLRTSTPNLCGGLHCRAPWLSSGILCLCLIFLLGQVGLLQGHPQCLDYGPPFQPP

LHLEFCSDYESFGCCDQHKDRRIAARYWDIMEYFDLKRHELCGDYIKDILCQEC 

SPYAAHLYDAENTQTPLRNLPGLCSDYCSAFHSNCHSAISLLTNDRGLQESHGRD 

GTRFCHLLDLPDKDYCFPNVLRNDYLNRHLGMVAQDPQGCLQLCLSEVANGLR 

NPVSMVHAGDGTHRFFVAEQVGVVWVYLPDGSRLEQPFLDLKNIVLTTPWIGD 

ERGFLGLAFHPKFRHNRKFYIYYSCLDKKKVEKIRISEMKVSRADPNKADLKSER 

VILEIEEPASNHNGGQLLFGLDGYMYIFTGDGGQAGDPFGLFGNAQNKSSLLGK 

VLRIDVNRAGSHGKRYRVPSDNPFVSEPGAHPAIYAYGIRNMWRCAVDRGDPIT 

RQGRGRIFCGDVGQNRFEEVDLILKGGNYGWRAKEGFACYDKKLCHNASLDDV 

LPIYAYGHAVGKSVTGGYVYRGCESPNLNGLYIFGDFMSGRLMALQEDRKNKK 

WKKQDLCLGSTTSCAFPGLISTHSKFIISFAEDEAGELYFLATSYPSAYAPRGSIYK 

FVDPSRRAPPGKCKYKPVPVRTKSKRIPFRPLAKTVLDLLKEQSEKAARKSSSAT 

LASGPAQGLSEKGSSKKLASPTSSKNTLRGPGTKKKARVGPHVRQGKRRKSLKS 

HSGRMRPSAEQKRAGRSLP (SEQ ID NO:47). Also preferred are polypeptides
comprising the mature polypeptide which is predicted to consist of residues 42-724 of the
foregoing sequence, and biologically active fragments of the mature polypeptide.

Figures 1A-C show the nucleotide (SEQ ID NO:11) and deduced amino acid
sequence (SEQ ID NO:29) of this protein.

Figure 2 shows the regions of similarity between the amino acid sequences of 
SEQ ID NO:29, the Xenopus laevis tail resorption protein (gi|1234787) (SEQ ID
NO:48), and the Hedgehog Interacting Protein ("HIP"; gi|AAD31172.1) (SEQ ID
NO:49). 

Figure 3 shows an analysis of the amino acid sequence of SEQ ID NO: 29. 
Alpha, beta, turn and coil regions; hydrophilicity and hydrophobicity; amphipathic 
regions; flexible regions; antigenic index and surface probability are shown. 

Northern analysis indicates that a 2.5-3.0 kb transcript of this gene is expressed 
primarily in testes tissue and A549 lung carcinoma tissue, but interestingly is absent from
normal lung tissue. This gene is also expressed in osteoarthritis tissue and human fetal
tissues. 

The present invention provides isolated nucleic acid molecules comprising a 
polynucleotide encoding the polypeptide having the amino acid sequence shown in 
Figures 1A-C (SEQ ID NO:29), which was determined by sequencing a cloned cDNA.
The nucleotide sequence shown in Figures 1A-C (SEQ ID NO:11) was obtained by
sequencing a cloned cDNA, which was deposited on Nov. 17, 1998 at the American
Type Culture Collection, and given Accession Number 203484. The deposited gene is
inserted in the pSport plasmid (Life Technologies, Rockville, MD) using the SalI/NotI
restriction endonuclease cleavage sites. 

The present invention is further directed to fragments of the isolated nucleic acid 
molecules described herein. By a fragment of an isolated DNA molecule having the 
nucleotide sequence of the deposited cDNA or the nucleotide sequence shown in SEQ ID
NO:11 is intended DNA fragments at least about 15 nt, and more preferably at least about
20 nt, still more preferably at least about 30 nt, and even more preferably, at least about 
40 nt in length which are useful as diagnostic probes and primers as discussed herein. Of 
course, larger fragments 50-1500 nt in length are also useful according to the present 
invention, as are fragments corresponding to most, if not all, of the nucleotide sequence 
of the deposited cDNA or as shown in SEQ ID NO:11. By a fragment at least 20 nt in
length, for example, is intended fragments which include 20 or more contiguous bases 
from the nucleotide sequence of the deposited cDNA or the nucleotide sequence as 
shown in SEQ ID NO: 11. In this context "about" includes the particularly recited size, 
larger or smaller by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at both 
termini. Representative examples of polynucleotide fragments of the invention include, 
for example, fragments that comprise, or alternatively, consist of, a sequence from about 
nucleotide 1 to about 50, from about 51 to about 100, from about 101 to about 150, from 
about 151 to about 200, from about 201 to about 250, from about 251 to about 300, from
about 301 to about 350, from about 351 to about 400, from about 401 to about 450, from
about 451 to about 500, from about 501 to about 550, and from about 551 to about
570 of SEQ ID NO:11, or the complementary strand thereto, or the cDNA contained in
the deposited gene. In this context "about" includes the particularly recited ranges, larger 
or smaller by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at both termini. In 
additional embodiments, the polynucleotides of the invention encode functional 
attributes of the corresponding protein. 
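
The representative fragment boundaries listed above follow a regular pattern of consecutive 50-nucleotide windows, with a shorter final window ending at nucleotide 570. A minimal Python sketch of that enumeration follows; the function name is hypothetical, and the "about" tolerance of up to 5 nucleotides at either terminus is not modelled.

```python
def fragment_windows(total_length, window=50):
    """Return 1-based inclusive (start, end) pairs tiling 1..total_length."""
    return [
        (start, min(start + window - 1, total_length))
        for start in range(1, total_length + 1, window)
    ]


# 570 is the last nucleotide position named in the list of fragments above.
for start, end in fragment_windows(570):
    print(f"from about {start} to about {end}")
```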

Preferred embodiments of the invention in this regard include fragments that 
comprise alpha-helix and alpha-helix forming regions ("alpha-regions"), beta-sheet and 
beta-sheet forming regions ("beta-regions"), turn and turn-forming regions ("turn- 
regions"), coil and coil-forming regions ("coil-regions"), hydrophilic regions, 
hydrophobic regions, alpha amphipathic regions, beta amphipathic regions, flexible 
regions, surface-forming regions and high antigenic index regions. The data representing 
the structural or functional attributes of the protein set forth in Figure 3 and/or Table I, as 
described above, was generated using the various modules and algorithms of the 
DNA*STAR set on default parameters. In a preferred embodiment, the data presented in 
columns VIII, IX, XIII, and XIV of Table I can be used to determine regions of the 
protein which exhibit a high degree of potential for antigenicity. Regions of high 
antigenicity are determined from the data presented in columns VIII, IX, XIII, and/or
XIV by choosing values which represent regions of the polypeptide which are likely to 
be exposed on the surface of the polypeptide in an environment in which antigen 
recognition may occur in the process of initiation of an immune response. 
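
One way to turn a per-residue column of such a table into candidate regions is to look for contiguous runs of residues whose value meets a cut-off. The Python sketch below does this for a single column. The scores, the threshold, and the minimum run length are all hypothetical illustration values; the text does not specify how values are chosen, so this is not the procedure used to produce Table I.

```python
def high_index_regions(scores, threshold, min_length=1):
    """Return 1-based inclusive (start, end) runs where score >= threshold."""
    regions = []
    start = None
    for pos, score in enumerate(scores, start=1):
        if score >= threshold:
            if start is None:
                start = pos  # open a new run
        elif start is not None:
            if pos - start >= min_length:
                regions.append((start, pos - 1))
            start = None  # close the run
    # Close a run that extends to the last residue.
    if start is not None and len(scores) - start + 1 >= min_length:
        regions.append((start, len(scores)))
    return regions


# Hypothetical per-residue antigenic index values for a nine-residue stretch.
example_scores = [0.1, 1.6, 1.8, 2.0, 0.3, 1.7, 0.2, 1.9, 1.9]
print(high_index_regions(example_scores, threshold=1.5, min_length=2))
```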

Certain preferred regions in these regards are set out in Figure 3, but may, as 
shown in Table I, be represented or identified by using tabular representations of the data 
presented in Figure 3. The DNA*STAR computer algorithm used to generate Figure 3 
(set on the original default parameters) was used to present the data in Figure 3 in a
tabular format (See Table I). The tabular format of the data in Figure 3 is used to easily
determine specific boundaries of a preferred region. The above-mentioned preferred
regions set out in Figure 3 and in Table I include, but are not limited to, regions of the 
aforementioned types identified by analysis of the amino acid sequence set out in Figures
1A-C (SEQ ID NO:29). As set out in Figure 3 and in Table I, such preferred regions
include Garnier-Robson alpha-regions, beta-regions, turn-regions, and coil-regions,
Chou-Fasman alpha-regions, beta-regions, and turn-regions, Kyte-Doolittle hydrophilic
regions and Hopp-Woods hydrophobic regions, Eisenberg alpha- and beta-amphipathic
regions, Karplus-Schulz flexible regions, Jameson-Wolf regions of high antigenic index
and Emini surface-forming regions.

Even if deletion of one or more amino acids from the
N-terminus of a protein results in modification or loss of one or more biological
functions of the protein, other functional activities (e.g., biological activities, ability to 
multimerize, etc.) may still be retained. For example, the ability of shortened muteins to 
induce and/or bind to antibodies which recognize the complete or mature forms of the 
polypeptides generally will be retained when less than the majority of the residues of the 
complete or mature polypeptide are removed from the N-terminus. Whether a particular 
polypeptide lacking N-terminal residues of a complete polypeptide retains such 
immunologic activities can readily be determined by routine methods described herein 
and otherwise known in the art. It is not unlikely that a mutein with a large number of 
deleted N-terminal amino acid residues may retain some biological or immunogenic 
activities. In fact, peptides composed of as few as six amino acid residues may often 
evoke an immune response. 

Accordingly, the present invention further provides polypeptides having one or 
more residues deleted from the amino terminus of the amino acid sequence shown in 
Figures 1A-C, up to the alanine residue at position number 524, and polynucleotides
encoding such polypeptides. In particular, the present invention provides polypeptides
comprising the amino acid sequence of residues n1-524 of Figures 1A-C, where n1 is an
integer from 1 to 524 corresponding to the position of the amino acid residue in Figures
1A-C (which is identical to the sequence shown as SEQ ID NO:29). N-terminal deletions
of the polypeptide of the invention shown as SEQ ID NO:29 include polypeptides 
comprising the amino acid sequence of residues: V-2 to P-529; A-3 to P-529; Q-4 to P-529;
D-5 to P-529; P-6 to P-529; Q-7 to P-529; G-8 to P-529; C-9 to P-529; L-10 to P-529;
Q-11 to P-529; L-12 to P-529; C-13 to P-529; L-14 to P-529; S-15 to P-529; E-16 to P-529;
V-17 to P-529; A-18 to P-529; N-19 to P-529; G-20 to P-529; L-21 to P-529; R-22 to P-529;
N-23 to P-529; P-24 to P-529; V-25 to P-529; S-26 to P-529; M-27 to P-529; V-28 to P-529;
H-29 to P-529; A-30 to P-529; G-31 to P-529; D-32 to P-529; G-33 to P-529; T-34 to P-529;
H-35 to P-529; R-36 to P-529; F-37 to P-529; F-38 to P-529; V-39 to P-529; A-40 to P-529;
E-41 to P-529; Q-42 to P-529; V-43 to P-529; G-44 to P-529; V-45 to P-529; V-46 to P-529;
W-47 to P-529; V-48 to P-529; Y-49 to P-529; L-50 to P-529; P-51 to P-529; D-52 to P-529;
G-53 to P-529; S-54 to P-529; R-55 to P-529; L-56 to P-529; E-57 to P-529; Q-58 to P-529;
P-59 to P-529; F-60 to P-529; L-61 to P-529;
D-62 to P-529; L-63 to P-529; K-64 to P-529; N-65 to P-529; I-66 to P-529; V-67 to P-529;
L-68 to P-529; T-69 to P-529; T-70 to P-529; P-71 to P-529; W-72 to P-529; I-73 to P-529;
G-74 to P-529; D-75 to P-529; E-76 to P-529; R-77 to P-529; G-78 to P-529; F-79 to P-529;
L-80 to P-529; G-81 to P-529; L-82 to P-529; A-83 to P-529; F-84 to P-529; H-85 to P-529;
P-86 to P-529; K-87 to P-529; F-88 to P-529; R-89 to P-529; H-90 to P-529; N-91 to P-529;
R-92 to P-529; K-93 to P-529; F-94 to P-529; Y-95 to P-529; I-96 to P-529; Y-97 to P-529;
Y-98 to P-529; S-99 to P-529; C-100 to P-529; L-101 to P-529; D-102 to P-529; K-103 to P-529;
K-104 to P-529; K-105 to P-529; V-106 to P-529; E-107 to P-529; K-108 to P-529; I-109 to P-529;
R-110 to P-529; I-111 to P-529; S-112 to P-529; E-113 to P-529; M-114 to P-529; K-115 to P-529;
V-116 to P-529; S-117 to P-529; R-118 to P-529; A-119 to P-529; D-120 to P-529; P-121 to P-529;
N-122 to P-529; K-123 to P-529; A-124 to P-529; D-125 to P-529; L-126 to P-529; K-127 to P-529;
S-128 to P-529; E-129 to P-529; R-130 to P-529; V-131 to P-529; I-132 to P-529; L-133 to P-529;
E-134 to P-529; I-135 to P-529; E-136 to P-529; E-137 to P-529; P-138 to P-529; A-139 to P-529;
S-140 to P-529; N-141 to P-529; H-142 to P-529; N-143 to P-529; G-144 to P-529; G-145 to P-529;
Q-146 to P-529; L-147 to P-529; L-148 to P-529; F-149 to P-529; G-150 to P-529; L-151 to P-529;
D-152 to P-529; G-153 to P-529; Y-154 to P-529; M-155 to P-529; Y-156 to P-529; I-157 to P-529;
F-158 to P-529; T-159 to P-529;
G-160 to P-529; D-161 to P-529; G-162 to P-529; G-163 to P-529; Q-164 to P-529; A-165 to P-529;
G-166 to P-529; D-167 to P-529; P-168 to P-529; F-169 to P-529; G-170 to P-529; L-171 to P-529;
F-172 to P-529; G-173 to P-529; N-174 to P-529; A-175 to P-529; Q-176 to P-529; N-177 to P-529;
K-178 to P-529; S-179 to P-529; S-180 to P-529; L-181 to P-529; L-182 to P-529; G-183 to P-529;
K-184 to P-529; V-185 to P-529; L-186 to P-529; R-187 to P-529; I-188 to P-529; D-189 to P-529;
V-190 to P-529; N-191 to P-529; R-192 to P-529; A-193 to P-529; G-194 to P-529; S-195 to P-529;
H-196 to P-529; G-197 to P-529; K-198 to P-529; R-199 to P-529; Y-200 to P-529; R-201 to P-529;
V-202 to P-529; P-203 to P-529; S-204 to P-529; D-205 to P-529; N-206 to P-529; P-207 to P-529;
F-208 to P-529; V-209 to P-529; S-210 to P-529; E-211 to P-529; P-212 to P-529; G-213 to P-529;
A-214 to P-529; H-215 to P-529; P-216 to P-529; A-217 to P-529; I-218 to P-529; Y-219 to P-529;
A-220 to P-529; Y-221 to P-529; G-222 to P-529; I-223 to P-529; R-224 to P-529; N-225 to P-529;
M-226 to P-529; W-227 to P-529; R-228 to P-529; C-229 to P-529; A-230 to P-529; V-231 to P-529;
D-232 to P-529; R-233 to P-529; G-234 to P-529; D-235 to P-529; P-236 to P-529; I-237 to P-529;
T-238 to P-529;
R-239 to P-529; Q-240 to P-529; G-241 to P-529; R-242 to P-529; G-243 to P-529; R-244 to P-529;
I-245 to P-529; F-246 to P-529; C-247 to P-529; G-248 to P-529; D-249 to P-529; V-250 to P-529;
G-251 to P-529; Q-252 to P-529; N-253 to P-529; R-254 to P-529; F-255 to P-529; E-256 to P-529;
E-257 to P-529; V-258 to P-529; D-259 to P-529; L-260 to P-529; I-261 to P-529; L-262 to P-529;
K-263 to P-529; G-264 to P-529; G-265 to P-529; N-266 to P-529; Y-267 to P-529; G-268 to P-529;
W-269 to P-529; R-270 to P-529; A-271 to P-529; K-272 to P-529; E-273 to P-529; G-274 to P-529;
F-275 to P-529; A-276 to P-529; C-277 to P-529; Y-278 to P-529; D-279 to P-529; K-280 to P-529;
K-281 to P-529; L-282 to P-529; C-283 to P-529; H-284 to P-529; N-285 to P-529; A-286 to P-529;
S-287 to P-529; L-288 to P-529; D-289 to P-529; D-290 to P-529; V-291 to P-529; L-292 to P-529;
P-293 to P-529; I-294 to P-529; Y-295 to P-529; A-296 to P-529;
Y-297 to P-529; G-298 to P-529; H-299 to P-529; A-300 to P-529; V-301 to P-529; G-302 to P-529;
K-303 to P-529; S-304 to P-529; V-305 to P-529; T-306 to P-529; G-307 to P-529; G-308 to P-529;
Y-309 to P-529; V-310 to P-529; Y-311 to P-529; R-312 to P-529; G-313 to P-529; C-314 to P-529;
E-315 to P-529; S-316 to P-529; P-317 to P-529; N-318 to P-529; L-319 to P-529; N-320 to P-529;
G-321 to P-529; L-322 to P-529; Y-323 to P-529; I-324 to P-529; F-325 to P-529; G-326 to P-529;
D-327 to P-529; F-328 to P-529; M-329 to P-529; S-330 to P-529; G-331 to P-529; R-332 to P-529;
L-333 to P-529; M-334 to P-529; A-335 to P-529; L-336 to P-529; Q-337 to P-529; E-338 to P-529;
D-339 to P-529; R-340 to P-529; K-341 to P-529; N-342 to P-529; K-343 to P-529; K-344 to P-529;
W-345 to P-529; K-346 to P-529; K-347 to P-529; Q-348 to P-529; D-349 to P-529; L-350 to P-529;
C-351 to P-529; L-352 to P-529; G-353 to P-529; S-354 to P-529; T-355 to P-529; T-356 to P-529;
S-357 to P-529; C-358 to P-529; A-359 to P-529; F-360 to P-529; P-361 to P-529; G-362 to P-529;
L-363 to P-529; I-364 to P-529; S-365 to P-529; T-366 to P-529; H-367 to P-529; S-368 to P-529;
K-369 to P-529; F-370 to P-529; I-371 to P-529; I-372 to P-529; S-373 to P-529; F-374 to P-529;
A-375 to P-529; E-376 to P-529; D-377 to P-529; E-378 to P-529; A-379 to P-529; G-380 to P-529;
E-381 to P-529; L-382 to P-529; Y-383 to P-529; F-384 to P-529; L-385 to P-529; A-386 to P-529;
T-387 to P-529; S-388 to P-529; Y-389 to P-529; P-390 to P-529; S-391 to P-529; A-392 to P-529;
Y-393 to P-529; A-394 to P-529; P-395 to P-529; R-396 to P-529; G-397 to P-529; S-398 to P-529;
I-399 to P-529; Y-400 to P-529; K-401 to P-529; F-402 to P-529; V-403 to P-529; D-404 to P-529;
P-405 to P-529; S-406 to P-529; R-407 to P-529; R-408 to P-529; A-409 to P-529; P-410 to P-529;
P-411 to P-529; G-412 to P-529;
K-413 to P-529; C-414 to P-529; K-415 to P-529; Y-416 to P-529; K-417 to P-529; P-418 to P-529;
V-419 to P-529; P-420 to P-529; V-421 to P-529; R-422 to P-529; T-423 to P-529; K-424 to P-529;
S-425 to P-529; K-426 to P-529; R-427 to P-529; I-428 to P-529; P-429 to P-529; F-430 to P-529;
R-431 to P-529; P-432 to P-529; L-433 to P-529; A-434 to P-529; K-435 to P-529; T-436 to P-529;
V-437 to P-529; L-438 to P-529; D-439 to P-529; L-440 to P-529; L-441 to P-529; K-442 to P-529;
E-443 to P-529; Q-444 to P-529; S-445 to P-529; E-446 to P-529; K-447 to P-529; A-448 to P-529;
A-449 to P-529; R-450 to P-529; K-451 to P-529; S-452 to P-529; S-453 to P-529; S-454 to P-529;
A-455 to P-529; T-456 to P-529; L-457 to P-529; A-458 to P-529; S-459 to P-529; G-460 to P-529;
P-461 to P-529; A-462 to P-529; Q-463 to P-529; G-464 to P-529; L-465 to P-529; S-466 to P-529;
E-467 to P-529; K-468 to P-529; G-469 to P-529; S-470 to P-529; S-471 to P-529; K-472 to P-529;
K-473 to P-529; L-474 to P-529; A-475 to P-529; S-476 to P-529; P-477 to P-529; T-478 to P-529;
S-479 to P-529; S-480 to P-529; K-481 to P-529; N-482 to P-529; T-483 to P-529; L-484 to P-529;
R-485 to P-529; G-486 to P-529; P-487 to P-529; G-488 to P-529; T-489 to P-529; K-490 to P-529;
K-491 to P-529; K-492 to P-529; A-493 to P-529; R-494 to P-529; V-495 to P-529; G-496 to P-529;
P-497 to P-529; H-498 to P-529; V-499 to P-529; R-500 to P-529; Q-501 to P-529; G-502 to P-529;
K-503 to P-529; R-504 to P-529; R-505 to P-529; K-506 to P-529; S-507 to P-529; L-508 to P-529;
K-509 to P-529; S-510 to P-529; H-511 to P-529; S-512 to P-529; G-513 to P-529; R-514 to P-529;
M-515 to P-529; R-516 to P-529; P-517 to P-529; S-518 to P-529; A-519 to P-529; E-520 to P-529;
Q-521 to P-529; K-522 to P-529; R-523 to P-529; A-524 to P-529; of SEQ ID NO:29. Polypeptides encoded by these
polynucleotides are also encompassed by the invention. 

Also as mentioned above, even if deletion of one or more amino acids from the 
C-terminus of a protein results in modification or loss of one or more biological 
functions of the protein, other functional activities (e.g., biological activities (e.g., ability
to elicit mitogenic activity, induce differentiation of normal or malignant cells, bind to
EGF receptors, etc.)) may still be retained. For example, the ability to induce and/or bind
to antibodies which recognize the complete or mature forms of the polypeptide generally
will be retained when less than the majority of the residues of the complete or mature
polypeptide are removed from the C-terminus. Whether a particular polypeptide lacking
C-terminal residues of a complete polypeptide retains such immunologic activities can 
readily be determined by routine methods described herein and otherwise known in the 
art. It is not unlikely that a mutein with a large number of deleted C-terminal amino acid 
residues may retain some biological or immunogenic activities. In fact, peptides 
composed of as few as six amino acid residues may often evoke an immune response. 

Accordingly, the present invention further provides polypeptides having one or
more residues deleted from the carboxy terminus of the amino acid sequence of the
polypeptide shown in Figures 1A-C, up to the glutamine residue at position number 7,
and polynucleotides encoding such polypeptides. In particular, the present invention
provides polypeptides comprising the amino acid sequence of residues 1-m1 of Figures
1A-C, where m1 is an integer from 7 to 528 corresponding to the position of the amino
acid residue in Figures 1A-C. Moreover, the invention provides polynucleotides
encoding polypeptides comprising, or alternatively consisting of, the amino acid
sequence of C-terminal deletions of the polypeptide of the invention shown as SEQ ID
NO:29. Such C-terminal deletions include polypeptides comprising the amino acid
sequence of residues:
M-1 to L-528; M-1 to S-527; M-1 to R-526; M-1 to G-525; M-1 to A-524; M-1 to R-523;
M-1 to K-522; M-1 to Q-521; M-1 to E-520; M-1 to A-519; M-1 to S-518; M-1 to P-517;
M-1 to R-516; M-1 to M-515; M-1 to R-514; M-1 to G-513; M-1 to S-512; M-1 to H-511;
M-1 to S-510; M-1 to K-509; M-1 to L-508; M-1 to S-507; M-1 to K-506; M-1 to R-505;
M-1 to R-504; M-1 to K-503; M-1 to G-502; M-1 to Q-501; M-1 to R-500; M-1 to V-499;
M-1 to H-498; M-1 to P-497; M-1 to G-496; M-1 to V-495; M-1 to R-494; M-1 to A-493;
M-1 to K-492; M-1 to K-491; M-1 to K-490; M-1 to T-489; M-1 to G-488; M-1 to P-487;
M-1 to G-486; M-1 to R-485; M-1 to L-484; M-1 to T-483; M-1 to N-482; M-1 to K-481;
M-1 to S-480; M-1 to S-479; M-1 to T-478; M-1 to P-477; M-1 to S-476; M-1 to A-475;
M-1 to L-474; M-1 to K-473; M-1 to K-472; M-1 to S-471; M-1 to S-470; M-1 to G-469;
M-1 to K-468; M-1 to E-467; M-1 to S-466; M-1 to L-465; M-1 to G-464; M-1 to Q-463;
M-1 to A-462; M-1 to P-461; M-1 to G-460; M-1 to S-459; M-1 to A-458; M-1 to L-457;
M-1 to T-456; M-1 to A-455; M-1 to S-454; M-1 to S-453; M-1 to S-452; M-1 to K-451;
M-1 to R-450; M-1 to A-449; M-1 to A-448; M-1 to K-447; M-1 to E-446; M-1 to S-445;
M-1 to Q-444; M-1 to E-443; M-1 to K-442; M-1 to L-441; M-1 to L-440; M-1 to D-439;
M-1 to L-438; M-1 to V-437; M-1 to T-436; M-1 to K-435; M-1 to A-434; M-1 to L-433;
M-1 to P-432; M-1 to R-431; M-1 to F-430; M-1 to P-429; M-1 to I-428; M-1 to R-427;
M-1 to K-426; M-1 to S-425; M-1 to K-424; M-1 to T-423; M-1 to R-422; M-1 to V-421;
M-1 to P-420; M-1 to V-419; M-1 to P-418; M-1 to K-417; M-1 to Y-416; M-1 to K-415;
M-1 to C-414; M-1 to K-413; M-1 to G-412; M-1 to P-411; M-1 to P-410; M-1 to A-409;
M-1 to R-408; M-1 to R-407; M-1 to S-406; M-1 to P-405; M-1 to D-404; M-1 to V-403;
M-1 to F-402; M-1 to K-401; M-1 to Y-400; M-1 to I-399;
M-1 to S-398; M-1 to G-397; M-1 to R-396; M-1 to P-395; M-1 to A-394; M-1 to Y-393;
M-1 to A-392; M-1 to S-391; M-1 to P-390; M-1 to Y-389; M-1 to S-388; M-1 to T-387;
M-1 to A-386; M-1 to L-385; M-1 to F-384; M-1 to Y-383; M-1 to L-382; M-1 to E-381;
M-1 to G-380; M-1 to A-379; M-1 to E-378; M-1 to D-377; M-1 to E-376; M-1 to A-375;
M-1 to F-374; M-1 to S-373; M-1 to I-372; M-1 to I-371; M-1 to F-370; M-1 to K-369;
M-1 to S-368; M-1 to H-367; M-1 to T-366; M-1 to S-365; M-1 to I-364; M-1 to L-363;
M-1 to G-362; M-1 to P-361; M-1 to F-360; M-1 to A-359; M-1 to C-358; M-1 to S-357;
M-1 to T-356; M-1 to T-355; M-1 to S-354; M-1 to G-353; M-1 to L-352; M-1 to C-351;
M-1 to L-350; M-1 to D-349; M-1 to Q-348; M-1 to K-347; M-1 to K-346; M-1 to W-345;
M-1 to K-344; M-1 to K-343; M-1 to N-342; M-1 to K-341; M-1 to R-340; M-1 to D-339;
M-1 to E-338; M-1 to Q-337; M-1 to L-336; M-1 to A-335; M-1 to M-334; M-1 to L-333;
M-1 to R-332; M-1 to G-331; M-1 to S-330; M-1 to M-329; M-1 to F-328; M-1 to D-327;
M-1 to G-326; M-1 to F-325; M-1 to I-324; M-1 to Y-323; M-1 to L-322; M-1 to G-321;
M-1 to N-320; M-1 to L-319; M-1 to N-318; M-1 to P-317; M-1
to S-316; M-1 to E-315; M-1 to C-314; M-1 to G-313; M-1 to R-312; M-1 to Y-311;
M-1 to V-310; M-1 to Y-309; M-1 to G-308; M-1 to G-307; M-1 to T-306; M-1 to V-305;
M-1 to S-304; M-1 to K-303; M-1 to G-302; M-1 to V-301; M-1 to A-300; M-1 to H-299;
M-1 to G-298; M-1 to Y-297; M-1 to A-296; M-1 to Y-295; M-1 to I-294; M-1 to P-293;
M-1 to L-292; M-1 to V-291; M-1 to D-290; M-1 to D-289; M-1 to L-288; M-1 to S-287;
M-1 to A-286; M-1 to N-285; M-1 to H-284; M-1 to C-283; M-1 to L-282; M-1 to K-281;
M-1 to K-280; M-1 to D-279; M-1 to Y-278; M-1 to C-277; M-1 to A-276; M-1 to F-275;
M-1 to G-274; M-1 to E-273; M-1 to K-272; M-1 to A-271; M-1 to R-270; M-1 to W-269;
M-1 to G-268; M-1 to Y-267; M-1 to N-266; M-1 to G-265; M-1 to G-264; M-1 to K-263;
M-1 to L-262; M-1 to I-261; M-1 to L-260; M-1 to D-259; M-1 to V-258; M-1 to E-257;
M-1 to E-256; M-1 to F-255; M-1 to R-254; M-1 to N-253; M-1 to Q-252; M-1 to G-251;
M-1 to V-250; M-1 to D-249; M-1 to G-248; M-1 to C-247; M-1 to F-246; M-1 to I-245;
M-1 to R-244; M-1 to G-243; M-1 to R-242; M-1 to G-241; M-1 to Q-240; M-1 to R-239;
M-1 to T-238; M-1 to I-237; M-1 to P-236; M-1 to D-235; M-1 to G-234; M-1 to R-233;
M-1 to D-232; M-1 to V-231; M-1 to A-230; M-1 to C-229; M-1 to R-228; M-1 to W-227;
M-1 to M-226; M-1 to N-225; M-1 to R-224; M-1 to I-223; M-1 to G-222; M-1 to Y-221;
M-1 to A-220; M-1 to Y-219; M-1 to I-218; M-1 to A-217; M-1 to P-216; M-1 to H-215;
M-1 to A-214; M-1 to G-213; M-1 to P-212; M-1 to E-211; M-1 to S-210; M-1 to V-209;
M-1 to F-208; M-1 to P-207; M-1 to N-206; M-1 to D-205; M-1 to S-204; M-1 to P-203;
M-1 to V-202; M-1 to R-201; M-1 to Y-200; M-1 to R-199; M-1 to K-198; M-1 to G-197;
M-1 to H-196; M-1 to S-195; M-1 to G-194; M-1 to A-193; M-1 to R-192; M-1 to N-191;
M-1 to V-190; M-1 to D-189; M-1 to I-188;
M-1 to R-187; M-1 to L-186; M-1 to V-185; M-1 to K-184; M-1 to G-183; M-1 to L-182;
M-1 to L-181; M-1 to S-180; M-1 to S-179; M-1 to K-178; M-1 to N-177; M-1 to Q-176;
M-1 to A-175; M-1 to N-174; M-1 to G-173; M-1 to F-172; M-1 to L-171; M-1 to G-170;
M-1 to F-169; M-1 to P-168; M-1 to D-167; M-1 to G-166; M-1 to A-165; M-1 to Q-164;
M-1 to G-163; M-1 to G-162; M-1 to D-161; M-1 to G-160; M-1 to T-159; M-1 to F-158;
M-1 to I-157; M-1 to Y-156; M-1 to M-155; M-1 to Y-154; M-1 to G-153; M-1 to D-152;
M-1 to L-151; M-1 to G-150; M-1 to F-149; M-1 to L-148; M-1 to L-147; M-1 to Q-146;
M-1 to G-145; M-1 to G-144; M-1 to N-143; M-1 to H-142; M-1 to N-141; M-1 to S-140;
M-1 to A-139; M-1 to P-138; M-1 to E-137; M-1 to E-136; M-1 to I-135; M-1 to E-134;
M-1 to L-133; M-1 to I-132; M-1 to V-131; M-1 to R-130; M-1 to E-129; M-1 to S-128;
M-1 to K-127; M-1 to L-126; M-1 to D-125; M-1 to A-124; M-1 to K-123; M-1 to N-122;
M-1 to P-121; M-1 to D-120; M-1 to A-119; M-1 to R-118; M-1 to S-117; M-1 to V-116;
M-1 to K-115; M-1 to M-114; M-1 to E-113; M-1 to S-112; M-1 to I-111; M-1 to R-110;
M-1 to I-109; M-1 to K-108; M-1 to E-107; M-1 to V-106; M-1 to K-105; M-1 to K-104;
M-1 to K-103; M-1 to D-102; M-1 to L-101; M-1 to C-100; M-1 to S-99; M-1 to Y-98;
M-1 to Y-97; M-1 to I-96; M-1 to Y-95; M-1 to F-94;
M-1 to K-93; M-1 to R-92; M-1 to N-91; M-1 to H-90; M-1 to R-89; M-1 to F-88;
M-1 to K-87; M-1 to P-86; M-1 to H-85; M-1 to F-84; M-1 to A-83; M-1 to L-82;
M-1 to G-81; M-1 to L-80; M-1 to F-79; M-1 to G-78; M-1 to R-77; M-1 to E-76;
M-1 to D-75; M-1 to G-74; M-1 to I-73; M-1 to W-72; M-1 to P-71; M-1 to T-70;
M-1 to T-69; M-1 to L-68; M-1 to V-67; M-1 to I-66; M-1 to N-65; M-1 to K-64;
M-1 to L-63; M-1 to D-62; M-1 to L-61; M-1 to F-60; M-1 to P-59; M-1 to Q-58;
M-1 to E-57; M-1 to L-56; M-1 to R-55; M-1 to S-54; M-1 to G-53; M-1 to D-52;
M-1 to P-51; M-1 to L-50; M-1 to Y-49; M-1 to V-48; M-1 to W-47; M-1 to V-46;
M-1 to V-45; M-1 to G-44; M-1 to V-43; M-1 to Q-42; M-1 to E-41; M-1 to A-40;
M-1 to V-39; M-1 to F-38; M-1 to F-37; M-1 to R-36; M-1 to H-35; M-1 to T-34;
M-1 to G-33; M-1 to D-32; M-1 to G-31; M-1 to A-30; M-1 to H-29; M-1 to V-28;
M-1 to M-27; M-1 to S-26; M-1 to V-25; M-1 to P-24; M-1 to N-23; M-1 to R-22;
M-1 to L-21; M-1 to G-20; M-1 to N-19; M-1 to A-18; M-1 to V-17; M-1 to E-16;
M-1 to S-15; M-1 to L-14; M-1 to C-13; M-1 to L-12; M-1 to Q-11; M-1 to L-10;
M-1 to C-9; M-1 to G-8; M-1 to Q-7; of SEQ ID NO:29. Polypeptides encoded by
these polynucleotides are also encompassed by the invention.
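
The C-terminal deletion series recited above follows a simple rule: keep residues 1 through m1 and let m1 run downward from the penultimate residue to a chosen minimum. The following is a minimal Python sketch of that enumeration. The function name and the ten-residue toy sequence are hypothetical and serve only as an illustration; the toy sequence is not SEQ ID NO:29.

```python
def c_terminal_deletions(seq, min_end):
    """Yield (label, fragment) pairs for fragments spanning residues 1..m1.

    m1 runs from len(seq) - 1 (one residue removed) down to min_end,
    mirroring the "M-1 to X-m1" notation used in the text.
    """
    for m1 in range(len(seq) - 1, min_end - 1, -1):
        last_residue = seq[m1 - 1]  # positions in the text are 1-based
        yield f"{seq[0]}-1 to {last_residue}-{m1}", seq[:m1]


# Hypothetical toy sequence, used only to show the labelling scheme.
toy = "MKTAYIAKQR"
dels = list(c_terminal_deletions(toy, 7))
for label, fragment in dels:
    print(label, fragment)
```

For a 529-residue polypeptide with a minimum of 7, the same rule would give one fragment for each value of m1 from 528 down to 7.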

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are not
limited to, developmental disorders, degenerative disorders, osteoarthritis, and lung
cancer. Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of developing 
tissues, cartilage, and bone, expression of this gene at significantly higher or lower levels 
is routinely detected in certain tissues or cell types (e.g. bone, lung, cancerous and 
wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid and 
spinal fluid) or another tissue or cell sample taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic epitopes 
shown in SEQ ID NO: 29 as residues: Asp-52 to Glu-57, Arg-89 to Tyr-95, Asp-102 to
Glu-107, Ser-117 to Ser-128, Glu-137 to Gly-145, Arg-192 to Arg-199, Val-231 to
Gly-243, Val-250 to Glu-256, Arg-312 to Asn-318, Glu-338 to Asp-349, Pro-405 to Lys-417,
Thr-423 to Ile-428, Lys-442 to Ser-453, Glu-467 to Ala-475, Thr-478 to Arg-494, and
Pro-497 to Arg-526. Polynucleotides encoding said polypeptides are also provided.

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO: 11 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more
polynucleotides comprising a nucleotide sequence described by the general formula a-b,
where a is any integer from 1 to 2595 of SEQ ID NO:11, b is an integer from 15 to
2609, where both a and b correspond to the positions of nucleotide residues shown in
SEQ ID NO:11, and where b is greater than or equal to a + 14.
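
The a-b formula above can be read as a predicate on a pair of nucleotide positions. The short Python sketch below illustrates that reading. The function and parameter names are hypothetical; the bounds 2595, 15, and 2609 and the condition b >= a + 14 are taken from the text.

```python
def is_excluded_range(a, b, a_max=2595, b_min=15, b_max=2609):
    """Return True if positions a..b satisfy the general formula a-b.

    The condition b >= a + 14 means every described fragment spans
    at least 15 nucleotides (b - a + 1 >= 15).
    """
    return 1 <= a <= a_max and b_min <= b <= b_max and b >= a + 14


def count_excluded_ranges(a_max=2595, b_max=2609):
    """Count every (a, b) pair the formula describes."""
    return sum(b_max - (a + 14) + 1 for a in range(1, a_max + 1))


print(is_excluded_range(1, 15))       # shortest fragment at the 5' end
print(is_excluded_range(2595, 2609))  # shortest fragment at the 3' end
print(is_excluded_range(100, 110))    # only 11 nt long, so not described
```

The count grows roughly with the square of the sequence length, which is consistent with the statement that listing every related sequence would be cumbersome.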
FEATURES OF PROTEIN ENCODED BY GENE NO: 2

The translation product of this gene, sometimes referred to herein as TIDE (for 
Ten Integrin Domains with EGF homology), shares sequence homology with integrins, 
which are a superfamily of dimeric alpha-beta cell-surface glycoproteins that mediate the
adhesive functions of many cell types, enabling cells to interact with one another and
with the extracellular matrix (see Genomics 56, 169-178 (1999); all information and
references contained within this publication are hereby incorporated herein by reference).
Eight human integrin beta subunits have been described to date, and in combination with the
12 known alpha subunits form a large family of heterodimeric cell surface receptors that
mediate cell adhesion to counter-receptors on neighboring cells, and to ECM proteins 
(reviewed by Hynes, 1992). Integrin-ligand interactions are crucial for fundamental 
biological processes such as cell migration and motility, and lymphocyte extravasation. 

In another embodiment, polypeptides comprising the amino acid sequence of the 
open reading frame upstream of the predicted signal peptide are contemplated by the 
present invention. Specifically, polypeptides of the invention comprise the following 
amino acid sequence: 

TSTPPRAVPLPKSSQAAHQRNCNSGWSPGPASLGVRGSVCPAICWWHLS 

LLPPPSVNPTLQKCSSPGAAQELSMRPPGFRNFLLLASSLLFAGLSAVPQSFSPSLR 

SWPGAACRLSRAESERRCRAPGQPPGAALCHGRGRCDCGVCICHVTEPGMFFGP 

LCECHEWVCETYDGSTCAGHGKCDCGKCKCDQGWYGDACQYPTNCDLTKKK 

SNQMCKNSQDIICSNAGTCHCGRCKCDNSDGSGLVYGKFCECDDRECIDDETEEI 

CGGHGKCYCGNCYCKAGWHGDKCEFQCDITPWESKRRCTSPDGKICSNRGTCV 

CGECTCHDVDPTGDWGDIHGDTCECDERDCRAVYDRYSDDFCSGHGQCNCGR 

CDCKAGWYGKKCEHPQSCTLSAEESIRKCQGSSDLPCSGRGKCECGKCTCYPPG 

DRRVYGKTCECDDRRCEDLDGVVCGGHGTCSCGRCVCERGWFGKLCQHPRKC

NMTEEQSKNLCESADGILCSGKGSCHCGKCICSAEEWYISGEFCDCDDRDCDKH

DGLICTGNGICSCGNCECWDGWNGNACEIWLGSEYP (SEQ ID NO:50).
Polynucleotides encoding these polypeptides are also provided. 

Included in this invention as preferred domains are EGF-like domain signature 1 
and 2 domains, which were identified using the ProSite analysis tool (Swiss Institute of 
Bioinformatics). A sequence of about thirty to forty amino-acid residues long found in 
the sequence of epidermal growth factor (EGF) has been shown [1 to 6] to be present, in 
a more or less conserved form, in a large number of other, mostly animal proteins. The 
functional significance of EGF domains in what appear to be unrelated proteins is not yet 
clear. However, a common feature is that these repeats are found in the extracellular 
domain of membrane-bound proteins or in proteins known to be secreted (exception: 
prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have 
been shown (in EGF) to be involved in disulfide bonds. The main structure is a two- 
stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. 
Subdomains between the conserved cysteines strongly vary in length as shown in the 
following schematic representation of the EGF-like domain: 

                 +-------------------+        +-------------------------+
                 |                   |        |                         |
  x(4)-C-x(0,48)-C-x(3,12)-C-x(1,70)-C-x(1,6)-C-x(2)-G-a-x(0,21)-G-x(2)-C-x
       |                   |         ************************************
       +-------------------+

'C': conserved cysteine involved in a disulfide bond.
'G': often conserved glycine.
'a': often conserved aromatic amino acid.
'*': position of both patterns.
'x': any residue.

The region between the 5th and 6th cysteine contains two conserved glycines of which at
least one is present in most EGF-like domains. The consensus pattern is as follows:
C-x-C-x(5)-G-x(2)-C [The 3 C's are involved in disulfide bonds].

Preferred polypeptides of the invention comprise the following amino acid 
sequence: GKCDCGKCKCDQGWYGDACQYPTNCDLTK (SEQ ID NO: 51),
GGHGKCYCGNCYCKAGWHGDKCEFQCDIT (SEQ ID NO:52),
HGQCNCGRCDCKAGWYGKKCEHPQSCTLS (SEQ ID NO: 53),
HGTCSCGRCVCERGWFGKLCQHPRKCNMT (SEQ ID NO: 54),
GNGICSCGNCECWDGWNGNACEIWLGSEY (SEQ ID NO: 55), and
ICGGHGKCYCGNCYCKAGWHGDKCEFQCDITPWESK (SEQ ID NO: 73). 
Polynucleotides encoding these polypeptides are also provided. 

Further preferred are polypeptides comprising the EGF-like domain signature 1 
and 2 domains of the sequence referenced in Table I for this gene, and at least 5, 10, 15, 
20, 25, 30, 50, or 75 additional contiguous amino acid residues of this referenced 
sequence. The additional contiguous amino acid residues are N-terminal or C-terminal to
the EGF-like domain signature 1 and 2 domains.

Alternatively, the additional contiguous amino acid residues are both N-terminal
and C-terminal to the EGF-like domain signature 1 and 2 domains, wherein the total N- 
and C-terminal contiguous amino acid residues equal the specified number. The above 
preferred polypeptide domain is characteristic of a signature specific to EGF-like domain 
1 and 2 containing proteins. Based on the sequence similarity, the translation product of 
this gene is expected to share at least some biological activities with EGF-like containing 
proteins. Such activities are known in the art, some of which are described elsewhere 
herein. 

Included in this invention as preferred domains are integrins beta chain cysteine- 
rich domains, which were identified using the ProSite analysis tool (Swiss Institute of 
Bioinformatics). Integrins [7,8] are a large family of cell surface receptors that mediate 
cell to cell as well as cell to matrix adhesion. Some integrins recognize the R-G-D 
sequence in their extracellular matrix protein ligand. Structurally, integrins consist of a 
dimer of an alpha and a beta chain. Each subunit has a large N-terminal extracellular 
domain followed by a transmembrane domain and a short C-terminal cytoplasmic region. 
Some receptors share a common beta chain while having different alpha chains. All the
integrin beta chains contain four repeats of a forty amino acid region in the C-terminal
extremity of their extracellular domain. Each of the repeats contains eight cysteines. The
consensus pattern is as follows: C-x-[GNQ]-x(1,3)-G-x-C-x-C-x(2)-C-x-C [The five C's
are probably involved in disulfide bonds].
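
The two consensus patterns given in this section (the EGF-like pattern and the integrin beta chain cysteine-rich pattern) use ProSite notation, which maps directly onto regular expressions. The Python sketch below illustrates that mapping. The converter `prosite_to_regex` is a hypothetical helper that handles only the subset of the notation appearing in these two patterns (single residues, `x`, bracketed alternatives, and repeat counts); it is not the ProSite analysis tool itself. The test fragments are the sequences printed in the text as SEQ ID NO:51 and SEQ ID NO:58.

```python
import re


def prosite_to_regex(pattern):
    """Translate a simple ProSite pattern into a Python regular expression."""
    element_re = re.compile(r"(x|[A-Z]|\[[A-Z]+\])(?:\((\d+)(?:,(\d+))?\))?")
    parts = []
    for element in pattern.split("-"):
        match = element_re.fullmatch(element)
        if match is None:
            raise ValueError(f"unsupported pattern element: {element!r}")
        residue, low, high = match.groups()
        token = "." if residue == "x" else residue  # 'x' means any residue
        if low is not None:
            token += "{" + low + ("," + high if high else "") + "}"
        parts.append(token)
    return "".join(parts)


egf_regex = prosite_to_regex("C-x-C-x(5)-G-x(2)-C")
integrin_regex = prosite_to_regex("C-x-[GNQ]-x(1,3)-G-x-C-x-C-x(2)-C-x-C")

egf_hit = re.search(egf_regex, "GKCDCGKCKCDQGWYGDACQYPTNCDLTK")
integrin_hit = re.search(integrin_regex, "ETYDGSTCAGHGKCDCGKCKCDQGWYGDACQYP")

print(egf_regex, egf_hit.group())
print(integrin_regex, integrin_hit.group())
```

A regular-expression scan of this kind only locates candidate matches to the consensus; it says nothing about whether the disulfide bonds actually form.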

Preferred polypeptides of the invention comprise the following amino acid 
sequence: GQPPGAALCHGRGRCDCGVCICHVTEPGMFFGPLC (SEQ ID NO: 74), 
ETYDGSTCAGHGKCDCGKCKCDQGWYGDACQYP (SEQ ID NO:58), 
MCKNSQDIICSNAGTCHCGRCKCDNSDGSGLVYG (SEQ ID NO:59), 
IDDETEEICGGHGKCYCGNCYCKAGWHGDKC (SEQ ID NO:60),
KRRCTSPDGKICSNRGTCVCGECTCHDVDPTGDW (SEQ ID NO:61), 
DRYSDDFCSGHGQCNCGRCDCKAGWYGKKCEHPQ (SEQ ID NO:62), 
CQGSSDLPCSGRGKCECGKCTCYPPGDRRVYGK (SEQ ID NO:63), 
CEDLDGVVCGGHGTCSCGRCVCERGWFGKLC (SEQ ID NO:64), 
SADGILCSGKGSCHCGKCICSAEEWYISGEFC (SEQ ID NO:65), and 
CDKHDGLICTGNGICSCGNCECWDGWNGNACEI (SEQ ID NO: 66). 
Polynucleotides encoding these polypeptides are also provided. 

Further preferred are polypeptides comprising the integrins beta chain cysteine- 
rich domain of the sequence referenced in Table XIII for this gene, and at least 5, 10, 15, 
20, 25, 30, 50, or 75 additional contiguous amino acid residues of this referenced 
sequence. The additional contiguous amino acid residues are N-terminal or C-terminal to
the integrins beta chain cysteine-rich domain.

Alternatively, the additional contiguous amino acid residues are both N-terminal
and C-terminal to the integrins beta chain cysteine-rich domain, wherein the total N- and 
C-terminal contiguous amino acid residues equal the specified number. The above 
preferred polypeptide domain is characteristic of a signature specific to integrin proteins. 
Based on the sequence similarity, the translation product of this gene is expected to share 
at least some biological activities with integrin proteins, and specifically those containing
an integrins beta chain cysteine-rich domain. Such activities are known in the art, some
of which are described elsewhere herein. The following publications were referenced
above and are hereby incorporated herein by reference: [1] Davis C.G., New Biol.
2:410-419(1990); [2] Blomquist M.C., Hunt L.T., Barker W.C., Proc. Natl. Acad. Sci.
U.S.A. 81:7363-7367(1984); [3] Barker W.C., Johnson G.C., Hunt L.T., George D.G.,
Protein Nucl. Acid Enz. 29:54-68(1986); [4] Doolittle R.F., Feng D.F., Johnson M.S.,
Nature 307:558-560(1984); [5] Appella E., Weber I.T., Blasi F., FEBS Lett.
231:1-4(1988); [6] Campbell I.D., Bork P., Curr. Opin. Struct. Biol. 3:385-392(1993); [7]
Hynes R.O., Cell 48:549-554(1987); and [8] Albelda S.M., Buck C.A., FASEB J.
4:2868-2880(1990).

The polypeptide of the present invention has been putatively identified as a 
member of the integrin family and has been termed Ten Integrin Domains with EGF 
homology ("TIDE"). This identification has been made as a result of amino acid 
sequence homology to the human integrin beta-8 subunit (See Genbank Accession No.
gi|184521).

Figures 4A-C show the nucleotide (SEQ ID NO: 12) and deduced amino acid
sequence (SEQ ID NO: 30) of TIDE. Predicted amino acids from about 1 to about 23 
constitute the predicted signal peptide (amino acid residues from about 1 to about 23 in 
SEQ ID NO: 30) and are represented by the underlined amino acid regions; amino acids 
from about 108 to about 136, from about 195 to about 223, from about 291 to about 319, 
from about 379 to about 407, and/or from about 465 to about 493 constitute the predicted 
EGF-like domain signature 1 and 2 domains (amino acids from about 108 to about 136,
from about 195 to about 223, from about 291 to about 319, from about 379 to about 407, 
and/or from about 465 to about 493 in SEQ ID NO:30) and are represented by the double 
underlined amino acids; and amino acids from about 55 to about 89, from about 97 to 
about 129, from about 142 to about 175, from about 186 to about 216, from about 228 to 
about 261, from about 281 to about 314, from about 327 to about 359, from about 368 to
about 398, from about 417 to about 448, and/or from about 455 to about 487 constitute
the predicted integrins beta chain cysteine-rich domains (amino acids from about 55 to 
about 89, from about 97 to about 129, from about 142 to about 175, from about 186 to 
about 216, from about 228 to about 261, from about 281 to about 314, from about 327 to 
about 359, from about 368 to about 398, from about 417 to about 448, and/or from about 
455 to about 487 in SEQ ID NO:30) and are represented by the shaded amino acids. 

Figure 5 shows the regions of similarity between the amino acid sequences of the 
Ten Integrin Domains with EGF homology (TIDE) protein (SEQ ID NO: 30) and the 
human integrin beta-8 subunit (SEQ ID NO: 67). 

Figure 6 shows an analysis of the Ten Integrin Domains with EGF homology 
(TIDE) amino acid sequence. Alpha, beta, turn and coil regions; hydrophilicity and 
hydrophobicity; amphipathic regions; flexible regions; antigenic index and surface 
probability are shown. 

A polynucleotide encoding a polypeptide of the present invention is obtained 
from human osteoblasts, synovial hypoxia tissue, osteoblast and osteoclast, bone marrow 
stromal cells, umbilical vein, smooth muscle, placenta, and fetal lung. The 
polynucleotide of this invention was discovered in a human osteoblast II cDNA library. 
Its translation product has homology to the characteristic integrins beta chain cysteine- 
rich domains of integrin family members. The polynucleotide contains an open reading 
frame encoding the TIDE polypeptide of 494 amino acids. TIDE exhibits a high degree 
of homology at the amino acid level to the human integrin beta-8 subunit (as shown in 
Figure 5). 

The present invention provides isolated nucleic acid molecules comprising a 
polynucleotide encoding the TIDE polypeptide having the amino acid sequence shown in 
Figures 4A-C (SEQ ID NO:30). The nucleotide sequence shown in Figures 4A-C (SEQ 
ID NO: 12) was obtained by sequencing a cloned cDNA (HOHCH55), which was 
deposited on November 17 at the American Type Culture Collection, and given
Accession Number 203484. The present invention is further directed to fragments of the
isolated nucleic acid molecules described herein. By a fragment of an isolated DNA 
molecule having the nucleotide sequence of the deposited cDNA or the nucleotide 
sequence shown in SEQ ID NO: 12 is intended DNA fragments at least about 15 nt, and
more preferably at least about 20 nt, still more preferably at least about 30 nt, and even 
more preferably, at least about 40 nt in length which are useful as diagnostic probes and 
primers as discussed herein. Of course, larger fragments 50-1500 nt in length are also 
useful according to the present invention, as are fragments corresponding to most, if not 
all, of the nucleotide sequence of the deposited cDNA or as shown in SEQ ID NO: 12. By 
a fragment at least 20 nt in length, for example, is intended fragments which include 20 
or more contiguous bases from the nucleotide sequence of the deposited cDNA or the 
nucleotide sequence as shown in SEQ ID NO:12. In this context "about" includes the 
particularly recited size, larger or smaller by several (5, 4, 3, 2, or 1) nucleotides, at 
either terminus or at both termini. Representative examples of TIDE polynucleotide 
fragments of the invention include, for example, fragments that comprise, or 
alternatively, consist of, a sequence from about nucleotide 1 to about 50, from about 51 
to about 100, from about 101 to about 150, from about 151 to about 200, from about 201 
to about 250, from about 251 to about 300, from about 301 to about 350, from about 351 
to about 400, from about 401 to about 450, from about 451 to about 500, from about 501 
to about 550, from about 551 to about 600, from about 601 to about 650, from about 651 
to about 700, from about 701 to about 750, from about 751 to about 800, from about 801 
to about 850, from about 851 to about 900, from about 901 to about 950, from about 951 
to about 1000, from about 1001 to about 1050, from about 1051 to about 1100, from
about 1101 to about 1150, from about 1151 to about 1200, from about 1201 to about 
1250, from about 1251 to about 1300, from about 1301 to about 1350, from about 1351 
to about 1400, from about 1401 to about 1450, from about 1451 to about 1500, from 
about 1501 to about 1550, from about 1551 to about 1600, from about 1601 to about
1650, from about 1651 to about 1700, from about 1701 to about 1750, from about 1751
to about 1800, from about 1801 to about 1850, from about 1851 to about 1900, from
about 1901 to about 1950, from about 1951 to about 2000, from about 2001 to about
2050, from about 2051 to about 2100, from about 2101 to about 2150, from about 2151
to about 2200, from about 2201 to about 2250, from about 2251 to about 2300, from
about 2301 to about 2350, from about 2351 to about 2400, from about 2401 to about
2450, from about 2451 to about 2499, from about 289 to about 1705, and/or from about
221 to about 1705 of SEQ ID NO: 12, or the complementary strand thereto, or the cDNA
contained in the deposited gene. In this context "about" includes the particularly recited
ranges, larger or smaller by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at
both termini.
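
The representative fragments listed above tile the nucleotide sequence in consecutive 50-nucleotide windows, and "about" allows each terminus to shift by up to 5 nucleotides. The Python sketch below illustrates both ideas. The function names are hypothetical; the window size of 50, the total length of 2499, and the tolerance of 5 are read from the ranges recited in the text.

```python
def fragment_windows(length, size=50):
    """Return 1-based, inclusive (start, end) windows that tile the sequence."""
    return [
        (start, min(start + size - 1, length))
        for start in range(1, length + 1, size)
    ]


def matches_about(fragment, recited, slack=5):
    """True if both termini lie within `slack` nucleotides of the recited range."""
    (start, end), (r_start, r_end) = fragment, recited
    return abs(start - r_start) <= slack and abs(end - r_end) <= slack


windows = fragment_windows(2499)
print(windows[0], windows[-1], len(windows))
print(matches_about((3, 54), windows[0]))  # both termini shifted by fewer than 5 nt
print(matches_about((1, 56), windows[0]))  # 3' terminus shifted by 6 nt
```

The final window is shorter than 50 nucleotides because the recited ranges stop at position 2499.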

Preferred nucleic acid fragments of the present invention include nucleic acid
molecules encoding a member selected from the group: a polypeptide comprising or
alternatively, consisting of, the mature TIDE protein (amino acid residues from about
221 to about 1705 in Figures 4A-C (amino acids from about 221 to about 1705 in SEQ
ID NO: 30)). Since the location of these domains has been predicted by computer
analysis, one of ordinary skill would appreciate that the amino acid residues constituting
these domains may vary slightly (e.g., by about 1 to 15 amino acid residues) depending
on the criteria used to define each domain. In additional embodiments, the
polynucleotides of the invention encode functional attributes of TIDE.

Preferred embodiments of the invention in this regard include fragments that 
comprise alpha-helix and alpha-helix forming regions ("alpha-regions"), beta-sheet and 
beta-sheet forming regions ("beta-regions"), turn and turn-forming regions
("turn-regions"), coil and coil-forming regions ("coil-regions"), hydrophilic regions,
hydrophobic regions, alpha amphipathic regions, beta amphipathic regions, flexible
regions, surface-forming regions and high antigenic index regions of TIDE. The data
representing the structural or functional attributes of TIDE set forth in Figure 6 and/or
Table II, as described above, was generated using the various modules and algorithms of
the DNA*STAR set on default parameters. In a preferred embodiment, the data 
presented in columns VIII, IX, XIII, and XIV of Table II can be used to determine 
regions of TIDE which exhibit a high degree of potential for antigenicity. Regions of 
high antigenicity are determined from the data presented in columns VIII, IX, XIII, 
and/or XIV by choosing values which represent regions of the polypeptide which are 
likely to be exposed on the surface of the polypeptide in an environment in which antigen 
recognition may occur in the process of initiation of an immune response. 

Certain preferred regions in these regards are set out in Figure 6, but may, as 
shown in Table II, be represented or identified by using tabular representations of the 
data presented in Figure 6. The DNA*STAR computer algorithm used to generate Figure 
6 (set on the original default parameters) was used to present the data in Figure 6 in a 
tabular format (See Table II). The tabular format of the data in Figure 6 is used to easily 
determine specific boundaries of a preferred region. The above-mentioned preferred 
regions set out in Figure 6 and in Table II include, but are not limited to, regions of the 
aforementioned types identified by analysis of the amino acid sequence set out in Figures 
4A-C. As set out in Figure 6 and in Table II, such preferred regions include Garnier- 
Robson alpha-regions, beta-regions, turn-regions, and coil-regions, Chou-Fasman alpha- 
regions, beta-regions, and turn-regions, Kyte-Doolittle hydrophilic regions and Hopp- 
Woods hydrophobic regions, Eisenberg alpha- and beta-amphipathic regions, Karplus- 
Schulz flexible regions, Jameson-Wolf regions of high antigenic index and Emini 
surface-forming regions. Even if deletion of one or more amino acids from the 
N-terminus of a protein results in modification or loss of one or more biological functions 
of the protein, other functional activities (e.g., biological activities, ability to 
multimerize, etc.) may still be retained. For example, the ability of shortened TIDE 
muteins to induce and/or bind to antibodies which recognize the complete or mature 
forms of the polypeptides generally will be retained when less than the majority of the 
residues of the complete or mature polypeptide are removed from the N-terminus. 
Whether a particular polypeptide lacking N-terminal residues of a complete polypeptide 
retains such immunologic activities can readily be determined by routine methods 
described herein and otherwise known in the art. It is not unlikely that a TIDE mutein 
with a large number of deleted N-terminal amino acid residues may retain some 
biological or immunogenic activities. In fact, peptides composed of as few as six TIDE 
amino acid residues may often evoke an immune response. 

Accordingly, the present invention further provides polypeptides having one or 
more residues deleted from the amino terminus of the TIDE amino acid sequence shown 
in Figures 4A-C, up to the leucine residue at position number 489, and polynucleotides 
encoding such polypeptides. In particular, the present invention provides polypeptides 
comprising the amino acid sequence of residues n1-494 of Figures 4A-C, where n1 is an 
integer from 2 to 489 corresponding to the position of the amino acid residue in Figures 
4A-C (which is identical to the sequence shown as SEQ ID NO:30). In another 
embodiment, N-terminal deletions of the TIDE polypeptide can be described by the 
general formula n2-494, where n2 is a number from 2 to 489, corresponding to the 
position of amino acid identified in Figures 4A-C. N-terminal deletions of the TIDE 
polypeptide of the invention shown as SEQ ID NO:30 include polypeptides comprising 
the amino acid 
sequence of residues: 
R-2 to P-494; P-3 to P-494; P-4 to P-494; G-5 to P-494; F-6 to P-494; R-7 to P-494; 
N-8 to P-494; F-9 to P-494; L-10 to P-494; L-11 to P-494; L-12 to P-494; A-13 to P-494; 
S-14 to P-494; S-15 to P-494; L-16 to P-494; L-17 to P-494; F-18 to P-494; A-19 to P-494; 
G-20 to P-494; L-21 to P-494; S-22 to P-494; A-23 to P-494; V-24 to P-494; P-25 to P-494; 
Q-26 to P-494; S-27 to P-494; F-28 to P-494; S-29 to P-494; P-30 to P-494; S-31 to P-494; 
L-32 to P-494; R-33 to P-494; S-34 to P-494; W-35 to P-494; P-36 to P-494; G-37 to P-494; 
A-38 to P-494; A-39 to P-494; C-40 to P-494; R-41 to P-494; L-42 to P-494; S-43 to P-494; 
R-44 to P-494; A-45 to P-494; E-46 to P-494; S-47 to P-494; E-48 to P-494; R-49 to P-494; 
R-50 to P-494; C-51 to P-494; R-52 to P-494; A-53 to P-494; P-54 to P-494; G-55 to P-494; 
Q-56 to P-494; P-57 to P-494; P-58 to P-494; G-59 to P-494; A-60 to P-494; A-61 to P-494; 
L-62 to P-494; C-63 to P-494; H-64 to P-494; G-65 to P-494; R-66 to P-494; G-67 to P-494; 
R-68 to P-494; C-69 to P-494; D-70 to P-494; C-71 to P-494; G-72 to P-494; V-73 to P-494; 
C-74 to P-494; I-75 to P-494; C-76 to P-494; H-77 to P-494; V-78 to P-494; T-79 to P-494; 
E-80 to P-494; P-81 to P-494; G-82 to P-494; M-83 to P-494; F-84 to P-494; F-85 to P-494; 
G-86 to P-494; P-87 to P-494; L-88 to P-494; C-89 to P-494; E-90 to P-494; C-91 to P-494; 
H-92 to P-494; E-93 to P-494; W-94 to P-494; V-95 to P-494; C-96 to P-494; E-97 to P-494; 
T-98 to P-494; Y-99 to P-494; D-100 to P-494; G-101 to P-494; S-102 to P-494; T-103 to P-494; 
C-104 to P-494; A-105 to P-494; G-106 to P-494; H-107 to P-494; G-108 to P-494; K-109 to P-494; 
C-110 to P-494; D-111 to P-494; C-112 to P-494; G-113 to P-494; K-114 to P-494; C-115 to P-494; 
K-116 to P-494; C-117 to P-494; D-118 to P-494; Q-119 to P-494; G-120 to P-494; W-121 to P-494; 
Y-122 to P-494; G-123 to P-494; D-124 to P-494; A-125 to P-494; C-126 to P-494; Q-127 to P-494; 
Y-128 to P-494; P-129 to P-494; T-130 to P-494; N-131 to P-494; C-132 to P-494; D-133 to P-494; 
L-134 to P-494; T-135 to P-494; K-136 to P-494; K-137 to P-494; K-138 to P-494; S-139 to P-494; 
N-140 to P-494; Q-141 to P-494; M-142 to P-494; C-143 to P-494; K-144 to P-494; N-145 to P-494; 
S-146 to P-494; Q-147 to P-494; D-148 to P-494; I-149 to P-494; I-150 to P-494; C-151 to P-494; 
S-152 to P-494; N-153 to P-494; A-154 to P-494; G-155 to P-494; T-156 to P-494; C-157 to P-494; 
H-158 to P-494; C-159 to P-494; G-160 to P-494; R-161 to P-494; C-162 to P-494; K-163 to P-494; 
C-164 to P-494; D-165 to P-494; N-166 to P-494; S-167 to P-494; D-168 to P-494; G-169 to P-494; 
S-170 to P-494; G-171 to P-494; L-172 to P-494; V-173 to P-494; Y-174 to P-494; G-175 to P-494; 
K-176 to P-494; F-177 to P-494; C-178 to P-494; E-179 to P-494; C-180 to P-494; D-181 to P-494; 
D-182 to P-494; R-183 to P-494; E-184 to P-494; C-185 to P-494; I-186 to P-494; D-187 to P-494; 
D-188 to P-494; E-189 to P-494; T-190 to P-494; E-191 to P-494; E-192 to P-494; I-193 to P-494; 
C-194 to P-494; G-195 to P-494; G-196 to P-494; H-197 to P-494; G-198 to P-494; K-199 to P-494; 
C-200 to P-494; Y-201 to P-494; C-202 to P-494; G-203 to P-494; N-204 to P-494; C-205 to P-494; 
Y-206 to P-494; C-207 to P-494; K-208 to P-494; A-209 to P-494; G-210 to P-494; W-211 to P-494; 
H-212 to P-494; G-213 to P-494; D-214 to P-494; K-215 to P-494; C-216 to P-494; E-217 to P-494; 
F-218 to P-494; Q-219 to P-494; C-220 to P-494; D-221 to P-494; I-222 to P-494; T-223 to P-494; 
P-224 to P-494; W-225 to P-494; E-226 to P-494; S-227 to P-494; K-228 to P-494; R-229 to P-494; 
R-230 to P-494; C-231 to P-494; T-232 to P-494; S-233 to P-494; P-234 to P-494; D-235 to P-494; 
G-236 to P-494; K-237 to P-494; I-238 to P-494; C-239 to P-494; S-240 to P-494; N-241 to P-494; 
R-242 to P-494; G-243 to P-494; T-244 to P-494; C-245 to P-494; V-246 to P-494; C-247 to P-494; 
G-248 to P-494; E-249 to P-494; C-250 to P-494; T-251 to P-494; C-252 to P-494; H-253 to P-494; 
D-254 to P-494; V-255 to P-494; D-256 to P-494; P-257 to P-494; T-258 to P-494; G-259 to P-494; 
D-260 to P-494; W-261 to P-494; G-262 to P-494; D-263 to P-494; I-264 to P-494; H-265 to P-494; 
G-266 to P-494; D-267 to P-494; T-268 to P-494; C-269 to P-494; E-270 to P-494; C-271 to P-494; 
D-272 to P-494; E-273 to P-494; R-274 to P-494; D-275 to P-494; C-276 to P-494; R-277 to P-494; 
A-278 to P-494; V-279 to P-494; Y-280 to P-494; D-281 to P-494; R-282 to P-494; Y-283 to P-494; 
S-284 to P-494; D-285 to P-494; D-286 to P-494; F-287 to P-494; C-288 to P-494; S-289 to P-494; 
G-290 to P-494; H-291 to P-494; G-292 to P-494; Q-293 to P-494; C-294 to P-494; N-295 to P-494; 
C-296 to P-494; G-297 to P-494; R-298 to P-494; C-299 to P-494; D-300 to P-494; C-301 to P-494; 
K-302 to P-494; A-303 to P-494; G-304 to P-494; W-305 to P-494; Y-306 to P-494; G-307 to P-494; 
K-308 to P-494; K-309 to P-494; C-310 to P-494; E-311 to P-494; H-312 to P-494; P-313 to P-494; 
Q-314 to P-494; S-315 to P-494; C-316 to P-494; T-317 to P-494; L-318 to P-494; S-319 to P-494; 
A-320 to P-494; E-321 to P-494; E-322 to P-494; S-323 to P-494; I-324 to P-494; R-325 to P-494; 
K-326 to P-494; C-327 to P-494; Q-328 to P-494; G-329 to P-494; S-330 to P-494; S-331 to P-494; 
D-332 to P-494; L-333 to P-494; P-334 to P-494; C-335 to P-494; S-336 to P-494; G-337 to P-494; 
R-338 to P-494; G-339 to P-494; K-340 to P-494; C-341 to P-494; E-342 to P-494; C-343 to P-494; 
G-344 to P-494; K-345 to P-494; C-346 to P-494; T-347 to P-494; C-348 to P-494; Y-349 to P-494; 
P-350 to P-494; P-351 to P-494; G-352 to P-494; D-353 to P-494; R-354 to P-494; R-355 to P-494; 
V-356 to P-494; Y-357 to P-494; G-358 to P-494; K-359 to P-494; T-360 to P-494; C-361 to P-494; 
E-362 to P-494; C-363 to P-494; D-364 to P-494; D-365 to P-494; R-366 to P-494; R-367 to P-494; 
C-368 to P-494; E-369 to P-494; D-370 to P-494; L-371 to P-494; D-372 to P-494; G-373 to P-494; 
V-374 to P-494; V-375 to P-494; C-376 to P-494; G-377 to P-494; G-378 to P-494; H-379 to P-494; 
G-380 to P-494; T-381 to P-494; C-382 to P-494; S-383 to P-494; C-384 to P-494; G-385 to P-494; 
R-386 to P-494; C-387 to P-494; V-388 to P-494; C-389 to P-494; E-390 to P-494; R-391 to P-494; 
G-392 to P-494; W-393 to P-494; F-394 to P-494; G-395 to P-494; K-396 to P-494; L-397 to P-494; 
C-398 to P-494; Q-399 to P-494; H-400 to P-494; P-401 to P-494; R-402 to P-494; K-403 to P-494; 
C-404 to P-494; N-405 to P-494; M-406 to P-494; T-407 to P-494; E-408 to P-494; E-409 to P-494; 
Q-410 to P-494; S-411 to P-494; K-412 to P-494; N-413 to P-494; L-414 to P-494; C-415 to P-494; 
E-416 to P-494; S-417 to P-494; A-418 to P-494; D-419 to P-494; G-420 to P-494; I-421 to P-494; 
L-422 to P-494; C-423 to P-494; S-424 to P-494; G-425 to P-494; K-426 to P-494; G-427 to P-494; 
S-428 to P-494; C-429 to P-494; H-430 to P-494; C-431 to P-494; G-432 to P-494; K-433 to P-494; 
C-434 to P-494; I-435 to P-494; C-436 to P-494; S-437 to P-494; A-438 to P-494; E-439 to P-494; 
E-440 to P-494; W-441 to P-494; Y-442 to P-494; I-443 to P-494; S-444 to P-494; G-445 to P-494; 
E-446 to P-494; F-447 to P-494; C-448 to P-494; D-449 to P-494; C-450 to P-494; D-451 to P-494; 
D-452 to P-494; R-453 to P-494; D-454 to P-494; C-455 to P-494; D-456 to P-494; K-457 to P-494; 
H-458 to P-494; D-459 to P-494; G-460 to P-494; L-461 to P-494; I-462 to P-494; C-463 to P-494; 
T-464 to P-494; G-465 to P-494; N-466 to P-494; G-467 to P-494; I-468 to P-494; C-469 to P-494; 
S-470 to P-494; C-471 to P-494; G-472 to P-494; N-473 to P-494; C-474 to P-494; E-475 to P-494; 
C-476 to P-494; W-477 to P-494; D-478 to P-494; G-479 to P-494; W-480 to P-494; N-481 to P-494; 
G-482 to P-494; N-483 to P-494; A-484 to P-494; C-485 to P-494; E-486 to P-494; I-487 to P-494; 
W-488 to P-494; L-489 to P-494; of SEQ 
ID NO:30. Polypeptides encoded by these polynucleotides are also encompassed by the 
invention. 
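
The general formula for N-terminal deletions lends itself to mechanical enumeration. The following minimal sketch, offered for illustration only, generates labels in the same "X-n to P-494" style from a sequence string. The function name is hypothetical, and the ten-residue string is only the first ten residues as they can be read from the list above; the full 494-residue sequence of SEQ ID NO:30 is not reproduced here.

```python
from typing import List


def n_terminal_deletions(seq: str, last_start: int) -> List[str]:
    """Label fragments n..len(seq) for n = 2..last_start (1-based positions)."""
    end = len(seq)
    if not 2 <= last_start <= end:
        raise ValueError("last_start must lie between 2 and the sequence length")
    c_term = f"{seq[end - 1]}-{end}"  # the unchanged C-terminal residue
    return [f"{seq[n - 1]}-{n} to {c_term}" for n in range(2, last_start + 1)]


# Toy stand-in: residues 1-10 only, so the C-terminal label is L-10, not P-494.
toy_sequence = "MRPPGFRNFL"
print(n_terminal_deletions(toy_sequence, last_start=4))
```

With the full sequence and `last_start=489`, the same call would yield 488 labels, one for each value of n from 2 to 489.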

Also as mentioned above, even if deletion of one or more amino acids from the 
C-terminus of a protein results in modification or loss of one or more biological 
functions of the protein, other functional activities may still be retained. For example, the 
ability of the shortened TIDE mutein to induce and/or bind to antibodies which recognize 
the complete or mature forms of the polypeptide generally will be retained when less 
than the majority of the residues of the complete or mature polypeptide are removed 
from the C-terminus. Whether a particular polypeptide lacking C-terminal residues of a 
complete polypeptide retains such immunologic activities can readily be determined by 
routine methods described herein and otherwise known in the art. It is not unlikely that 
a TIDE mutein with a large number of deleted C-terminal amino acid residues may 
retain some biological or immunogenic activities. In fact, peptides composed of as few as 
six TIDE amino acid residues may often evoke an immune response. 

Accordingly, the present invention further provides polypeptides having one or 
more residues deleted from the carboxy terminus of the amino acid sequence of the TIDE 
polypeptide shown in Figures 4A-C, up to the phenylalanine residue at position number 
6, and polynucleotides encoding such polypeptides. In particular, the present invention 
provides polypeptides comprising the amino acid sequence of residues 1-m1 of Figure 1, 
where m1 is an integer from 6 to 494 corresponding to the position of the amino acid 
residue in Figures 4A-C. Moreover, the invention provides polynucleotides encoding 
polypeptides comprising, or alternatively consisting of, the amino acid sequence of 
C-terminal deletions of the TIDE polypeptide. C-terminal deletions of the TIDE 
polypeptide of the invention shown as SEQ ID NO:30 
include polypeptides comprising the amino acid sequence of residues: 
M-1 to Y-493; M-1 to E-492; M-1 to S-491; M-1 to G-490; M-1 to L-489; M-1 to W-488; 
M-1 to I-487; M-1 to E-486; M-1 to C-485; M-1 to A-484; M-1 to N-483; M-1 to G-482; 
M-1 to N-481; M-1 to W-480; M-1 to G-479; M-1 to D-478; M-1 to W-477; M-1 to C-476; 
M-1 to E-475; M-1 to C-474; M-1 to N-473; M-1 to G-472; M-1 to C-471; M-1 to S-470; 
M-1 to C-469; M-1 to I-468; M-1 to G-467; M-1 to N-466; M-1 to G-465; M-1 to T-464; 
M-1 to C-463; M-1 to I-462; M-1 to L-461; M-1 to G-460; M-1 to D-459; M-1 to H-458; 
M-1 to K-457; M-1 to D-456; M-1 to C-455; M-1 to D-454; M-1 to R-453; M-1 to D-452; 
M-1 to D-451; M-1 to C-450; M-1 to D-449; M-1 to C-448; M-1 to F-447; M-1 to E-446; 
M-1 to G-445; M-1 to S-444; M-1 to I-443; M-1 to Y-442; M-1 to W-441; M-1 to E-440; 
M-1 to E-439; M-1 to A-438; M-1 to S-437; M-1 to C-436; M-1 to I-435; M-1 to C-434; 
M-1 to K-433; M-1 to G-432; M-1 to C-431; M-1 to H-430; M-1 to C-429; M-1 to S-428; 
M-1 to G-427; M-1 to K-426; M-1 to G-425; M-1 to S-424; M-1 to C-423; M-1 to L-422; 
M-1 to I-421; M-1 to G-420; M-1 to D-419; M-1 to A-418; M-1 to S-417; M-1 to E-416; 
M-1 to C-415; M-1 to L-414; M-1 to N-413; M-1 to K-412; M-1 to S-411; M-1 to Q-410; 
M-1 to E-409; M-1 to E-408; M-1 to T-407; M-1 to M-406; M-1 to N-405; M-1 to C-404; 
M-1 to K-403; M-1 to R-402; M-1 to P-401; M-1 to H-400; M-1 to Q-399; M-1 to C-398; 
M-1 to L-397; M-1 to K-396; M-1 to G-395; M-1 to F-394; M-1 to W-393; M-1 to G-392; 
M-1 to R-391; M-1 to E-390; M-1 to C-389; M-1 to V-388; M-1 to C-387; M-1 to R-386; 
M-1 to G-385; M-1 to C-384; M-1 to S-383; M-1 to C-382; M-1 to T-381; M-1 to G-380; 
M-1 to H-379; M-1 to G-378; M-1 to G-377; M-1 to C-376; M-1 to V-375; M-1 to V-374; 
M-1 to G-373; M-1 to D-372; M-1 to L-371; M-1 to D-370; M-1 to E-369; M-1 to C-368; 
M-1 to R-367; M-1 to R-366; M-1 to D-365; M-1 to D-364; M-1 to C-363; M-1 to E-362; 
M-1 to C-361; M-1 to T-360; M-1 to K-359; M-1 to G-358; M-1 to Y-357; M-1 to V-356; 
M-1 to R-355; M-1 to R-354; M-1 to D-353; M-1 to G-352; M-1 to P-351; M-1 to P-350; 
M-1 to Y-349; M-1 to C-348; M-1 to T-347; M-1 to C-346; M-1 to K-345; M-1 to G-344; 
M-1 to C-343; M-1 to E-342; M-1 to C-341; M-1 to K-340; M-1 to G-339; M-1 to R-338; 
M-1 to G-337; M-1 to S-336; M-1 to C-335; M-1 to P-334; M-1 to L-333; M-1 to D-332; 
M-1 to S-331; M-1 to S-330; M-1 to G-329; M-1 to Q-328; M-1 to C-327; M-1 to K-326; 
M-1 to R-325; M-1 to I-324; M-1 to S-323; M-1 to E-322; M-1 to E-321; M-1 to A-320; 
M-1 to S-319; M-1 to L-318; M-1 to T-317; M-1 to C-316; M-1 to S-315; M-1 to Q-314; 
M-1 to P-313; M-1 to H-312; M-1 to E-311; M-1 to C-310; M-1 to K-309; M-1 to K-308; 
M-1 to G-307; M-1 to Y-306; M-1 to W-305; M-1 to G-304; M-1 to A-303; M-1 to K-302; 
M-1 to C-301; M-1 to D-300; M-1 to C-299; M-1 to R-298; M-1 to G-297; M-1 to C-296; 
M-1 to N-295; M-1 to C-294; M-1 to Q-293; M-1 to G-292; M-1 to H-291; M-1 to G-290; 
M-1 to S-289; M-1 to C-288; M-1 to F-287; M-1 to D-286; M-1 to D-285; M-1 to S-284; 
M-1 to Y-283; M-1 to R-282; M-1 to D-281; M-1 to Y-280; M-1 to V-279; M-1 to A-278; 
M-1 to R-277; M-1 to C-276; M-1 to D-275; M-1 to R-274; M-1 to E-273; M-1 to D-272; 
M-1 to C-271; M-1 to E-270; M-1 to C-269; M-1 to T-268; M-1 to D-267; M-1 to G-266; 
M-1 to H-265; M-1 to I-264; M-1 to D-263; M-1 to G-262; M-1 to W-261; M-1 to D-260; 
M-1 to G-259; M-1 to T-258; M-1 to P-257; M-1 to D-256; M-1 to V-255; M-1 to D-254; 
M-1 to H-253; M-1 to C-252; M-1 to T-251; M-1 to C-250; M-1 to E-249; M-1 to G-248; 
M-1 to C-247; M-1 to V-246; M-1 to C-245; M-1 to T-244; M-1 to G-243; M-1 to R-242; 
M-1 to N-241; M-1 to S-240; M-1 to C-239; M-1 to I-238; M-1 to K-237; M-1 to G-236; 
M-1 to D-235; M-1 to P-234; M-1 to S-233; M-1 to T-232; M-1 to C-231; M-1 to R-230; 
M-1 to R-229; M-1 to K-228; M-1 to S-227; M-1 to E-226; M-1 to W-225; M-1 to P-224; 
M-1 to T-223; M-1 to I-222; M-1 to D-221; M-1 to C-220; M-1 to Q-219; M-1 to F-218; 
M-1 to E-217; M-1 to C-216; M-1 to K-215; M-1 to D-214; M-1 to G-213; M-1 to H-212; 
M-1 to W-211; M-1 to G-210; M-1 to A-209; M-1 to K-208; M-1 to C-207; M-1 to Y-206; 
M-1 to C-205; M-1 to N-204; M-1 to G-203; M-1 to C-202; M-1 to Y-201; M-1 to C-200; 
M-1 to K-199; M-1 to G-198; M-1 to H-197; M-1 to G-196; M-1 to G-195; M-1 to C-194; 
M-1 to I-193; M-1 to E-192; M-1 to E-191; M-1 to T-190; M-1 to E-189; M-1 to D-188; 
M-1 to D-187; M-1 to I-186; M-1 to C-185; M-1 to E-184; M-1 to R-183; M-1 to D-182; 
M-1 to D-181; M-1 to C-180; M-1 to E-179; M-1 to C-178; M-1 to F-177; M-1 to K-176; 
M-1 to G-175; M-1 to Y-174; M-1 to V-173; M-1 to L-172; M-1 to G-171; M-1 to S-170; 
M-1 to G-169; M-1 to D-168; M-1 to S-167; M-1 to N-166; M-1 to D-165; M-1 to C-164; 
M-1 to K-163; M-1 to C-162; M-1 to R-161; M-1 to G-160; M-1 to C-159; M-1 to H-158; 
M-1 to C-157; M-1 to T-156; M-1 to G-155; M-1 to A-154; M-1 to N-153; M-1 to S-152; 
M-1 to C-151; M-1 to I-150; M-1 to I-149; M-1 to D-148; M-1 to Q-147; M-1 to S-146; 
M-1 to N-145; M-1 to K-144; M-1 to C-143; M-1 to M-142; M-1 to Q-141; M-1 to N-140; 
M-1 to S-139; M-1 to K-138; M-1 to K-137; M-1 to K-136; M-1 to T-135; M-1 to L-134; 
M-1 to D-133; M-1 to C-132; M-1 to N-131; M-1 to T-130; M-1 to P-129; M-1 to Y-128; 
M-1 to Q-127; M-1 to C-126; M-1 to A-125; M-1 to D-124; M-1 to G-123; M-1 to Y-122; 
M-1 to W-121; M-1 to G-120; M-1 to Q-119; M-1 to D-118; M-1 to C-117; M-1 to K-116; 
M-1 to C-115; M-1 to K-114; M-1 to G-113; M-1 to C-112; M-1 to D-111; M-1 to C-110; 
M-1 to K-109; M-1 to G-108; M-1 to H-107; M-1 to G-106; M-1 to A-105; M-1 to C-104; 
M-1 to T-103; M-1 to S-102; M-1 to G-101; M-1 to D-100; M-1 to Y-99; M-1 to T-98; 
M-1 to E-97; M-1 to C-96; M-1 to V-95; M-1 to W-94; M-1 to E-93; M-1 to H-92; 
M-1 to C-91; M-1 to E-90; M-1 to C-89; M-1 to L-88; M-1 to P-87; M-1 to G-86; 
M-1 to F-85; M-1 to F-84; M-1 to M-83; M-1 to G-82; M-1 to P-81; M-1 to E-80; 
M-1 to T-79; M-1 to V-78; M-1 to H-77; M-1 to C-76; M-1 to I-75; M-1 to C-74; 
M-1 to V-73; M-1 to G-72; M-1 to C-71; M-1 to D-70; M-1 to C-69; M-1 to R-68; 
M-1 to G-67; M-1 to R-66; M-1 to G-65; M-1 to H-64; M-1 to C-63; M-1 to L-62; 
M-1 to A-61; M-1 to A-60; M-1 to G-59; M-1 to P-58; M-1 to P-57; M-1 to Q-56; 
M-1 to G-55; M-1 to P-54; M-1 to A-53; M-1 to R-52; M-1 to C-51; M-1 to R-50; 
M-1 to R-49; M-1 to E-48; M-1 to S-47; M-1 to E-46; M-1 to A-45; M-1 to R-44; 
M-1 to S-43; M-1 to L-42; M-1 to R-41; M-1 to C-40; M-1 to A-39; M-1 to A-38; 
M-1 to G-37; M-1 to P-36; M-1 to W-35; M-1 to S-34; M-1 to R-33; M-1 to L-32; 
M-1 to S-31; M-1 to P-30; M-1 to S-29; M-1 to F-28; M-1 to S-27; M-1 to Q-26; 
M-1 to P-25; M-1 to V-24; M-1 to A-23; M-1 to S-22; M-1 to L-21; M-1 to G-20; 
M-1 to A-19; M-1 to F-18; M-1 to L-17; M-1 to L-16; M-1 to S-15; M-1 to S-14; 
M-1 to A-13; M-1 to L-12; M-1 to L-11; M-1 to L-10; M-1 to F-9; M-1 to N-8; 
M-1 to R-7; M-1 to F-6; of SEQ ID NO:30. Polypeptides encoded by these polynucleotides are 
also encompassed by the invention. 
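
The C-terminal deletion series can be enumerated in the same mechanical way. The sketch below is illustrative only: the function name is hypothetical, and the ten-residue string is merely the first ten residues as they can be read from the lists above, not the full sequence of SEQ ID NO:30. The lower bound of 6 mirrors the statement that peptides of as few as six residues may evoke an immune response.

```python
from typing import List


def c_terminal_deletions(seq: str, first_end: int = 6) -> List[str]:
    """Label fragments 1..m for m = len(seq) - 1 down to first_end (1-based)."""
    if not 1 <= first_end < len(seq):
        raise ValueError("first_end must be at least 1 and below the sequence length")
    n_term = f"{seq[0]}-1"  # the unchanged N-terminal residue
    return [f"{n_term} to {seq[m - 1]}-{m}"
            for m in range(len(seq) - 1, first_end - 1, -1)]


# Toy stand-in: residues 1-10 only.
toy_sequence = "MRPPGFRNFL"
print(c_terminal_deletions(toy_sequence))
```

Applied to the full 494-residue sequence, the default lower bound would give 488 labels, running from m = 493 down to m = 6.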

In addition, the invention provides nucleic acid molecules having nucleotide 
sequences related to extensive portions of SEQ ID NO: 12 which have been determined 
from the following related cDNA genes: HLHFV34R (SEQ ID NO:68), HSRDA85R 
(SEQ ID NO:69), HSRAZ62R (SEQ ID NO:70), HSRDA17R (SEQ ID NO:71), and 
HSLEC45R (SEQ ID NO:72). 

Based on the sequence similarity to the human integrin beta-8 subunit, the translation 
product of this gene is expected to share at least some biological activities with integrin 
proteins, and specifically the human integrin beta-8 subunit. Such activities are known in 
the art, some of which are described elsewhere herein. 

Specifically, polynucleotides and polypeptides of the invention are also useful for 
modulating the differentiation of normal and malignant cells, modulating the 
proliferation and/or differentiation of cancer and neoplastic cells, and modulating the 
immune response. Polynucleotides and polypeptides of the invention may represent a 
diagnostic marker for hematopoietic and immune diseases and/or disorders. The 
full-length protein should be a secreted protein, based upon homology to the integrin family. 
Therefore, it is secreted into serum, urine, or feces, and thus its levels are assayable from 
patient samples. Assuming specific expression levels are reflective of the presence of 
immune disorders, this protein would provide a convenient diagnostic for early detection. 
In addition, expression of this gene product may also be linked to the progression of 
immune diseases, and therefore may itself actually represent a therapeutic or therapeutic 
target for the treatment of cancer. Polynucleotides and polypeptides of the invention may 
play an important role in the pathogenesis of human cancers and cellular transformation, 
particularly those of the immune and hematopoietic systems. Polynucleotides and 
polypeptides of the invention may also be involved in the pathogenesis of developmental 
abnormalities based upon their potential effects on proliferation and differentiation of cells 
and tissue cell types. Due to the potential proliferating and differentiating activity of said 
polynucleotides and polypeptides, the invention is useful as a therapeutic agent in 
inducing tissue regeneration, for treating inflammatory conditions (e.g., inflammatory 
bowel syndrome, diverticulitis, etc.). Moreover, the invention is useful in modulating the 
immune response to aberrant polypeptides, as may exist in rapidly proliferating cells and 
tissue cell types, particularly in adenocarcinoma cells, and other cancers. 

Alternatively, the expression within cellular sources marked by proliferating cells 
indicates this protein may play a role in the regulation of cellular division, and may show 
utility in the diagnosis, treatment, and/or prevention of developmental diseases and 
disorders, including cancer, and other proliferative conditions. Representative uses are 
described in the "Hyperproliferative Disorders" and "Regeneration" sections below and 
elsewhere herein. Briefly, developmental tissues rely on decisions involving cell 
differentiation and/or apoptosis in pattern formation. 

Dysregulation of apoptosis can result in inappropriate suppression of cell death, 
as occurs in the development of some cancers, or in failure to control the extent of cell 
death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). 

Alternatively, this gene product is involved in the pattern of cellular proliferation 
that accompanies early embryogenesis. Thus, aberrant expression of this gene product in 
tissues - particularly adult tissues - may correlate with patterns of abnormal cellular 
proliferation, such as found in various cancers. Because of potential roles in proliferation 
and differentiation, this gene product may have applications in the adult for tissue 
regeneration and the treatment of cancers. It may also act as a morphogen to control cell 
and tissue type specification. Therefore, the polynucleotides and polypeptides of the 
present invention are useful in treating, detecting, and/or preventing said disorders and 
conditions, in addition to other types of degenerative conditions. Thus this protein may 
modulate apoptosis or tissue differentiation and is useful in the detection, treatment, 
and/or prevention of degenerative or proliferative conditions and diseases. The protein is 
useful in modulating the immune response to aberrant polypeptides, as may exist in 
proliferating and cancerous cells and tissues. The protein can also be used to gain new 
insight into the regulation of cellular growth and proliferation. Furthermore, the protein 
may also be used to determine biological activity, to raise antibodies, as a tissue marker, 
to isolate cognate ligands or receptors, and to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. The protein, as well as 
antibodies directed against the protein, may show utility as a tumor marker and/or 
immunotherapy target for the above listed tissues. 

The translation product of this gene, sometimes referred to herein as TIDE (for 
Ten Integrin Domains with EGF homology), shares sequence homology with integrins, 
which are a superfamily of dimeric αβ cell-surface glycoproteins that mediate the 
adhesive functions of many cell types, enabling cells to interact with one another and 
with the extracellular matrix (See Genomics 56, 169-178 (1999); all information and 
references contained within this publication are hereby incorporated herein by reference). 

The gene encoding the disclosed cDNA is believed to reside on chromosome 13, 
at locus 13q33. Accordingly, polynucleotides related to this invention are useful as a 
marker in linkage analysis for chromosome 13, generally, and particularly at locus 
13q33. 

This gene is expressed primarily in synovial hypoxia tissue, osteoblast and 
osteoclast, bone marrow stromal cells, and to a lesser extent in umbilical vein, smooth 
muscle, placenta, and fetal lung cDNA libraries. Therefore, polynucleotides and 
polypeptides of the invention are useful as reagents for differential identification of the 
tissue(s) or cell type(s) present in a biological sample and for diagnosis of diseases and 
conditions which include, but are not limited to, disorders of bone and connective tissues, 
immune and hematopoietic diseases and/or disorders, vascular disorders, and other 
disorders involving aberrations in cell-surface interactions. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the connective tissue and skeletal system, 
expression of this gene at significantly higher or lower levels is routinely detected in 
certain tissues or cell types (e.g. cartilage, bone, vascular, hypoxic tissue, and cancerous 
and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid 
and spinal fluid) or another tissue or cell sample taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic epitopes 
shown in SEQ ID NO:30 as residues: Met-1 to Phe-6, Arg-44 to Arg-52, His-64 to 
Cys-69, Tyr-99 to Gln-147, His-158 to Gly-169, Phe-177 to Asp-182, Cys-194 to Cys-202, 
Gly-213 to Phe-218, Pro-224 to Gly-236, Asp-254 to Trp-261, Asp-263 to Ala-303, 
Trp-305 to Cys-316, Lys-326 to Asp-332, Pro-334 to Cys-343, Pro-350 to Asp-370, 
Thr-407 to Asn-413, Gly-425 to Cys-431, Asp-449 to Asp-459, and Gly-472 to Asn-483. 
Polynucleotides encoding said polypeptides are also provided. 
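
For illustration, the epitope boundaries listed above can be held as residue ranges and queried, for example to see which listed epitopes lie wholly within a given deletion fragment. This is a minimal sketch: the names `EPITOPES`, `epitope_containing`, and `epitopes_within` are hypothetical, and the ranges are simply transcribed from the list above.

```python
from typing import List, Optional, Tuple

# 1-based inclusive residue ranges transcribed from the epitope list above.
EPITOPES: List[Tuple[int, int]] = [
    (1, 6), (44, 52), (64, 69), (99, 147), (158, 169), (177, 182),
    (194, 202), (213, 218), (224, 236), (254, 261), (263, 303),
    (305, 316), (326, 332), (334, 343), (350, 370), (407, 413),
    (425, 431), (449, 459), (472, 483),
]


def epitope_containing(position: int) -> Optional[Tuple[int, int]]:
    """Return the listed epitope range that contains position, if any."""
    for start, end in EPITOPES:
        if start <= position <= end:
            return (start, end)
    return None


def epitopes_within(frag_start: int, frag_end: int) -> List[Tuple[int, int]]:
    """Return the listed epitope ranges lying entirely inside a fragment."""
    return [(s, e) for s, e in EPITOPES if frag_start <= s and e <= frag_end]


print(epitope_containing(100))
print(epitopes_within(400, 494))
```

A query such as `epitopes_within(400, 494)` corresponds to asking which of the listed epitopes would still be intact in the N-terminal deletion fragment spanning residues 400 to 494.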

The tissue distribution and homology to the human integrin beta-8 subunit 
indicate that polynucleotides and polypeptides corresponding to this gene are useful for the 
diagnosis and treatment of a variety of immune system disorders. Representative uses are 
described in the "Immune Activity" and "Infectious Disease" sections below, in Examples 
11, 13, 14, 16, 18, 19, 20, and 27, and elsewhere herein. 

Briefly, the expression indicates a role in regulating the proliferation; survival; 
differentiation; and/or activation of hematopoietic cell lineages, including blood stem 
cells. Involvement in the regulation of cytokine production, antigen presentation, or other 
processes indicates a usefulness for treatment of cancer (e.g. by boosting immune 
responses). Expression in cells of lymphoid origin indicates the natural gene product is 
involved in immune functions. Therefore it would also be useful as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such as 
AIDS, leukemia, rheumatoid arthritis, granulomatous disease, inflammatory bowel 
disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, such as 
T-cell mediated cytotoxicity; immune reactions to transplanted organs and tissues, such as 
host-versus-graft and graft-versus-host diseases, or autoimmunity disorders, such as 
autoimmune infertility, lens tissue injury, demyelination, systemic lupus erythematosus, 
drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's Disease, and 
scleroderma. Moreover, the protein may represent a secreted factor that influences the 
differentiation or behavior of other blood cells, or that recruits hematopoietic cells to 
sites of injury. Thus, this gene product is thought to be useful in the expansion of stem 
cells and committed progenitors of various blood lineages, and in the differentiation 
and/or proliferation of various cell types. Based upon the tissue distribution of this 
protein, antagonists directed against this protein are useful in blocking the activity of this 
protein. Accordingly, preferred are antibodies which specifically bind a portion of the 
translation product of this gene. 

Also provided is a kit for detecting tumors in which expression of this protein 
occurs. Such a kit comprises in one embodiment an antibody specific for the translation 
product of this gene bound to a solid support. Also provided is a method of detecting 
these tumors in an individual which comprises a step of contacting an antibody specific 
for the translation product of this gene to a bodily fluid from the individual, preferably 
serum, and ascertaining whether antibody binds to an antigen found in the bodily fluid. 
Preferably the antibody is bound to a solid support and the bodily fluid is serum. The 
above embodiments, as well as other treatments and diagnostic tests (kits and methods), 
are more particularly described elsewhere herein. 

Furthermore, the protein may also be used to determine biological activity, to raise 
antibodies, as a tissue marker, to isolate cognate ligands or receptors, and to identify agents 
that modulate their interactions, in addition to its use as a nutritional supplement. The 
protein, as well as antibodies directed against the protein, may show utility as a tumor 
marker and/or immunotherapy target for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
5 ID NO: 12 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of a- 
10 b, where a is any integer between 1 to 2485 of SEQ ID NO;12, b is an integer of 15 to 
2499, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO: 12, and where b is greater than or equal to a + 14. 
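
The general formula a-b recited above can be illustrated with a short sketch. The Python below is a minimal illustration only and forms no part of the specification; the function names `is_excluded_range` and `count_excluded_ranges` are hypothetical. It assumes, as the text states, a sequence of 2499 nucleotide positions, with a ranging from 1 to 2485, b ranging from 15 to 2499, and b greater than or equal to a + 14 (so that every a-b span is at least 15 nucleotides long).

```python
def is_excluded_range(a: int, b: int, seq_len: int = 2499, min_len: int = 15) -> bool:
    """Return True if positions a-b satisfy the general formula in the text."""
    max_a = seq_len - (min_len - 1)  # 2485 when seq_len is 2499
    return 1 <= a <= max_a and min_len <= b <= seq_len and b >= a + (min_len - 1)


def count_excluded_ranges(seq_len: int = 2499, min_len: int = 15) -> int:
    """Count every (a, b) pair permitted by the formula."""
    max_a = seq_len - (min_len - 1)
    # For a fixed a, b runs from a + min_len - 1 up to seq_len inclusive.
    return sum(seq_len - (a + min_len - 1) + 1 for a in range(1, max_a + 1))


if __name__ == "__main__":
    print(is_excluded_range(1, 15))    # shortest span at the 5' end
    print(count_excluded_ranges())     # number of a-b pairs for a 2499-nt sequence
```

The count grows quadratically with sequence length, which is consistent with the statement that listing every related sequence individually is cumbersome.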


FEATURES OF PROTEIN ENCODED BY GENE NO: 3 

The translation product of this gene shares sequence homology with RAMP3 
(receptor-activity-modifying protein 3), which another group has recently published and 
which is thought to be important in the transport of the calcitonin-receptor-like receptor 
(CRLR) to the plasma membrane. RAMPs regulate the transport and ligand specificity of 
the calcitonin-receptor-like receptor. There are two other related receptor-activity-modifying 
proteins, known as RAMP1 and RAMP2 (Nature 1998 May 
28;393(6683):333-9). RAMP1 is thought to present the receptor at the cell surface as a 
mature glycoprotein and a calcitonin-gene-related peptide (CGRP) receptor. 
Alternatively, RAMP2-transported receptors are core-glycosylated and are 
adrenomedullin receptors. CGRP (a 37-amino-acid neuropeptide) and its receptors are 
widely distributed in the body, and it is the most potent endogenous vasodilatory peptide 
discovered so far (Crit Rev Neurobiol 1997;11(2-3):167-239). Specific binding sites for 
adrenomedullin were present in every region of human brain (cerebral cortex, 
cerebellum, thalamus, hypothalamus, pons and medulla oblongata), suggesting that a 
novel neurotransmitter/neuromodulator role may exist for adrenomedullin in human 
brain (Peptides 1997;18(8):1125-9). 

Figures 7A-B show the nucleotide (SEQ ID NO: 13) and deduced amino acid 
sequence (SEQ ID NO:31) of the Intestine derived extracellular protein. Predicted amino 
acids from about 1 to about 27 constitute the predicted signal peptide (amino acid 
residues from about 1 to about 27 in SEQ ID NO:31) and are represented by the 
underlined amino acid regions; and amino acids from about 122 to about 138 constitute 
the predicted transmembrane domain (amino acid residues from about 122 to about 138 
in SEQ ID NO:31) and are represented by the double-underlined amino acids. 

Figure 8 shows the regions of similarity between the amino acid sequences of the 
Intestine derived extracellular protein SEQ ID NO:31, and the RAMP3 protein 
(gi|4587099) (SEQ ID NO: 75). 

Figure 9 shows an analysis of the amino acid sequence of SEQ ID NO: 31. 
Alpha, beta, turn and coil regions; hydrophilicity and hydrophobicity; 
amphipathic regions; flexible regions; antigenic index and surface probability are shown. 

Northern analysis indicates that a 1.4 kb transcript of this gene is primarily 
expressed in small intestine tissue, and to a lesser extent in colon and prostate tissue. 

The present invention provides isolated nucleic acid molecules comprising a 
polynucleotide encoding the polypeptide having the amino acid sequence shown in 
Figures 7A-B (SEQ ID NO:31), which was determined by sequencing a cloned cDNA 
(HTLEW81). The nucleotide sequence shown in Figures 7A-B (SEQ ID NO: 13) was 
obtained by sequencing a cloned cDNA (HTLEW81), which was deposited on Nov. 17, 
1998 at the American Type Culture Collection, and given Accession Number 203484. 
The deposited gene is inserted in the pSport plasmid (Life Technologies, Rockville, MD) 
using the SalI/NotI restriction endonuclease cleavage sites. 

The present invention is further directed to fragments of the isolated nucleic acid 
molecules described herein. By a fragment of an isolated DNA molecule having the 
nucleotide sequence of the deposited cDNA or the nucleotide sequence shown in SEQ ID 
NO: 13 is intended DNA fragments at least about 15 nt, and more preferably at least about 
20 nt, still more preferably at least about 30 nt, and even more preferably, at least about 
40 nt in length which are useful as diagnostic probes and primers as discussed herein. Of 
course, larger fragments 50-1500 nt in length are also useful according to the present 
invention, as are fragments corresponding to most, if not all, of the nucleotide sequence 
of the deposited cDNA or as shown in SEQ ID NO: 13. By a fragment at least 20 nt in 
length, for example, is intended fragments which include 20 or more contiguous bases 
from the nucleotide sequence of the deposited cDNA or the nucleotide sequence as 
shown in SEQ ID NO: 13. In this context "about" includes the particularly recited size, 
larger or smaller by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at both 
termini. 

Representative examples of polynucleotide fragments of the invention include, 
for example, fragments that comprise, or alternatively, consist of, a sequence from about 
nucleotide 1 to about 50, from about 51 to about 100, from about 101 to about 150, from 
about 151 to about 200, from about 201 to about 250, from about 251 to about 300, from 
about 301 to about 350, from about 351 to about 400, from about 401 to about 450, from 
about 451 to about 500, from about 501 to about 550, from about 551 to about 600, 
from about 601 to about 650, from about 651 to about 700, from about 701 to about 750, 
from about 751 to about 800, from about 801 to about 850, from about 851 to about 900, 
from about 901 to about 950, from about 951 to about 1000, from about 1001 to about 
1050, from about 1051 to about 1100, from about 1101 to about 1150, from about 1151 
to about 1200, from about 1201 to about 1250, from about 1251 to about 1300, and from 
about 1301 to about 1339 of SEQ ID NO: 13, or the complementary strand thereto, or the 
cDNA contained in the deposited gene. In this context "about" includes the particularly 
recited ranges, larger or smaller by several (5, 4, 3, 2, or 1) nucleotides, at either terminus 
or at both termini. In additional embodiments, the polynucleotides of the invention 
encode functional attributes of the corresponding protein. 
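
By way of illustration only, the representative 50-nucleotide fragment ranges recited above can be generated programmatically. The following Python is a sketch under stated assumptions: the helper names `fragment_windows` and `within_about` are hypothetical, positions are 1-based and inclusive, the sequence length of 1339 is taken from the text, and "about" is modeled as a tolerance of up to 5 nucleotides at a terminus, as the text defines it.

```python
def fragment_windows(seq_len: int, width: int = 50) -> list[tuple[int, int]]:
    """List 1-based, inclusive (start, end) windows that tile the sequence."""
    return [(start, min(start + width - 1, seq_len))
            for start in range(1, seq_len + 1, width)]


def within_about(position: int, recited: int, slack: int = 5) -> bool:
    """True if a terminus lies within `slack` nucleotides of the recited one."""
    return abs(position - recited) <= slack


if __name__ == "__main__":
    windows = fragment_windows(1339)
    print(windows[0], windows[-1], len(windows))
```

The final window is shorter than 50 nucleotides because the sequence length is not a multiple of the window width.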

Preferred embodiments of the invention in this regard include fragments that 
comprise alpha-helix and alpha-helix forming regions ("alpha-regions"), beta-sheet and 
beta-sheet forming regions ("beta-regions"), turn and turn-forming regions ("turn- 
regions"), coil and coil-forming regions ("coil-regions"), hydrophilic regions, 
hydrophobic regions, alpha amphipathic regions, beta amphipathic regions, flexible 
regions, surface-forming regions and high antigenic index regions. The data representing 
the structural or functional attributes of the protein set forth in Figure 9 and/or Table III, 
as described above, was generated using the various modules and algorithms of the 
DNA*STAR set on default parameters. In a preferred embodiment, the data presented in 
columns VIII, IX, XIII, and XIV of Table III can be used to determine regions of the 
protein which exhibit a high degree of potential for antigenicity. Regions of high 
antigenicity are determined from the data presented in columns VIII, IX, XIII, and/or 
XIV by choosing values which represent regions of the polypeptide which are likely to 
be exposed on the surface of the polypeptide in an environment in which antigen 
recognition may occur in the process of initiation of an immune response. 
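
The selection of regions from tabulated per-residue values can be sketched as follows. This Python is a hypothetical illustration and does not reproduce the DNA*STAR algorithms or their default parameters; the function name `high_score_regions`, the threshold value, and the example scores are all assumptions. The minimum run length of six residues is chosen only because the text notes that peptides of as few as six amino acid residues may evoke an immune response.

```python
def high_score_regions(scores, threshold, min_len=6):
    """Return 1-based inclusive (start, end) runs where every score >= threshold."""
    regions = []
    run_start = None
    for pos, score in enumerate(scores, start=1):
        if score >= threshold:
            if run_start is None:
                run_start = pos          # a new candidate run begins here
        else:
            if run_start is not None and pos - run_start >= min_len:
                regions.append((run_start, pos - 1))
            run_start = None
    # Close a run that extends to the last residue.
    if run_start is not None and len(scores) - run_start + 1 >= min_len:
        regions.append((run_start, len(scores)))
    return regions


if __name__ == "__main__":
    # Hypothetical per-residue values standing in for one column of a table.
    example = [0.1] * 4 + [1.0] * 6 + [0.2] * 3 + [0.9] * 2
    print(high_score_regions(example, threshold=0.5))
```

In practice several columns would be considered together; a single column is used here to keep the boundary logic visible.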

Certain preferred regions in these regards are set out in Figure 9, but may, as 
shown in Table III, be represented or identified by using tabular representations of the 
data presented in Figure 9. The DNA*STAR computer algorithm used to generate Figure 
9 (set on the original default parameters) was used to present the data in Figure 9 in a 
tabular format (See Table III). The tabular format of the data in Figure 9 is used to easily 
determine specific boundaries of a preferred region. The above-mentioned preferred 
regions set out in Figure 9 and in Table III include, but are not limited to, regions of the 
aforementioned types identified by analysis of the amino acid sequence set out in Figures 
7A-B. As set out in Figure 9 and in Table III, such preferred regions include Garnier-Robson 
alpha-regions, beta-regions, turn-regions, and coil-regions, Chou-Fasman alpha-regions, 
beta-regions, and turn-regions, Kyte-Doolittle hydrophilic regions and Hopp-Woods 
hydrophobic regions, Eisenberg alpha- and beta-amphipathic regions, Karplus-Schulz 
flexible regions, Jameson-Wolf regions of high antigenic index and Emini 
surface-forming regions. 

Even if deletion of one or more amino acids from the N-terminus 
of a protein results in modification or loss of one or more biological functions 
of the protein, other functional activities (e.g., biological activities, ability to 
multimerize, etc.) may still be retained. For example, the ability of shortened muteins to 
induce and/or bind to antibodies which recognize the complete or mature forms of the 
polypeptides generally will be retained when less than the majority of the residues of the 
complete or mature polypeptide are removed from the N-terminus. Whether a particular 
polypeptide lacking N-terminal residues of a complete polypeptide retains such 
immunologic activities can readily be determined by routine methods described herein 
and otherwise known in the art. It is not unlikely that a mutein with a large number of 
deleted N-terminal amino acid residues may retain some biological or immunogenic 
activities. In fact, peptides composed of as few as six amino acid residues may often 
evoke an immune response. 

Accordingly, the present invention further provides polypeptides having one or 
more residues deleted from the amino terminus of the amino acid sequence shown in 
Figures 7A-B, up to the arginine residue at position number 143 and polynucleotides 
encoding such polypeptides. In particular, the present invention provides polypeptides 
comprising the amino acid sequence of residues n1-148 of Figures 7A-B, where n1 is an 
integer from 2 to 143 corresponding to the position of the amino acid residue in Figures 
7A-B (which is identical to the sequence shown as SEQ ID NO:31). N-terminal deletions 
of the polypeptide of the invention shown as SEQ ID NO:31 include polypeptides 
comprising the amino acid sequence of residues: E-2 to L-148; T-3 to L-148; G-4 to L-148; 
A-5 to L-148; L-6 to L-148; R-7 to L-148; R-8 to L-148; P-9 to L-148; Q-10 to L-148; 
L-11 to L-148; L-12 to L-148; P-13 to L-148; L-14 to L-148; L-15 to L-148; L-16 to L-148; 
L-17 to L-148; L-18 to L-148; C-19 to L-148; G-20 to L-148; G-21 to L-148; C-22 to L-148; 
P-23 to L-148; R-24 to L-148; A-25 to L-148; G-26 to L-148; G-27 to L-148; C-28 to L-148; 
N-29 to L-148; E-30 to L-148; T-31 to L-148; G-32 to L-148; M-33 to L-148; L-34 to L-148; 
E-35 to L-148; R-36 to L-148; L-37 to L-148; P-38 to L-148; L-39 to L-148; C-40 to L-148; 
G-41 to L-148; K-42 to L-148; A-43 to L-148; F-44 to L-148; A-45 to L-148; D-46 to L-148; 
M-47 to L-148; M-48 to L-148; G-49 to L-148; K-50 to L-148; V-51 to L-148; D-52 to L-148; 
V-53 to L-148; W-54 to L-148; K-55 to L-148; W-56 to L-148; C-57 to L-148; N-58 to L-148; 
L-59 to L-148; S-60 to L-148; E-61 to L-148; F-62 to L-148; I-63 to L-148; V-64 to L-148; 
Y-65 to L-148; Y-66 to L-148; E-67 to L-148; S-68 to L-148; F-69 to L-148; T-70 to L-148; 
N-71 to L-148; C-72 to L-148; T-73 to L-148; E-74 to L-148; M-75 to L-148; E-76 to L-148; 
A-77 to L-148; N-78 to L-148; V-79 to L-148; V-80 to L-148; G-81 to L-148; C-82 to L-148; 
Y-83 to L-148; W-84 to L-148; P-85 to L-148; N-86 to L-148; P-87 to L-148; L-88 to L-148; 
A-89 to L-148; Q-90 to L-148; G-91 to L-148; F-92 to L-148; I-93 to L-148; T-94 to L-148; 
G-95 to L-148; I-96 to L-148; H-97 to L-148; R-98 to L-148; Q-99 to L-148; F-100 to L-148; 
F-101 to L-148; S-102 to L-148; N-103 to L-148; C-104 to L-148; T-105 to L-148; 
V-106 to L-148; D-107 to L-148; R-108 to L-148; V-109 to L-148; H-110 to L-148; 
L-111 to L-148; E-112 to L-148; D-113 to L-148; P-114 to L-148; P-115 to L-148; 
D-116 to L-148; E-117 to L-148; V-118 to L-148; L-119 to L-148; I-120 to L-148; 
P-121 to L-148; L-122 to L-148; I-123 to L-148; V-124 to L-148; I-125 to L-148; 
P-126 to L-148; V-127 to L-148; V-128 to L-148; L-129 to L-148; T-130 to L-148; 
V-131 to L-148; A-132 to L-148; M-133 to L-148; A-134 to L-148; G-135 to L-148; 
L-136 to L-148; V-137 to L-148; V-138 to L-148; W-139 to L-148; R-140 to L-148; 
S-141 to L-148; K-142 to L-148; R-143 to L-148; of SEQ 
ID NO:31. Polypeptides encoded by these polynucleotides are also encompassed by the 
invention. 

Also as mentioned above, even if deletion of one or more amino acids from the 
C-terminus of a protein results in modification or loss of one or more biological 
functions of the protein, other functional activities (e.g., biological activities (e.g., ability 
to elicit mitogenic activity, induce differentiation of normal or malignant cells, bind to 
EGF receptors, etc.)), may still be retained. For example, the ability to induce and/or bind 
to antibodies which recognize the complete or mature forms of the polypeptide generally 
will be retained when less than the majority of the residues of the complete or mature 
polypeptide are removed from the C-terminus. Whether a particular polypeptide lacking 
C-terminal residues of a complete polypeptide retains such immunologic activities can 
readily be determined by routine methods described herein and otherwise known in the 
art. It is not unlikely that a mutein with a large number of deleted C-terminal amino acid 
residues may retain some biological or immunogenic activities. In fact, peptides 
composed of as few as six amino acid residues may often evoke an immune response. 

Accordingly, the present invention further provides polypeptides having one or 
more residues deleted from the carboxy terminus of the amino acid sequence of the 
polypeptide shown in Figures 7A-B, up to the arginine residue at position number 7, and 
polynucleotides encoding such polypeptides. In particular, the present invention provides 
polypeptides comprising the amino acid sequence of residues 1-m1 of Figures 7A-B, 
where m1 is an integer from 7 to 147 corresponding to the position of the amino acid 
residue in Figures 7A-B. Moreover, the invention provides polynucleotides encoding 
polypeptides comprising, or alternatively consisting of, the amino acid sequence of 
C-terminal deletions of the polypeptide of the invention shown as SEQ ID NO:31, which 
include polypeptides comprising the amino acid sequence of residues: M-1 to L-147; 
M-1 to T-146; M-1 to D-145; M-1 to T-144; M-1 to R-143; M-1 to K-142; M-1 to S-141; 
M-1 to R-140; M-1 to W-139; M-1 to V-138; M-1 to V-137; M-1 to L-136; M-1 to G-135; 
M-1 to A-134; M-1 to M-133; M-1 to A-132; M-1 to V-131; M-1 to T-130; M-1 to L-129; 
M-1 to V-128; M-1 to V-127; M-1 to P-126; M-1 to I-125; M-1 to V-124; M-1 to I-123; 
M-1 to L-122; M-1 to P-121; M-1 to I-120; M-1 to L-119; M-1 to V-118; M-1 to E-117; 
M-1 to D-116; M-1 to P-115; M-1 to P-114; M-1 to D-113; M-1 to E-112; M-1 to L-111; 
M-1 to H-110; M-1 to V-109; M-1 to R-108; M-1 to D-107; M-1 to V-106; M-1 to T-105; 
M-1 to C-104; M-1 to N-103; M-1 to S-102; M-1 to F-101; M-1 to F-100; M-1 to Q-99; 
M-1 to R-98; M-1 to H-97; M-1 to I-96; M-1 to G-95; M-1 to T-94; M-1 to I-93; 
M-1 to F-92; M-1 to G-91; M-1 to Q-90; M-1 to A-89; M-1 to L-88; M-1 to P-87; 
M-1 to N-86; M-1 to P-85; M-1 to W-84; M-1 to Y-83; M-1 to C-82; M-1 to G-81; 
M-1 to V-80; M-1 to V-79; M-1 to N-78; M-1 to A-77; M-1 to E-76; M-1 to M-75; 
M-1 to E-74; M-1 to T-73; M-1 to C-72; M-1 to N-71; M-1 to T-70; M-1 to F-69; 
M-1 to S-68; M-1 to E-67; M-1 to Y-66; M-1 to Y-65; M-1 to V-64; M-1 to I-63; 
M-1 to F-62; M-1 to E-61; M-1 to S-60; M-1 to L-59; M-1 to N-58; M-1 to C-57; 
M-1 to W-56; M-1 to K-55; M-1 to W-54; M-1 to V-53; M-1 to D-52; M-1 to V-51; 
M-1 to K-50; M-1 to G-49; M-1 to M-48; M-1 to M-47; M-1 to D-46; M-1 to A-45; 
M-1 to F-44; M-1 to A-43; M-1 to K-42; M-1 to G-41; M-1 to C-40; M-1 to L-39; 
M-1 to P-38; M-1 to L-37; M-1 to R-36; M-1 to E-35; M-1 to L-34; M-1 to M-33; 
M-1 to G-32; M-1 to T-31; M-1 to E-30; M-1 to N-29; M-1 to C-28; M-1 to G-27; 
M-1 to G-26; M-1 to A-25; M-1 to R-24; M-1 to P-23; M-1 to C-22; M-1 to G-21; 
M-1 to G-20; M-1 to C-19; M-1 to L-18; M-1 to L-17; M-1 to L-16; M-1 to L-15; 
M-1 to L-14; M-1 to P-13; M-1 to L-12; M-1 to L-11; M-1 to Q-10; M-1 to P-9; 
M-1 to R-8; M-1 to R-7; of SEQ ID NO:31. Polypeptides encoded by these polynucleotides are also 
encompassed by the invention. 
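
The N-terminal (n1-148) and C-terminal (1-m1) deletion series recited above follow a simple pattern that can be sketched in code. The Python below is illustrative only: the function names are hypothetical, residues are numbered from 1 in one-letter code, and the ten-residue example string is merely the first ten residues recited in the lists above (M-1 through Q-10), truncated for brevity and not the full polypeptide.

```python
def n_terminal_deletions(seq: str, last_start: int) -> list[str]:
    """Label fragments n1..len(seq) for n1 = 2 up to last_start (1-based)."""
    end = len(seq)
    return [f"{seq[n1 - 1]}-{n1} to {seq[end - 1]}-{end}"
            for n1 in range(2, last_start + 1)]


def c_terminal_deletions(seq: str, first_end: int) -> list[str]:
    """Label fragments 1..m1 for m1 = len(seq) - 1 down to first_end (1-based)."""
    return [f"{seq[0]}-1 to {seq[m1 - 1]}-{m1}"
            for m1 in range(len(seq) - 1, first_end - 1, -1)]


if __name__ == "__main__":
    toy = "METGALRRPQ"  # residues 1-10 only; an illustrative truncation
    print("; ".join(n_terminal_deletions(toy, last_start=5)))
    print("; ".join(c_terminal_deletions(toy, first_end=7)))
```

Applied to a 148-residue sequence with `last_start=143` and `first_end=7`, the same functions would yield 142 N-terminal and 141 C-terminal labels, matching the integer ranges given for n1 and m1.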

In addition, the invention provides nucleic acid molecules having nucleotide 
sequences related to extensive portions of SEQ ID NO:31 which have been determined 
from the following related cDNA genes: HLHCH17RA (SEQ ID NO:76), HTOAT51R 
(SEQ ID NO:77), and/or HBNB041R (SEQ ID NO:78). 

The polypeptide of this gene has been determined to have a transmembrane 
domain at about amino acid position 122 - 138 of the amino acid sequence referenced in 
Table XIII for this gene. Moreover, a cytoplasmic tail encompassing amino acids 139 to 
149 of this protein has also been determined. Based upon these characteristics, it is 
believed that the protein product of this gene shares structural features with type Ia 
membrane proteins. 
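
As a rough illustration of why residues 122 to 138 are consistent with a membrane-spanning segment, the mean Kyte-Doolittle hydropathy of that stretch can be compared with that of the residues following it. The Python below is a sketch, not the method used to generate the predictions in this specification; the function name `mean_hydropathy` is hypothetical, the two segments are transcribed from the deletion lists above (which end at L-148, so the tail is taken here as residues 139 to 148), and the cutoff of 1.6 is a commonly cited rule of thumb rather than a parameter stated in the text.

```python
# Kyte-Doolittle hydropathy values, one-letter amino acid code.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

TM_SEGMENT = "LIVIPVVLTVAMAGLVV"  # residues 122-138 as recited in the lists
TAIL_SEGMENT = "WRSKRTDTLL"       # residues 139-148 as recited in the lists


def mean_hydropathy(segment: str) -> float:
    """Average Kyte-Doolittle hydropathy over a peptide segment."""
    return sum(KYTE_DOOLITTLE[aa] for aa in segment) / len(segment)


if __name__ == "__main__":
    print(round(mean_hydropathy(TM_SEGMENT), 2))
    print(round(mean_hydropathy(TAIL_SEGMENT), 2))
```

A strongly positive mean for the first segment and a negative mean for the charged tail would be in keeping with a single-pass membrane topology, although a full analysis would use a sliding window over the entire sequence.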

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are not 
limited to, gastrointestinal and neurodegenerative diseases and disorders. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the central nervous and 
gastrointestinal systems, expression of this gene at significantly higher or lower levels is 
routinely detected in certain tissues or cell types (e.g. brain, CNS, gastrointestinal, 
cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, 
synovial fluid and spinal fluid) or another tissue or cell sample taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the expression 
level in healthy tissue or bodily fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic epitopes 
shown in SEQ ID NO:31 as residues: Ala-5 to Gln-10, Pro-23 to Cys-28, Arg-140 to 
Asp-145. Polynucleotides encoding said polypeptides are also provided. 

The tissue distribution and homology to RAMP3 suggest that the translation 
product of this gene is useful for the detection/treatment of neurodegenerative disease 
states and behavioural disorders such as Alzheimer's Disease, Parkinson's Disease, 
Huntington's Disease, Tourette Syndrome, schizophrenia, mania, dementia, paranoia, 
obsessive compulsive disorder, panic disorder, learning disabilities, ALS, psychoses, 
autism, and altered behaviors, including disorders in feeding, sleep patterns, balance, and 
perception. In addition, the gene or gene product may also play a role in the treatment 
and/or detection of developmental disorders associated with the developing embryo. 

Alternatively, the tissue distribution in small intestine and colon tissues indicates 
that polynucleotides and polypeptides corresponding to this gene are useful for the 
diagnosis and/or treatment of disorders involving the small intestine. This may include 
diseases associated with digestion and food absorption, as well as hematopoietic 
disorders involving the Peyer's patches of the small intestine, or other hematopoietic cells 
and tissues within the body. Similarly, expression of this gene product in colon tissue 
indicates again involvement in digestion, processing, and elimination of food, as well as 
a potential role for this gene as a diagnostic marker or causative agent in the development 
of colon cancer, and cancer in general. The protein, as well as antibodies directed against the 
protein, may show utility as a tumor marker and/or immunotherapy target for the above 
listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO: 13 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of a- 
b, where a is any integer between 1 to 1325 of SEQ ID NO: 13, b is an integer of 15 to 
1339, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO: 13, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 4 

The translation product of this gene shares sequence homology with a 
proteoglycan from Gallus gallus, and this proteoglycan is believed to participate in the 
osteogenic processes of cartilage ossification (See Genbank Accession No. gi|222847). 
Based on the sequence similarity, the translation product of this gene is expected to 
share biological activities with the Gallus gallus proteoglycan polypeptide. 

Figures 10A-B show the nucleotide (SEQ ID NO: 14) and deduced amino acid 
sequence (SEQ ID NO:32) of the retinal specific protein. Predicted amino acids from 
about 1 to about 21 constitute the predicted signal peptide (amino acid residues from 
about 1 to about 21 in SEQ ID NO:32) and are represented by the underlined amino acid 
regions. 

Figure 11 shows the regions of similarity between the amino acid sequences of 
the retinal specific protein SEQ ID NO:32, and the Gallus gallus proteoglycan (SEQ ID 
NO:79). 

Figure 12 shows an analysis of the amino acid sequence of SEQ ID NO: 32. 
Alpha, beta, turn and coil regions; hydrophilicity and hydrophobicity; 
amphipathic regions; flexible regions; antigenic index and surface probability are shown. 

Northern analysis indicates that this gene is expressed in adrenal cortex and 
adrenal medulla tissues. This gene is also expressed in retinal tissue. 

The present invention provides isolated nucleic acid molecules comprising a 
polynucleotide encoding the polypeptide having the amino acid sequence shown in 
Figures 10A-B (SEQ ID NO:32), which was determined by sequencing a cloned cDNA 
(HARA044). The nucleotide sequence shown in Figures 10A-B (SEQ ID NO: 14) was 
obtained by sequencing a cloned cDNA (HARA044), which was deposited on Nov. 17, 
1998 at the American Type Culture Collection, and given Accession Number 203484. 
The deposited gene is inserted in the pSport plasmid (Life Technologies, Rockville, MD) 
using the SalI/NotI restriction endonuclease cleavage sites. 

The present invention is further directed to fragments of the isolated nucleic acid 
molecules described herein. By a fragment of an isolated DNA molecule having the 
nucleotide sequence of the deposited cDNA or the nucleotide sequence shown in SEQ ID 
NO: 14 is intended DNA fragments at least about 15 nt, and more preferably at least about 
20 nt, still more preferably at least about 30 nt, and even more preferably, at least about 
40 nt in length which are useful as diagnostic probes and primers as discussed herein. Of 
course, larger fragments 50-1500 nt in length are also useful according to the present 
invention, as are fragments corresponding to most, if not all, of the nucleotide sequence 
of the deposited cDNA or as shown in SEQ ID NO: 14. By a fragment at least 20 nt in 
length, for example, is intended fragments which include 20 or more contiguous bases 
from the nucleotide sequence of the deposited cDNA or the nucleotide sequence as 
shown in SEQ ID NO: 14. In this context "about" includes the particularly recited size, 
larger or smaller by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at both 
termini. 

Representative examples of polynucleotide fragments of the invention include, 
for example, fragments that comprise, or alternatively, consist of, a sequence from about 
nucleotide 1 to about 50, from about 51 to about 100, from about 101 to about 150, from 
about 151 to about 200, from about 201 to about 250, from about 251 to about 300, from 
about 301 to about 350, from about 351 to about 400, from about 401 to about 450, from 
about 451 to about 500, and from about 501 to about 550, and from about 551 to about 
600, from about 601 to about 650, from about 651 to about 700, from about 701 to about 
750, from about 751 to about 800, from about 801 to about 850, from about 851 to about 
900, from about 901 to about 950, from about 951 to about 1000, from about 1001 to 
about 1050, from about 1051 to about 1100, from about 1101 to about 1150, from about 
1151 to about 1200, from about 1201 to about 1250, from about 1251 to about 1300, 
from about 1301 to about 1350, from about 1351 to about 1389, and from about 187 to 
about 1119 of SEQ ID NO: 14, or the complementary strand thereto, or the cDNA 
contained in the deposited gene. In this context "about" includes the particularly recited 
ranges, larger or smaller by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at 
both termini. In additional embodiments, the polynucleotides of the invention encode 
functional attributes of the corresponding protein. 

Preferred embodiments of the invention in this regard include fragments that 
comprise alpha-helix and alpha-helix forming regions ("alpha-regions"), beta-sheet and 
beta-sheet forming regions ("beta-regions"), turn and turn-forming regions ("turn- 
regions"), coil and coil-forming regions ("coil-regions"), hydrophilic regions, 
hydrophobic regions, alpha amphipathic regions, beta amphipathic regions, flexible 
regions, surface-forming regions and high antigenic index regions. The data representing 
the structural or functional attributes of the protein set forth in Figure 12 and/or Table IV, 
as described above, was generated using the various modules and algorithms of the 
DNA*STAR set on default parameters. In a preferred embodiment, the data presented in 
columns VIII, IX, XIII, and XIV of Table IV can be used to determine regions of the 
protein which exhibit a high degree of potential for antigenicity. Regions of high 
antigenicity are determined from the data presented in columns VIII, IX, XIII, and/or 
XIV by choosing values which represent regions of the polypeptide which are likely to 
be exposed on the surface of the polypeptide in an environment in which antigen 
recognition may occur in the process of initiation of an immune response. 

Certain preferred regions in these regards are set out in Figure 12, but may, as 
shown in Table IV, be represented or identified by using tabular representations of the 
data presented in Figure 12. The DNA*STAR computer algorithm used to generate 
Figure 12 (set on the original default parameters) was used to present the data in Figure 
12 in a tabular format (See Table IV). The tabular format of the data in Figure 12 is used 
to easily determine specific boundaries of a preferred region. The above-mentioned 
preferred regions set out in Figure 12 and in Table IV include, but are not limited to, 
regions of the aforementioned types identified by analysis of the amino acid sequence set 
out in Figures 10A-B. As set out in Figure 12 and in Table IV, such preferred regions 
include Garnier-Robson alpha-regions, beta-regions, turn-regions, and coil-regions, 
Chou-Fasman alpha-regions, beta-regions, and turn-regions, Kyte-Doolittle hydrophilic 
regions and Hopp-Woods hydrophobic regions, Eisenberg alpha- and beta-amphipathic 
regions, Karplus-Schulz flexible regions, Jameson-Wolf regions of high antigenic index 
and Emini surface-forming regions. 

Even if deletion of one or more amino acids from the 
N-terminus of a protein results in modification or loss of one or more biological 
functions of the protein, other functional activities (e.g., biological activities, ability to 
multimerize, etc.) may still be retained. For example, the ability of shortened muteins to 
induce and/or bind to antibodies which recognize the complete or mature forms of the 
polypeptides generally will be retained when less than the majority of the residues of the 
complete or mature polypeptide are removed from the N-terminus. Whether a particular 
polypeptide lacking N-terminal residues of a complete polypeptide retains such 
immunologic activities can readily be determined by routine methods described herein 
and otherwise known in the art. It is not unlikely that a mutein with a large number of 
deleted N-terminal amino acid residues may retain some biological or immunogenic 
activities. In fact, peptides composed of as few as six amino acid residues may often 
evoke an immune response. 

Accordingly, the present invention further provides polypeptides having one or 
more residues deleted from the amino terminus of the amino acid sequence shown in 
Figures 10A-B, up to the proline residue at position number 327 and polynucleotides 
encoding such polypeptides. In particular, the present invention provides polypeptides 
comprising the amino acid sequence of residues n1-332 of Figures 10A-B, where n1 is an 
integer from 2 to 327 corresponding to the position of the amino acid residue in Figures 
10A-B (which is identical to the sequence shown as SEQ ID NO:32). N-terminal 
deletions of the polypeptide of the invention shown as SEQ ID NO:32 include 
polypeptides comprising the amino acid sequence of residues: R-2 to T-332; L-3 to T-332; 
L-4 to T-332; A-5 to T-332; F-6 to T-332; L-7 to T-332; S-8 to T-332; L-9 to T-332; 
L-10 to T-332; A-11 to T-332; L-12 to T-332; V-13 to T-332; L-14 to T-332; Q-15 to T-332; 
E-16 to T-332; T-17 to T-332; G-18 to T-332; T-19 to T-332; A-20 to T-332; S-21 to T-332; 
L-22 to T-332; P-23 to T-332; R-24 to T-332; K-25 to T-332; E-26 to T-332; R-27 to T-332; 
K-28 to T-332; R-29 to T-332; R-30 to T-332; E-31 to T-332; E-32 to T-332; Q-33 to T-332; 
M-34 to T-332; P-35 to T-332; R-36 to T-332; E-37 to T-332; G-38 to T-332; D-39 to T-332; 
S-40 to T-332; F-41 to T-332; E-42 to T-332; V-43 to T-332; L-44 to T-332; P-45 to T-332; 
L-46 to T-332; R-47 to T-332; N-48 to T-332; D-49 to T-332; V-50 to T-332; L-51 to T-332; 
N-52 to T-332; P-53 to T-332; D-54 to T-332; N-55 to T-332; Y-56 to T-332; G-57 to T-332; 
E-58 to T-332; V-59 to T-332; I-60 to T-332; D-61 to T-332; L-62 to T-332; S-63 to T-332; 
N-64 to T-332; Y-65 to T-332; E-66 to T-332; E-67 to T-332; L-68 to T-332; T-69 to T-332; 
D-70 to T-332; Y-71 to T-332; G-72 to T-332; D-73 to T-332; Q-74 to T-332; L-75 to T-332; 
P-76 to T-332; E-77 to T-332; V-78 to T-332; K-79 to T-332; V-80 to T-332; T-81 to T-332; 
S-82 to T-332; L-83 to T-332; A-84 to T-332; P-85 to T-332; A-86 to T-332; T-87 to T-332; 
S-88 to T-332; I-89 to T-332; S-90 to T-332; P-91 to T-332; A-92 to T-332; K-93 to T-332; 
S-94 to T-332; T-95 to T-332; T-96 to T-332; A-97 to T-332; P-98 to T-332; G-99 to T-332; 
T-100 to T-332; P-101 to T-332; S-102 to T-332; S-103 to T-332; N-104 to T-332; 
P-105 to T-332; T-106 to T-332; M-107 to T-332; T-108 to T-332; R-109 to T-332; 
P-110 to T-332; T-111 to T-332; T-112 to T-332; A-113 to T-332; G-114 to T-332; 
L-115 to T-332; L-116 to T-332; L-117 to T-332; S-118 to T-332; S-119 to T-332; 
Q-120 to T-332; P-121 to T-332; N-122 to T-332; H-123 to T-332; G-124 to T-332; 
L-125 to T-332; P-126 to T-332; T-127 to T-332; C-128 to T-332; L-129 to T-332; 
V-130 to T-332; C-131 to T-332; V-132 to T-332; C-133 to T-332; L-134 to T-332; 
G-135 to T-332; S-136 to T-332; S-137 to T-332; V-138 to T-332; Y-139 to T-332; 
C-140 to T-332; D-141 to T-332; D-142 to T-332; I-143 to T-332; D-144 to T-332; 
L-145 to T-332; E-146 to T-332; D-147 to T-332; I-148 to T-332; P-149 to T-332; 
P-150 to T-332; L-151 to T-332; P-152 to T-332; R-153 to T-332; R-154 to T-332; 
T-155 to T-332; A-156 to T-332; Y-157 to T-332; L-158 to T-332; Y-159 to T-332; 
A-160 to T-332; R-161 to T-332; F-162 to T-332; N-163 to T-332; R-164 to T-332; 
I-165 to T-332; S-166 to T-332; R-167 to T-332; I-168 to T-332; R-169 to T-332; 
A-170 to T-332; E-171 to T-332; D-172 to T-332; F-173 to T-332; K-174 to T-332; 
G-175 to T-332; L-176 to T-332; T-177 to T-332; K-178 to T-332; L-179 to T-332; 
K-180 to T-332; R-181 to T-332; I-182 to T-332; D-183 to T-332; L-184 to T-332; 
S-185 to T-332; N-186 to T-332; N-187 to T-332; L-188 to T-332; I-189 to T-332; 
S-190 to T-332; S-191 to T-332; I-192 to T-332; D-193 to T-332; N-194 to T-332; 
D-195 to T-332; A-196 to T-332; F-197 to T-332; R-198 to T-332; L-199 to T-332; 
L-200 to T-332; H-201 to T-332; A-202 to T-332; L-203 to T-332; Q-204 to T-332; 
D-205 to T-332; L-206 to T-332; I-207 to T-332; L-208 to T-332; P-209 to T-332; 
E-210 to T-332; N-211 to T-332; Q-212 to T-332; L-213 to T-332; E-214 to T-332; 
A-215 to T-332; L-216 to T-332; P-217 to T-332; V-218 to T-332; L-219 to T-332; 
P-220 to T-332; S-221 to T-332; G-222 to T-332; I-223 to T-332; E-224 to T-332; 
F-225 to T-332; L-226 to T-332; D-227 to T-332; V-228 to T-332; R-229 to T-332; 
L-230 to T-332; N-231 to T-332; R-232 to T-332; L-233 to T-332; Q-234 to T-332; 
S-235 to T-332; S-236 to T-332; G-237 to T-332; 
1-238 to T-332; Q-239 to T-332; P-240 to T-332; A-241 to T-332; A-242 to T-332; F- 
243 toT-332; R-244 to T-332; A-245 to T-332; M-246 to T-332; E-247 to T-332; K-248 
to T-332; L-249 to T-332; Q-250 toT-332; F-251 to T-332; L-252 to T-332; Y-253 to T- 
332; L-254 to T-332; S-255 to T-332; D-256 to T-332; N-257 toT-332; L-258 to T-332; 
L-259 to T-332; D-260 to T-332; S-261 to T-332; 1-262 to T-332; P-263 to T-332; G-264 
toT-332; P-265 to T-332; L-266 to T-332; P-267 to T-332; P-268 to T-332; S-269 to T- 
332; L-270 to T-332; R-271 toT-332; S-272 to T-332; V-273 to T-332; H-274 to T-332; 
L-275 to T-332; Q-276 to T-332; N-277 to T-332; N-278 toT-332; L-279 to T-332; 1-280 
to T-332; E-281 to T-332; T-282 to T-332; M-283 to T-332; Q-284 to T-332; R-285 toT- 
332; D-286 to T-332; V-287 to T-332; F-288 to T-332; C-289 to T-332; D-290 to T-332; 
P-291 to T-332; E-292 toT-332; E-293 to T-332; H-294 to T-332; K-295 to T-332; H- 
296 to T-332; T-297 to T-332; R-298 to T-332; R-299 toT-332; Q-300 to T-332; L-301 
to T-332; E-302 to T-332; D-303 to T-332; 1-304 to T-332; R-305 to T-332; L-306 toT- 
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332; D-307 to T-332; G-308 to T-332; N-309 to T-332; P-3 10 to T-332; 1-3 1 1 to T-332; 
N-312 to T-332; L-313 toT-332; S-314 to T-332; L-315 to T-332; F-316 to T-332; P-3 17 
to T-332; S-318 to T-332; A-319 to T-332; Y-320 toT-332; F-321 to T-332; C-322 to T- 
332; L-323 to T-332; P-324 to T-332; R-325 to T-332; L-326 to T-332; P-327 toT-332; 
of SEQ ID NO;32. Polypeptides encoded by these polynucleotides are also encompassed 
by the invention. 

Also as mentioned above, even if deletion of one or more amino acids from the 
C-terminus of a protein results in modification or loss of one or more biological 
functions of the protein, other functional activities (e.g., biological activities such as the
ability to elicit mitogenic activity, induce differentiation of normal or malignant cells, or
bind to EGF receptors) may still be retained. For example, the ability to induce and/or bind
to antibodies which recognize the complete or mature forms of the polypeptide generally 
will be retained when less than the majority of the residues of the complete or mature 
polypeptide are removed from the C-terminus. Whether a particular polypeptide lacking 
C-terminal residues of a complete polypeptide retains such immunologic activities can 
readily be determined by routine methods described herein and otherwise known in the 
art. It is not unlikely that a mutein with a large number of deleted C-terminal amino acid 
residues may retain some biological or immunogenic activities. In fact, peptides 
composed of as few as six amino acid residues may often evoke an immune response. 

Accordingly, the present invention further provides polypeptides having one or 
more residues deleted from the carboxy terminus of the amino acid sequence of the 
polypeptide shown in Figures 10A-B, up to the glutamine residue at position number 7, 
and polynucleotides encoding such polypeptides. In particular, the present invention 
provides polypeptides comprising the amino acid sequence of residues 1-m1 of Figures
10A-B, where m1 is an integer from 7 to 331 corresponding to the position of the amino
acid residue in Figures 10A-B. Moreover, the invention provides polynucleotides
encoding polypeptides comprising, or alternatively consisting of, the amino acid
sequence of C-terminal deletions of the polypeptide of the invention shown as SEQ ID
NO:32. Such C-terminal deletions include polypeptides comprising the amino acid
sequence of residues: M-1 to F-331; M-1 to R-330; M-1 to G-329; M-1 to I-328;
M-1 to P-327; M-1 to L-326; M-1 to R-325; M-1 to P-324; M-1 to L-323; M-1 to C-322;
M-1 to F-321; M-1 to Y-320; M-1 to A-319; M-1 to S-318; M-1 to P-317; M-1 to F-316;
M-1 to L-315; M-1 to S-314; M-1 to L-313; M-1 to N-312; M-1 to I-311; M-1 to P-310;
M-1 to N-309; M-1 to G-308; M-1 to D-307; M-1 to L-306; M-1 to R-305; M-1 to I-304;
M-1 to D-303; M-1 to E-302; M-1 to L-301; M-1 to Q-300; M-1 to R-299; M-1 to R-298;
M-1 to T-297; M-1 to H-296; M-1 to K-295; M-1 to H-294; M-1 to E-293; M-1 to E-292;
M-1 to P-291; M-1 to D-290; M-1 to C-289; M-1 to F-288; M-1 to V-287; M-1 to D-286;
M-1 to R-285; M-1 to Q-284; M-1 to M-283; M-1 to T-282; M-1 to E-281; M-1 to I-280;
M-1 to L-279; M-1 to N-278; M-1 to N-277; M-1 to Q-276; M-1 to L-275; M-1 to H-274;
M-1 to V-273; M-1 to S-272; M-1 to R-271; M-1 to L-270; M-1 to S-269; M-1 to P-268;
M-1 to P-267; M-1 to L-266; M-1 to P-265; M-1 to G-264; M-1 to P-263; M-1 to I-262;
M-1 to S-261; M-1 to D-260; M-1 to L-259; M-1 to L-258; M-1 to N-257; M-1 to D-256;
M-1 to S-255; M-1 to L-254; M-1 to Y-253; M-1 to L-252; M-1 to F-251; M-1 to Q-250;
M-1 to L-249; M-1 to K-248; M-1 to E-247; M-1 to M-246; M-1 to A-245; M-1 to R-244;
M-1 to F-243; M-1 to A-242; M-1 to A-241; M-1 to P-240; M-1 to Q-239; M-1 to I-238;
M-1 to G-237; M-1 to S-236; M-1 to S-235; M-1 to Q-234; M-1 to L-233; M-1 to R-232;
M-1 to N-231; M-1 to L-230; M-1 to R-229; M-1 to V-228; M-1 to D-227; M-1 to L-226;
M-1 to F-225; M-1 to E-224; M-1 to I-223; M-1 to G-222; M-1 to S-221; M-1 to P-220;
M-1 to L-219; M-1 to V-218; M-1 to P-217; M-1 to L-216; M-1 to A-215; M-1 to E-214;
M-1 to L-213; M-1 to Q-212; M-1 to N-211; M-1 to E-210; M-1 to P-209; M-1 to L-208;
M-1 to I-207; M-1 to L-206; M-1 to D-205; M-1 to Q-204; M-1 to L-203; M-1 to A-202;
M-1 to H-201; M-1 to L-200; M-1 to L-199; M-1 to R-198; M-1 to F-197; M-1 to A-196;
M-1 to D-195; M-1 to N-194; M-1 to D-193; M-1 to I-192; M-1 to S-191; M-1 to S-190;
M-1 to I-189; M-1 to L-188; M-1 to N-187; M-1 to N-186; M-1 to S-185; M-1 to L-184;
M-1 to D-183; M-1 to I-182; M-1 to R-181; M-1 to K-180; M-1 to L-179; M-1 to K-178;
M-1 to T-177; M-1 to L-176; M-1 to G-175; M-1 to K-174; M-1 to F-173; M-1 to D-172;
M-1 to E-171; M-1 to A-170; M-1 to R-169; M-1 to I-168; M-1 to R-167; M-1 to S-166;
M-1 to I-165; M-1 to R-164; M-1 to N-163; M-1 to F-162; M-1 to R-161; M-1 to A-160;
M-1 to Y-159; M-1 to L-158; M-1 to Y-157; M-1 to A-156; M-1 to T-155; M-1 to R-154;
M-1 to R-153; M-1 to P-152; M-1 to L-151; M-1 to P-150; M-1 to P-149; M-1 to I-148;
M-1 to D-147; M-1 to E-146; M-1 to L-145; M-1 to D-144; M-1 to I-143; M-1 to D-142;
M-1 to D-141; M-1 to C-140; M-1 to Y-139; M-1 to V-138; M-1 to S-137; M-1 to S-136;
M-1 to G-135; M-1 to L-134; M-1 to C-133; M-1 to V-132; M-1 to C-131; M-1 to V-130;
M-1 to L-129; M-1 to C-128; M-1 to T-127; M-1 to P-126; M-1 to L-125; M-1 to G-124;
M-1 to H-123; M-1 to N-122; M-1 to P-121; M-1 to Q-120; M-1 to S-119; M-1 to S-118;
M-1 to L-117; M-1 to L-116; M-1 to L-115; M-1 to G-114; M-1 to A-113; M-1 to T-112;
M-1 to T-111; M-1 to P-110; M-1 to R-109; M-1 to T-108; M-1 to M-107; M-1 to T-106;
M-1 to P-105; M-1 to N-104; M-1 to S-103; M-1 to S-102; M-1 to P-101; M-1 to T-100;
M-1 to G-99; M-1 to P-98; M-1 to A-97; M-1 to T-96; M-1 to T-95; M-1 to S-94;
M-1 to K-93; M-1 to A-92; M-1 to P-91; M-1 to S-90; M-1 to I-89; M-1 to S-88;
M-1 to T-87; M-1 to A-86; M-1 to P-85; M-1 to A-84; M-1 to L-83; M-1 to S-82;
M-1 to T-81; M-1 to V-80; M-1 to K-79; M-1 to V-78; M-1 to E-77; M-1 to P-76;
M-1 to L-75; M-1 to Q-74; M-1 to D-73; M-1 to G-72; M-1 to Y-71; M-1 to D-70;
M-1 to T-69; M-1 to L-68; M-1 to E-67; M-1 to E-66; M-1 to Y-65; M-1 to N-64;
M-1 to S-63; M-1 to L-62; M-1 to D-61; M-1 to I-60; M-1 to V-59; M-1 to E-58;
M-1 to G-57; M-1 to Y-56; M-1 to N-55; M-1 to D-54; M-1 to P-53; M-1 to N-52;
M-1 to L-51; M-1 to V-50; M-1 to D-49; M-1 to N-48; M-1 to R-47; M-1 to L-46;
M-1 to P-45; M-1 to L-44; M-1 to V-43; M-1 to E-42; M-1 to F-41; M-1 to S-40;
M-1 to D-39; M-1 to G-38; M-1 to E-37; M-1 to R-36; M-1 to P-35; M-1 to M-34;
M-1 to Q-33; M-1 to E-32; M-1 to E-31; M-1 to R-30; M-1 to R-29; M-1 to K-28;
M-1 to R-27; M-1 to E-26; M-1 to K-25; M-1 to R-24; M-1 to P-23; M-1 to L-22;
M-1 to S-21; M-1 to A-20; M-1 to T-19; M-1 to G-18; M-1 to T-17; M-1 to E-16;
M-1 to Q-15; M-1 to L-14; M-1 to V-13; M-1 to L-12; M-1 to A-11; M-1 to L-10;
M-1 to L-9; M-1 to S-8; M-1 to L-7; of SEQ ID NO:32. Polypeptides encoded by these
polynucleotides are also encompassed by the invention.
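
The N-terminal and C-terminal deletion series recited above follow a regular pattern and can be enumerated programmatically. The following is a minimal sketch only; the function names are hypothetical, and the 12-residue string is merely the first twelve residues of SEQ ID NO:32 as they can be read from the lists above, used here as a short stand-in for the full 332-residue sequence.

```python
def n_terminal_deletions(seq: str, last_start: int) -> list[str]:
    """Label N-terminal deletions 'X-n to Z-len' for n = 2 .. last_start."""
    end_label = f"{seq[-1]}-{len(seq)}"
    return [f"{seq[n - 1]}-{n} to {end_label}" for n in range(2, last_start + 1)]


def c_terminal_deletions(seq: str, first_end: int) -> list[str]:
    """Label C-terminal deletions 'M-1 to X-m' for m = len-1 down to first_end."""
    start_label = f"{seq[0]}-1"
    return [
        f"{start_label} to {seq[m - 1]}-{m}"
        for m in range(len(seq) - 1, first_end - 1, -1)
    ]


# Illustrative stand-in: residues 1-12 only, not the full-length polypeptide.
toy_seq = "MRLLAFLSLLAL"

n_series = n_terminal_deletions(toy_seq, last_start=7)
c_series = c_terminal_deletions(toy_seq, first_end=7)
print("; ".join(n_series))
print("; ".join(c_series))
```

With the full-length sequence and the bounds recited above (last_start of 327 and first_end of 7), the same two functions would be expected to reproduce the recited series.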

In addition, the invention provides nucleic acid molecules having nucleotide 
sequences related to extensive portions of SEQ ID NO: 14 which have been determined 
from the following related cDNA genes: HARAY79R (SEQ ID NO: 80), HARA044R 
(SEQ ID NO:81), HARAJ74R (SEQ ID NO:82), HARAO66R (SEQ ID NO:83), 
HARAN19R (SEQ ID NO:84), and HARAT78R (SEQ ID NO:85). 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are not 
limited to, retinal disorders. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the retina, expression of this gene at significantly higher or lower levels is 
routinely detected in certain tissues or cell types (e.g. retinal, cancerous and wounded 
tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid and spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic epitopes
shown in SEQ ID NO: 32 as residues: Leu-22 to Asp-39, Asn-64 to Pro-76, Pro-98 to
Thr-111, and Pro-291 to Glu-302. Polynucleotides encoding said polypeptides are also
provided.

The tissue distribution in retinal tissue, and the homology to a Gallus gallus
proteoglycan involved in the ossification process, indicate that polynucleotides and
polypeptides corresponding to this gene are useful for the treatment of disorders of the
retina which involve the adhesion of tissues, or the binding of certain proteins to the cell
surface. The translation products of this gene are useful for the treatment of retinal
disorders such as retinal detachment in individuals suffering from myopia, or in the
treatment of macular degeneration. Furthermore, this gene may serve as a tumor marker
for retinoblastomas, or related tumors. More generally, the tissue distribution in retinal
tissue indicates that the translation product of this gene is useful for the diagnosis,
detection and/or treatment of eye disorders including blindness, color blindness, impaired
vision, short and long sightedness, retinitis pigmentosa, retinitis proliferans,
retinoblastoma, retinochoroiditis, retinopathy and retinoschisis. Based upon the tissue
distribution of this protein, antagonists directed against this protein are useful in blocking
the activity of this protein. Accordingly, preferred are antibodies which specifically bind
a portion of the translation product of this gene.

Also provided is a kit for detecting tumors in which expression of this protein
occurs. Such a kit comprises in one embodiment an antibody specific for the translation
product of this gene bound to a solid support. Also provided is a method of detecting
these tumors in an individual which comprises a step of contacting an antibody specific
for the translation product of this gene to a bodily fluid from the individual, preferably
serum, and ascertaining whether antibody binds to an antigen found in the bodily fluid.
Preferably the antibody is bound to a solid support and the bodily fluid is serum. The
above embodiments, as well as other treatments and diagnostic tests (kits and methods),
are more particularly described elsewhere herein. Furthermore, the protein may also be
used to determine biological activity, to raise antibodies, as tissue markers, to isolate
cognate ligands or receptors, to identify agents that modulate their interactions, in
addition to its use as a nutritional supplement. The protein, as well as antibodies directed
against the protein, may show utility as a tumor marker and/or immunotherapy target for
the above listed tissues.


WO 00/29435 PCT/US99/25031

Many polynucleotide sequences, such as EST sequences, are publicly available
and accessible through sequence databases. Some of these sequences are related to SEQ
ID NO: 14 and may have been publicly available prior to conception of the present
invention. Preferably, such related polynucleotides are specifically excluded from the
scope of the present invention. To list every related sequence is cumbersome.
Accordingly, preferably excluded from the present invention are one or more
polynucleotides comprising a nucleotide sequence described by the general formula of a-
b, where a is any integer between 1 and 1375 of SEQ ID NO: 14, b is an integer of 15 to
1389, where both a and b correspond to the positions of nucleotide residues shown in
SEQ ID NO: 14, and where b is greater than or equal to a + 14.
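
The general formula a-b above can be read as a simple membership test on a pair of nucleotide positions. The following is a minimal sketch, assuming that both stated ranges are inclusive and that 1389 is the last position of SEQ ID NO: 14; the function name and the constant name are hypothetical.

```python
SEQ_ID_14_LAST_POSITION = 1389  # assumed from the recited upper bound on b


def is_in_excluded_formula(a: int, b: int,
                           last_position: int = SEQ_ID_14_LAST_POSITION) -> bool:
    """True if (a, b) satisfies the recited formula: 1 <= a <= last-14,
    15 <= b <= last, and b >= a + 14 (a span of at least 15 nucleotides)."""
    a_in_range = 1 <= a <= last_position - 14
    b_in_range = 15 <= b <= last_position
    return a_in_range and b_in_range and b >= a + 14


print(is_in_excluded_formula(1, 15))       # shortest span at the 5' end
print(is_in_excluded_formula(100, 113))    # only 14 nucleotides long
print(is_in_excluded_formula(1375, 1389))  # shortest span at the 3' end
```

The condition b >= a + 14 means that every span described by the formula is at least 15 nucleotides long, which matches the lower bound of 15 recited for b.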


FEATURES OF PROTEIN ENCODED BY GENE NO: 5 

The translation product of this gene shares sequence homology with the CD33
protein (See Genbank Accession No. gi|2913995). The expression pattern of CD33
within the hematopoietic system indicates a potential role in the regulation of myeloid
cell differentiation. However, this expression is absent from hematopoietic stem cells.
CD33 is expressed in clonogenic leukemia cells in about 90% of patients suffering from
acute myeloid leukemia (AML). While about 60-70% of adults suffering from AML
experience complete remission following chemotherapy, most of these patients
will ultimately die of relapsed leukemia. It is believed that, like CD33, the CD33-like
protein of the present invention is also expressed by clonogenic leukemia cells from the
vast majority of patients with AML. Thus, there is a clear need to identify and isolate
nucleic acid molecules encoding additional polypeptides having CD33-like protein
activity. It is believed that cancerous tissue contains a significantly greater CD33-like
protein gene copy number and expresses significantly
enhanced levels of CD33-like protein and mRNA encoding the CD33-like protein when
compared to a "standard" mammal, i.e., a mammal of the same species not having the
cancer or inflammatory disease. Thus, enhanced levels of the CD33-like protein will be
detected in certain bodily fluids (e.g., serum, plasma, urine, synovial fluid and spinal
fluid) from mammals when compared to sera from mammals of the same species not
having the cancer or inflammatory disease.

Two related cDNA genes, HDPIB36 and HEOMH10, have been isolated. These 
cDNA genes appear to encode splice variants of this gene. Preferred polynucleotides 
comprise the following sequences: 

CGACCCACGCGTCCGCCGCCTTCGGCTTCCCCTTCTGCCAA 

GAGCCCTGAGCCACTCACAGCACGACCAGAGA (SEQ ID NO: 86), 

GTATGGAATGGGGTGGGAACCCCTGCCTCTCACACTGGGGAGGGACCCTGGG 

GACAGCCTATGGGCTGAGCAGAGAGGGCTCTCAGGGACCCCTGCAGCACAA 

GAATCTCCCACCACGGTCTCTGTCCCAGCCCTGACTCAGAAGCCTGATGTCTA 

CATCCCCGAGACCCTGGAGCCCGGGCAGCCGGTGACGGTCATCTGTGTGTTT 

AACTGGGCCTTTGAGGAATGTCCACCCCCTTCTTTCTCCTGGACGGGGGCTGC 

CCTCTCCTCCCAAGGAACCAAACCAACGACCTCCCACTTCTCAG (SEQ ID NO: 

87), 

ATCCTCCAGAGAACCTGAGAGTGATGGTTTCCCAAGCAAACAGGACAGGTA 
GGAAAGGGGACAGAGGAGCCAAGGCCTCTCAGTGCCGAATTGGGGGCCCAG 
GAGTCTGGAGGGTCCCCACGCAGGAGGGTCCCTGAGCCCTGAGCTGCTCATC 
GATTCTGCCTCTTCCTTCCCT (SEQ ID NO: 88), 

GTGAGTGGGGGAAAGGGGACACCTGGGTCCCAGGAAGGGGACCCTGCTGAG 
TCCTGTCCTCCCTCCCCTCAG (SEQ ID NO: 89), 

CTGGCCCCCTGGCTCAGAAGCGGAATCAGAAAGCCACACCAAACAGTCCTCG 
GACCCCTCTTCCACCAGGTGCTCCCTCCCCAGAATCAAAGAAGAACCAGAAA 
AAGCAGTATCAGTTGCCCAGTTTCCCAGAACCCAAATCATCCACTCAAGCCC 
CAGAATCCCAGGAGAGCCAAGAGGAGCTCCATTATGCCACGCTCAACTTCCC
AGGCGTCAGACCCAGGCCTGAGGCCCGGATGCCCAAGGGCACCCAGGCGGA 
TTATGCAGAAGTCAAGTTCCAATGAGGGTCTCTTAGGCTTTAGGACTGGGAC 
TTCGGCTAGGGAGGAAGGTAGAGTAAGAGGTTGAAGATAACAGAGTGCAAA 
GTTTCCTTCTCTCCCTCTCTCTCTCTCTTTCTCTCTCTCTCTCTCTTTCTCTCTCT 
TTT (SEQ ID NO: 90), and/or

AAAAAAACATCTGGCCAGGGCACAGTGGCTCACGCCTGTAATCCCAGCACTT 

TGGGAGGTTGAGGTGGGCAGATCGCCTGAGGTCGGGAGTTCGAGACCAGCC 

TGGCCAACTTGGTGAAACCCCGTCTCTACTAAAAATACAAAAATTAGCTGGG 

CATGGTGGCAGGCGCCTGTAATCCTACTACTTGGGAAGCTGAGGCAGGAGAA 

TCACTTGAACCTGGGAGACGGAGGTTGCAGTGAGCCAAGATCACACCATTGC 

ACGCCAGCTTGGGCAACAAAGCGAGACTCCATCTCAAAAAAAAAATCCTCC 

AAATGGGTTGGGTGTCTGTAATCCCAGCACTTTGGGAGGCTAAGGTGGGTGG 

ATTGCTTGAGCCCAGGAGTTCGAGACCAGCCTGGGCAACATGGTGAAACCCC 

ATCTCTACAAAAAATACAAAACATAGCTGGGCTTGGTGGTGTGTGCCTGTAG 

TCCCAGCTGTCAGACATTTAAACCAGAGCAACTCCCATCTGGAATGGGAGCT 

GAATAAAATGAGGCTGAGACCTACTGGGCTGCCATTCTCAGACAGTGGAGGC 

CATTCTAAGTCACAGGATGAGACAGGAGGTCCGTACAAGATACAGGTCATA 

AAGACTTTGCTGATAAAACAGATTGCAGTAAAGAAGCCAACCAAATCCCACC 

AAAACCAAGTTGGCCACGAGAGTGACCTCTGGTCGTCCTCACTGCTACACTC 

CTGACAGCACCATGACAGTTTACAAATGCCATGGCAACATCAGGAAGTTACC 

CGATATGTCCCAAAAGGGGGAGGAATGAATAATCCACCCCTTGTTTAGCAAA 

TAAGCAAGAAATAACCATAAAAGTGGGCAACCAGCAGCTCTAGGCGCTGCT 

CTTGTCTATGGAGTAGCCATTCTTTTGTTCCTTTACTTTCTTAATAAACTTGCT 

TTCACCTTAAAAAAAAAAAAAAAAAAAAAA (SEQ ID NO:91). Also preferred 

are the polypeptides encoded by these polynucleotides.

Figures 13A-C show the nucleotide (SEQ ID NO: 15) and deduced amino acid
sequence (SEQ ID NO:33) of the CD33-like protein. Predicted amino acids from about 1
to about 16 constitute the predicted signal peptide (amino acid residues from about 1 to
about 16 in SEQ ID NO:33) and are represented by the underlined amino acid regions;
and amino acids from about 496 to about 512 constitute the predicted transmembrane
domain (amino acid residues from about 496 to about 512 in SEQ ID NO:33) and are
represented by the double-underlined amino acid regions.

Figure 14 shows the regions of similarity between the amino acid sequences of
the CD33-like protein (SEQ ID NO:33) and the CD33L1 protein (gi|88178) (SEQ ID
NO: 92).

Figure 15 shows an analysis of the amino acid sequence of SEQ ID NO:33.
Alpha, beta, turn and coil regions; hydrophilicity and hydrophobicity;
amphipathic regions; flexible regions; antigenic index and surface probability are shown.

Northern analysis indicates that this gene is expressed highest in spleen tissue and 
peripheral blood leukocytes, and to a lesser extent in ovary and lung tissue. 

The present invention provides isolated nucleic acid molecules comprising a 
polynucleotide encoding the polypeptide having the amino acid sequence shown in 
Figures 13A-C (SEQ ID NO: 33), which was determined by sequencing a cloned cDNA 
(HDPCL05). The nucleotide sequence shown in Figures 13A-C (SEQ ID NO: 15) was 
obtained by sequencing a cloned cDNA (HDPCL05), which was deposited on Nov. 17, 
1998 at the American Type Culture Collection, and given Accession Number 203484. 
The deposited gene is inserted in the pSport plasmid (Life Technologies, Rockville, MD) 
using the SalI/NotI restriction endonuclease cleavage sites.

The present invention is further directed to fragments of the isolated nucleic acid 
molecules described herein. By a fragment of an isolated DNA molecule having the 
nucleotide sequence of the deposited cDNA or the nucleotide sequence shown in SEQ ID 
NO: 15 is intended DNA fragments at least about 15 nt, and more preferably at least about
20 nt, still more preferably at least about 30 nt, and even more preferably, at least about
40 nt in length which are useful as diagnostic probes and primers as discussed herein. Of 
course, larger fragments 50-1500 nt in length are also useful according to the present 
invention, as are fragments corresponding to most, if not all, of the nucleotide sequence 
of the deposited cDNA or as shown in SEQ ID NO: 15. By a fragment at least 20 nt in 
length, for example, is intended fragments which include 20 or more contiguous bases 
from the nucleotide sequence of the deposited cDNA or the nucleotide sequence as 
shown in SEQ ID NO: 15. In this context "about" includes the particularly recited size, 
larger or smaller by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at both 
termini. Representative examples of polynucleotide fragments of the invention include, 
for example, fragments that comprise, or alternatively, consist of, a sequence from about 
nucleotide 1 to about 50, from about 51 to about 100, from about 101 to about 150, from 
about 151 to about 200, from about 201 to about 250, from about 251 to about 300, from 
about 301 to about 350, from about 351 to about 400, from about 401 to about 450, from 
about 451 to about 500, from about 501 to about 550, from about 551 to about 600, from 
about 601 to about 650, from about 651 to about 700, from about 701 to about 750, from 
about 751 to about 800, from about 801 to about 850, from about 851 to about 900, from 
about 901 to about 950, from about 951 to about 1000, from about 1001 to about 1050, 
from about 1051 to about 1100, from about 1101 to about 1150, from about 1151 to
about 1200, from about 1201 to about 1250, from about 1251 to about 1300, from about 
1301 to about 1350, from about 1351 to about 1400, from about 1401 to about 1450, 
from about 1451 to about 1500, from about 1501 to about 1550, from about 1551 to 
about 1600, from about 1601 to about 1650, from about 1651 to about 1700, from about 
1701 to about 1750, from about 1751 to about 1800, from about 1801 to about 1850, 
from about 1851 to about 1900, from about 1901 to about 1950, from about 1951 to 
about 2000, from about 2001 to about 2050, from about 2051 to about 2100, from about 
2101 to about 2150, from about 2151 to about 2200, from about 2201 to about 2250, 
from about 2251 to about 2295, from about 307 to about 1977, and from about 106 to
about 1977, of SEQ ID NO: 15, or the complementary strand thereto, or the cDNA 
contained in the deposited gene. In this context "about" includes the particularly recited 
ranges, larger or smaller by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at 
both termini. In additional embodiments, the polynucleotides of the invention encode 
functional attributes of the corresponding protein. 
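
The representative fragments recited above tile SEQ ID NO: 15 in consecutive 50-nucleotide windows, and "about" allows each terminus to shift by up to 5 nucleotides. The following is a minimal sketch of both ideas; the function names are hypothetical, and the total length of 2295 is an assumption taken from the last recited window.

```python
def fragment_windows(total_length: int, step: int = 50) -> list[tuple[int, int]]:
    """Consecutive 1-based inclusive windows of `step` nucleotides; the last
    window is truncated at `total_length`."""
    return [
        (start, min(start + step - 1, total_length))
        for start in range(1, total_length + 1, step)
    ]


def within_about(actual: tuple[int, int], recited: tuple[int, int],
                 tolerance: int = 5) -> bool:
    """True if each terminus of `actual` lies within `tolerance` nucleotides
    of the corresponding terminus of `recited`."""
    return all(abs(x - y) <= tolerance for x, y in zip(actual, recited))


windows = fragment_windows(2295)  # 2295 assumed from "about 2251 to about 2295"
print(windows[0], windows[-1], len(windows))
print(within_about((3, 52), (1, 50)))
print(within_about((7, 50), (1, 50)))
```

The two additional ranges recited above (about 307 to about 1977 and about 106 to about 1977) do not follow the 50-nucleotide tiling and would be handled as individually listed ranges.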

Preferred embodiments of the invention in this regard include fragments that 
comprise alpha-helix and alpha-helix forming regions ("alpha-regions"), beta-sheet and 
beta-sheet forming regions ("beta-regions"), turn and turn-forming regions ("turn- 
regions"), coil and coil-forming regions ("coil-regions"), hydrophilic regions, 
hydrophobic regions, alpha amphipathic regions, beta amphipathic regions, flexible 
regions, surface-forming regions and high antigenic index regions. The data representing 
the structural or functional attributes of the protein set forth in Figure 15 and/or Table V, 
as described above, was generated using the various modules and algorithms of the 
DNA*STAR set on default parameters. In a preferred embodiment, the data presented in 
columns VIII, IX, XIII, and XIV of Table V can be used to determine regions of the 
protein which exhibit a high degree of potential for antigenicity. Regions of high 
antigenicity are determined from the data presented in columns VIII, IX, XIII, and/or 
XIV by choosing values which represent regions of the polypeptide which are likely to 
be exposed on the surface of the polypeptide in an environment in which antigen 
recognition may occur in the process of initiation of an immune response. 
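
One way to picture the selection of regions from tabulated per-residue values is as a search for contiguous runs of residues whose value meets a cutoff. The following is a minimal sketch under stated assumptions: the scores, the cutoff of 1.0 and the function name are hypothetical and are not taken from Table V or from DNA*STAR; the minimum run length of six merely echoes the statement herein that peptides of as few as six residues may evoke an immune response.

```python
def high_score_regions(scores: list[float], threshold: float,
                       min_length: int = 6) -> list[tuple[int, int]]:
    """Return 1-based inclusive (start, end) runs of at least `min_length`
    consecutive residues whose score is >= `threshold`."""
    regions = []
    run_start = None
    for pos, score in enumerate(scores, start=1):
        if score >= threshold:
            if run_start is None:
                run_start = pos  # a new run begins here
        else:
            if run_start is not None and pos - run_start >= min_length:
                regions.append((run_start, pos - 1))
            run_start = None
    # Close a run that extends to the last residue.
    if run_start is not None and len(scores) - run_start + 1 >= min_length:
        regions.append((run_start, len(scores)))
    return regions


# Hypothetical per-residue values for a 16-residue stretch.
example_scores = [0.1] * 3 + [1.5] * 7 + [0.2] * 2 + [1.6] * 4
regions = high_score_regions(example_scores, threshold=1.0)
print(regions)
```

In this illustration the second run (residues 13 to 16) is discarded because it is shorter than six residues; lowering min_length to 4 would retain it.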

Certain preferred regions in these regards are set out in Figure 15, but may, as 
shown in Table V, be represented or identified by using tabular representations of the 
data presented in Figure 15. The DNA*STAR computer algorithm used to generate 
Figure 15 (set on the original default parameters) was used to present the data in Figure 
15 in a tabular format (See Table V). The tabular format of the data in Figure 15 is used 
to easily determine specific boundaries of a preferred region. The above-mentioned 
preferred regions set out in Figure 15 and in Table V include, but are not limited to,
regions of the aforementioned types identified by analysis of the amino acid sequence set 
out in Figures 13A-C. As set out in Figure 15 and in Table V, such preferred regions 
include Garnier-Robson alpha-regions, beta-regions, turn-regions, and coil-regions, 
Chou-Fasman alpha-regions, beta-regions, and turn-regions, Kyte-Doolittle hydrophilic 
regions and Hopp-Woods hydrophobic regions, Eisenberg alpha- and beta-amphipathic 
regions, Karplus-Schulz flexible regions, Jameson-Wolf regions of high antigenic index 
and Emini surface-forming regions.

Even if deletion of one or more amino acids from the
N-terminus of a protein results in modification or loss of one or more biological
functions of the protein, other functional activities (e.g., biological activities, ability to 
multimerize, etc.) may still be retained. For example, the ability of shortened muteins to 
induce and/or bind to antibodies which recognize the complete or mature forms of the 
polypeptides generally will be retained when less than the majority of the residues of the 
complete or mature polypeptide are removed from the N-terminus. Whether a particular 
polypeptide lacking N-terminal residues of a complete polypeptide retains such 
immunologic activities can readily be determined by routine methods described herein 
and otherwise known in the art. It is not unlikely that a mutein with a large number of 
deleted N-terminal amino acid residues may retain some biological or immunogenic 
activities. In fact, peptides composed of as few as six amino acid residues may often 
evoke an immune response. 

Accordingly, the present invention further provides polypeptides having one or 
more residues deleted from the amino terminus of the amino acid sequence shown in 
Figures 13A-C, up to the alanine residue at position number 634 and polynucleotides 
encoding such polypeptides. In particular, the present invention provides polypeptides 
comprising the amino acid sequence of residues n1-639 of Figures 13A-C, where n1 is an
integer from 2 to 634 corresponding to the position of the amino acid residue in Figures 
13A-C (which is identical to the sequence shown as SEQ ID NO:33). N-terminal 
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deletions of the polypeptide of the invention shown as SEQ ID NO:33 include 
polypeptides comprising the amino acid sequence of residues: L-2 to Q-639; L-3 to Q- 
639; P-4 to Q-639;L-5 to Q-639; L-6 to Q-639; L-7 to Q-639; S-8 to Q-639; S-9 toQ- 
639; L-10 to Q-639; L-l 1 to Q-639; G-12 to Q-639; G-13 toQ-639; S-14 to Q-639; Q-15 
to Q-639; A-16 to Q-639; M-17 toQ-639; D-18 to Q-639; G-19 to Q-639; R-20 to Q-639; 
F-21 toQ-639; W-22 to Q-639; 1-23 to Q-639; R-24 to Q-639; V-25 toQ-639; Q-26 to Q- 
639; E-27 to Q-639; S-28 to Q-639; V-29 toQ-639; M-30 to Q-639; V-31 to Q-639: P-32 
to Q-639; E-33 toQ-639; A-34 to Q-639; C-35 to Q-639; D-36 to Q-639; 1-37 toQ-639; 
S-38 to Q-639; V-39 to Q-639; P-40 to Q-639; C-41 toQ-639; S-42 to Q-639; F-43 to Q- 
639; S-44 to Q-639; Y-45 toQ-639; P-46 to Q-639; R-47 to Q-639; Q-48 to Q-639; D-49 
toQ-639; W-50 to Q-639; T-5I to Q-639; G-52 to Q-639; S-53 toQ-639; T-54 to Q-639; 
P-55 to Q-639; A-56 to Q-639; Y-57 toQ-639; G-58 to Q-639; Y-59 to Q-639; W-60 to 
Q-639; F-61 toQ-639; K-62 to Q-639; A-63 to Q-639; V-64 to Q-639; T-65 toQ-639; E- 
66 to Q-639; T-67 to Q-639; T-68 to Q-639; K-69 toQ-639; G-70 to Q-639; A-71 to Q- 
639; P-72 to Q-639; V-73 toQ-639; A-74 to Q-639; T-75 to Q-639; N-76 to Q-639; H-77 
toQ-639; Q 78 to Q-639; S-79 to Q-639; R-80 to Q-639; E-81 toQ-639; V-82 to Q-639; 
E-83 to Q-639; M-84 to Q-639; S-85 toQ-639; T-86 to Q-639; R-87 to Q-639; G-88 to 
Q-639; R-89 toQ-639; F-90 to Q-639; Q-91 to Q-639; L-92 to Q-639; T-93 toQ-639; G- 
94 to Q-639; D-95 to Q-639; P-96 to Q-639; A-97 toQ-639; K-98 to Q-639; G-99 to Q- 
639; N-100 to Q-639; C-101 toQ-639; S-102 to Q-639; L-103 to Q-639; V-104 to Q-639; 
1-105 toQ-639; R-106 to Q-639; D-107 to Q-639; A-108 to Q-639; Q-109to Q-639; M- 
110 to Q-639; Q-lll to Q-639; D-112 to Q-639;E-113 to Q-639; S-114 to Q-639; Q-115 
to Q-639; Y-l 16 to Q-639;F-1 17 to Q-639; F-l 18 to Q-639; R-l 19 to Q-639; V-120 to 
Q-639;E-121 to Q-639; R-122 to Q-639; G-123 to Q-639; S-124 toQ-639; Y-125 to Q- 
639; V-126 to Q-639; R-127 to Q-639; Y-128to Q-639; N-129 to Q-639; F-130 to Q- 
639; M 131 to Q-639;N-132 to Q-639; D-133 to Q-639; G-134 to Q-639; F-135 toQ- 
639; F-136 to Q-639; L-137 to Q-639; K-138 to Q-639; V-139to Q-639; T-140 to Q-639; 
V-141 to Q-639; L-142 to Q-639; S-143 to Q-639; F-144 to Q-639; T-145 to Q-639;
P-146 to Q-639; R-147 to Q-639; P-148 to Q-639; Q-149 to Q-639; D-150 to Q-639;
H-151 to Q-639; N-152 to Q-639; T-153 to Q-639; D-154 to Q-639; L-155 to Q-639;
T-156 to Q-639; C-157 to Q-639; H-158 to Q-639; V-159 to Q-639; D-160 to Q-639;
F-161 to Q-639; S-162 to Q-639; R-163 to Q-639; K-164 to Q-639; G-165 to Q-639;
V-166 to Q-639; S-167 to Q-639; A-168 to Q-639; Q-169 to Q-639; R-170 to Q-639;
T-171 to Q-639; V-172 to Q-639; R-173 to Q-639; L-174 to Q-639; R-175 to Q-639;
V-176 to Q-639; A-177 to Q-639; Y-178 to Q-639; A-179 to Q-639; P-180 to Q-639;
R-181 to Q-639; D-182 to Q-639; L-183 to Q-639; V-184 to Q-639; I-185 to Q-639;
S-186 to Q-639; I-187 to Q-639; S-188 to Q-639; R-189 to Q-639; D-190 to Q-639;
N-191 to Q-639; T-192 to Q-639; P-193 to Q-639; A-194 to Q-639; L-195 to Q-639;
E-196 to Q-639; P-197 to Q-639; Q-198 to Q-639; P-199 to Q-639; Q-200 to Q-639;
G-201 to Q-639; N-202 to Q-639; V-203 to Q-639; P-204 to Q-639; Y-205 to Q-639;
L-206 to Q-639; E-207 to Q-639; A-208 to Q-639; Q-209 to Q-639; K-210 to Q-639;
G-211 to Q-639; Q-212 to Q-639; F-213 to Q-639; L-214 to Q-639; R-215 to Q-639;
L-216 to Q-639; L-217 to Q-639; C-218 to Q-639; A-219 to Q-639; A-220 to Q-639;
D-221 to Q-639; S-222 to Q-639; Q-223 to Q-639; P-224 to Q-639; P-225 to Q-639;
A-226 to Q-639; T-227 to Q-639; L-228 to Q-639; S-229 to Q-639; W-230 to Q-639;
V-231 to Q-639; L-232 to Q-639; Q-233 to Q-639; N-234 to Q-639; R-235 to Q-639;
V-236 to Q-639; L-237 to Q-639; S-238 to Q-639; S-239 to Q-639; S-240 to Q-639;
H-241 to Q-639; P-242 to Q-639; W-243 to Q-639; G-244 to Q-639; P-245 to Q-639;
R-246 to Q-639; P-247 to Q-639; L-248 to Q-639; G-249 to Q-639; L-250 to Q-639;
E-251 to Q-639; L-252 to Q-639; P-253 to Q-639; G-254 to Q-639; V-255 to Q-639;
K-256 to Q-639; A-257 to Q-639; G-258 to Q-639; D-259 to Q-639; S-260 to Q-639;
G-261 to Q-639; R-262 to Q-639; Y-263 to Q-639; T-264 to Q-639; C-265 to Q-639;
R-266 to Q-639; A-267 to Q-639; E-268 to Q-639; N-269 to Q-639; R-270 to Q-639;
L-271 to Q-639; G-272 to Q-639; S-273 to Q-639; Q-274 to Q-639; Q-275 to Q-639;
R-276 to Q-639; A-277 to Q-639; L-278 to Q-639; D-279 to Q-639; L-280 to Q-639;
S-281 to Q-639; V-282 to Q-639; Q-283 to Q-639; Y-284 to Q-639; P-285 to Q-639;
P-286 to Q-639; E-287 to Q-639; N-288 to Q-639; L-289 to Q-639; R-290 to Q-639;
V-291 to Q-639; M-292 to Q-639; V-293 to Q-639; S-294 to Q-639; Q-295 to Q-639;
A-296 to Q-639; N-297 to Q-639; R-298 to Q-639; T-299 to Q-639; V-300 to Q-639;
L-301 to Q-639; E-302 to Q-639; N-303 to Q-639; L-304 to Q-639; G-305 to Q-639;
N-306 to Q-639; G-307 to Q-639; T-308 to Q-639; S-309 to Q-639; L-310 to Q-639;
P-311 to Q-639; V-312 to Q-639; L-313 to Q-639; E-314 to Q-639; G-315 to Q-639;
Q-316 to Q-639; S-317 to Q-639; L-318 to Q-639; C-319 to Q-639; L-320 to Q-639;
V-321 to Q-639; C-322 to Q-639; V-323 to Q-639; T-324 to Q-639; H-325 to Q-639;
S-326 to Q-639; S-327 to Q-639; P-328 to Q-639; P-329 to Q-639; A-330 to Q-639;
R-331 to Q-639; L-332 to Q-639; S-333 to Q-639; W-334 to Q-639; T-335 to Q-639;
Q-336 to Q-639; R-337 to Q-639; G-338 to Q-639; Q-339 to Q-639; V-340 to Q-639;
L-341 to Q-639; S-342 to Q-639; P-343 to Q-639; S-344 to Q-639; Q-345 to Q-639;
P-346 to Q-639; S-347 to Q-639; D-348 to Q-639; P-349 to Q-639; G-350 to Q-639;
V-351 to Q-639; L-352 to Q-639; E-353 to Q-639; L-354 to Q-639; P-355 to Q-639;
R-356 to Q-639; V-357 to Q-639; Q-358 to Q-639; V-359 to Q-639; E-360 to Q-639;
H-361 to Q-639; E-362 to Q-639; G-363 to Q-639; E-364 to Q-639; F-365 to Q-639;
T-366 to Q-639; C-367 to Q-639; H-368 to Q-639; A-369 to Q-639; R-370 to Q-639;
H-371 to Q-639; P-372 to Q-639; L-373 to Q-639; G-374 to Q-639; S-375 to Q-639;
Q-376 to Q-639; H-377 to Q-639; V-378 to Q-639; S-379 to Q-639; L-380 to Q-639;
S-381 to Q-639; L-382 to Q-639; S-383 to Q-639; V-384 to Q-639; H-385 to Q-639;
Y-386 to Q-639; S-387 to Q-639; P-388 to Q-639; K-389 to Q-639; L-390 to Q-639;
L-391 to Q-639; G-392 to Q-639; P-393 to Q-639; S-394 to Q-639; C-395 to Q-639;
S-396 to Q-639; W-397 to Q-639; E-398 to Q-639; A-399 to Q-639; E-400 to Q-639;
G-401 to Q-639; L-402 to Q-639; H-403 to Q-639; C-404 to Q-639; S-405 to Q-639;
C-406 to Q-639; S-407 to Q-639; S-408 to Q-639; Q-409 to Q-639; A-410 to Q-639;
S-411 to Q-639; P-412 to Q-639; A-413 to Q-639; P-414 to Q-639; S-415 to Q-639;
L-416 to Q-639; R-417 to Q-639; W-418 to Q-639; W-419 to Q-639; L-420 to Q-639;
G-421 to Q-639; E-422 to Q-639; E-423 to Q-639; L-424 to Q-639; L-425 to Q-639;
E-426 to Q-639; G-427 to Q-639; N-428 to Q-639; S-429 to Q-639; S-430 to Q-639;
Q-431 to Q-639; D-432 to Q-639; S-433 to Q-639; F-434 to Q-639; E-435 to Q-639;
V-436 to Q-639; T-437 to Q-639; P-438 to Q-639; S-439 to Q-639; S-440 to Q-639;
A-441 to Q-639; G-442 to
Q-639; P-443 to Q-639; W-444 to Q-639; A-445 to Q-639;
N-446 to Q-639; S-447 to Q-639; S-448 to Q-639; L-449 to Q-639; S-450 to Q-639;
L-451 to Q-639; H-452 to Q-639; G-453 to Q-639; G-454 to Q-639; L-455 to Q-639;
S-456 to Q-639; S-457 to Q-639; G-458 to Q-639; L-459 to Q-639; R-460 to Q-639;
L-461 to Q-639; R-462 to Q-639; C-463 to Q-639; E-464 to Q-639; A-465 to Q-639;
W-466 to Q-639; N-467 to Q-639; V-468 to Q-639; H-469 to Q-639; G-470 to Q-639;
A-471 to Q-639; Q-472 to Q-639; S-473 to Q-639; G-474 to Q-639; S-475 to Q-639;
I-476 to Q-639; L-477 to Q-639; Q-478 to Q-639; L-479 to Q-639; P-480 to Q-639;
D-481 to Q-639; K-482 to Q-639; K-483 to Q-639; G-484 to Q-639; L-485 to Q-639;
I-486 to Q-639; S-487 to Q-639; T-488 to Q-639; A-489 to Q-639; F-490 to Q-639;
S-491 to Q-639; N-492 to Q-639; G-493 to Q-639; A-494 to Q-639; F-495 to Q-639;
L-496 to Q-639; G-497 to Q-639; I-498 to Q-639; G-499 to Q-639; I-500 to Q-639;
T-501 to Q-639; A-502 to Q-639; L-503 to Q-639; L-504 to Q-639; F-505 to Q-639;
L-506 to Q-639; C-507 to Q-639; L-508 to Q-639; A-509 to Q-639; L-510 to Q-639;
I-511 to Q-639; I-512 to Q-639; M-513 to Q-639; K-514 to Q-639; I-515 to Q-639;
L-516 to Q-639; P-517 to Q-639; K-518 to Q-639; R-519 to Q-639; R-520 to Q-639;
T-521 to Q-639; Q-522 to Q-639; T-523 to Q-639; E-524 to Q-639; T-525 to Q-639;
P-526 to Q-639; R-527 to Q-639; P-528 to Q-639; R-529 to Q-639; F-530 to Q-639;
S-531 to Q-639; R-532 to Q-639; H-533 to Q-639; S-534 to Q-639; T-535 to Q-639;
I-536 to Q-639; L-537 to Q-639; D-538 to Q-639; Y-539 to Q-639; I-540 to Q-639;
N-541 to Q-639; V-542 to Q-639; V-543 to Q-639; P-544 to Q-639; T-545 to Q-639;
A-546 to Q-639; G-547 to Q-639; P-548 to Q-639; L-549 to Q-639; A-550 to Q-639;
Q-551 to Q-639; K-552 to Q-639; R-553 to Q-639; N-554 to Q-639; Q-555 to Q-639;
K-556 to Q-639; A-557 to Q-639; T-558 to Q-639; P-559 to Q-639; N-560 to Q-639;
S-561 to Q-639; P-562 to Q-639; R-563 to Q-639; T-564 to Q-639; P-565 to Q-639;
L-566 to Q-639; P-567 to Q-639; P-568 to Q-639; G-569 to Q-639; A-570 to Q-639;
P-571 to Q-639; S-572 to Q-639; P-573 to Q-639; E-574 to Q-639; S-575 to Q-639;
K-576 to Q-639; K-577 to Q-639; N-578 to Q-639; Q-579 to Q-639; K-580 to Q-639;
K-581 to Q-639; Q-582 to Q-639; Y-583 to Q-639; Q-584 to Q-639; L-585 to Q-639;
P-586 to Q-639; S-587 to Q-639; F-588 to Q-639; P-589 to Q-639; E-590 to Q-639;
P-591 to Q-639; K-592 to Q-639; S-593 to Q-639; S-594 to Q-639; T-595 to Q-639;
Q-596 to Q-639; A-597 to Q-639; P-598 to Q-639; E-599 to Q-639; S-600 to Q-639;
Q-601 to Q-639; E-602 to Q-639; S-603 to Q-639; Q-604 to Q-639; E-605 to Q-639;
E-606 to Q-639; L-607 to Q-639; H-608 to Q-639; Y-609 to Q-639; A-610 to Q-639;
T-611 to Q-639; L-612 to Q-639; N-613 to Q-639; F-614 to Q-639; P-615 to Q-639;
G-616 to Q-639; V-617 to Q-639; R-618 to Q-639; P-619 to Q-639; R-620 to Q-639;
P-621 to Q-639; E-622 to Q-639; A-623 to Q-639; R-624 to Q-639; M-625 to Q-639;
P-626 to Q-639; K-627 to Q-639; G-628 to Q-639; T-629 to Q-639; Q-630 to Q-639;
A-631 to Q-639; D-632 to Q-639; Y-633 to Q-639; A-634 to Q-639; of SEQ ID
NO:33. Polypeptides encoded by these polynucleotides are also encompassed by the
invention.

Also as mentioned above, even if deletion of one or more amino acids from the 
C-terminus of a protein results in modification or loss of one or more biological 
functions of the protein, other functional activities (e.g., biological activities (e.g., ability
to elicit mitogenic activity, induce differentiation of normal or malignant cells, bind to
EGF receptors, etc.)), may still be retained. For example, the ability to induce and/or bind
to antibodies which recognize the complete or mature forms of the polypeptide generally 
will be retained when less than the majority of the residues of the complete or mature 
polypeptide are removed from the C-terminus. Whether a particular polypeptide lacking 
C-terminal residues of a complete polypeptide retains such immunologic activities can 
readily be determined by routine methods described herein and otherwise known in the 
art. It is not unlikely that a mutein with a large number of deleted C-terminal amino acid 
residues may retain some biological or immunogenic activities. In fact, peptides
composed of as few as six amino acid residues may often evoke an immune response. 

Accordingly, the present invention further provides polypeptides having one or
more residues deleted from the carboxy terminus of the amino acid sequence of the
polypeptide shown in Figures 13A-C, up to the leucine residue at position number 7, and
polynucleotides encoding such polypeptides. In particular, the present invention
provides polypeptides comprising the amino acid sequence of residues 1-m1 of Figures
13A-C, where m1 is an integer from 7 to 638 corresponding to the position of the amino
acid residue in Figures 13A-C. Moreover, the invention provides polynucleotides
encoding polypeptides comprising, or alternatively consisting of, the amino acid
sequence of C-terminal deletions of the polypeptide of the invention shown as SEQ ID
NO:33, including polypeptides comprising the amino acid sequence of residues:
M-1 to F-638; M-1 to K-637; M-1 to V-636; M-1 to E-635; M-1 to A-634;
M-1 to Y-633; M-1 to D-632; M-1 to A-631; M-1 to Q-630; M-1 to T-629;
M-1 to G-628; M-1 to K-627; M-1 to P-626; M-1 to M-625; M-1 to R-624;
M-1 to A-623; M-1 to E-622; M-1 to P-621; M-1 to R-620; M-1 to P-619;
M-1 to R-618; M-1 to V-617; M-1 to G-616; M-1 to P-615; M-1 to F-614;
M-1 to N-613; M-1 to L-612; M-1 to T-611; M-1 to A-610; M-1 to Y-609;
M-1 to H-608; M-1 to L-607; M-1 to E-606; M-1 to E-605; M-1 to Q-604;
M-1 to S-603; M-1 to E-602; M-1 to Q-601; M-1 to S-600; M-1 to E-599;
M-1 to P-598; M-1 to A-597; M-1 to Q-596; M-1 to T-595; M-1 to S-594;
M-1 to S-593; M-1 to K-592; M-1 to P-591; M-1 to E-590; M-1 to P-589;
M-1 to F-588; M-1 to S-587; M-1 to P-586; M-1 to L-585; M-1 to Q-584;
M-1 to Y-583; M-1 to Q-582; M-1 to K-581; M-1 to K-580; M-1 to Q-579;
M-1 to N-578; M-1 to K-577; M-1 to K-576; M-1 to S-575; M-1 to E-574;
M-1 to P-573; M-1 to S-572; M-1 to P-571; M-1 to A-570; M-1 to G-569;
M-1 to P-568; M-1 to P-567; M-1 to L-566; M-1 to P-565; M-1 to T-564;
M-1 to R-563; M-1 to P-562; M-1 to S-561; M-1 to N-560; M-1 to P-559;
M-1 to T-558; M-1 to A-557; M-1 to K-556; M-1 to Q-555; M-1 to N-554;
M-1 to R-553; M-1 to K-552; M-1 to Q-551; M-1 to A-550; M-1 to L-549;
M-1 to P-548; M-1 to G-547; M-1 to A-546; M-1 to T-545; M-1 to P-544;
M-1 to V-543; M-1 to V-542; M-1 to N-541; M-1 to I-540; M-1 to Y-539;
M-1 to D-538; M-1 to L-537; M-1 to I-536; M-1 to T-535; M-1 to S-534;
M-1 to H-533; M-1 to R-532; M-1 to S-531; M-1 to F-530; M-1 to R-529;
M-1 to P-528; M-1 to R-527; M-1 to P-526; M-1 to T-525; M-1 to E-524;
M-1 to T-523; M-1 to Q-522; M-1 to T-521; M-1 to R-520; M-1 to R-519;
M-1 to K-518; M-1 to P-517; M-1 to L-516; M-1 to I-515; M-1 to K-514;
M-1 to M-513; M-1 to I-512; M-1 to I-511; M-1 to L-510; M-1 to A-509;
M-1 to L-508; M-1 to C-507; M-1 to L-506; M-1 to F-505; M-1 to L-504;
M-1 to L-503; M-1 to A-502; M-1 to T-501; M-1 to I-500; M-1 to G-499;
M-1 to I-498; M-1 to G-497; M-1 to L-496; M-1 to F-495; M-1 to A-494;
M-1 to G-493; M-1 to N-492; M-1 to S-491; M-1 to F-490; M-1 to A-489;
M-1 to T-488; M-1 to S-487; M-1 to I-486; M-1 to L-485; M-1 to G-484;
M-1 to K-483; M-1 to K-482; M-1 to D-481; M-1 to P-480; M-1 to L-479;
M-1 to Q-478; M-1 to L-477; M-1 to I-476; M-1 to S-475; M-1 to G-474;
M-1 to S-473; M-1 to Q-472; M-1 to A-471; M-1 to G-470; M-1 to H-469;
M-1 to V-468; M-1 to N-467; M-1 to W-466; M-1 to A-465; M-1 to E-464;
M-1 to C-463; M-1 to R-462; M-1 to L-461; M-1 to R-460; M-1 to L-459;
M-1 to G-458; M-1 to S-457; M-1 to S-456; M-1 to L-455; M-1 to G-454;
M-1 to G-453; M-1 to H-452; M-1 to L-451; M-1 to S-450; M-1 to L-449;
M-1 to S-448; M-1 to S-447; M-1 to N-446; M-1 to A-445; M-1 to W-444;
M-1 to P-443; M-1 to G-442; M-1 to A-441; M-1 to S-440; M-1 to S-439;
M-1 to P-438; M-1 to T-437; M-1 to V-436; M-1 to E-435; M-1 to F-434;
M-1 to S-433; M-1 to D-432; M-1 to Q-431; M-1 to S-430; M-1 to S-429;
M-1 to N-428; M-1 to G-427; M-1 to E-426; M-1 to L-425; M-1 to L-424;
M-1 to E-423; M-1 to E-422; M-1 to G-421; M-1 to L-420; M-1 to W-419;
M-1 to W-418; M-1 to R-417; M-1 to L-416; M-1 to S-415; M-1 to P-414;
M-1 to A-413; M-1 to P-412; M-1 to S-411; M-1 to A-410; M-1 to Q-409;
M-1 to S-408; M-1 to S-407; M-1 to C-406; M-1 to S-405; M-1 to C-404;
M-1 to H-403; M-1 to L-402; M-1 to G-401; M-1 to E-400; M-1 to A-399;
M-1 to E-398; M-1 to W-397; M-1 to S-396; M-1 to C-395; M-1 to S-394;
M-1 to P-393; M-1 to G-392; M-1 to L-391; M-1 to L-390; M-1 to K-389;
M-1 to P-388; M-1
to S-387; M-1 to Y-386; M-1 to H-385; M-1 to V-384; M-1 to S-383;
M-1 to L-382; M-1 to S-381; M-1 to L-380; M-1 to S-379; M-1 to V-378;
M-1 to H-377; M-1 to Q-376; M-1 to S-375; M-1 to G-374; M-1 to L-373;
M-1 to P-372; M-1 to H-371; M-1 to R-370; M-1 to A-369; M-1 to H-368;
M-1 to C-367; M-1 to T-366; M-1 to F-365; M-1 to E-364; M-1 to G-363;
M-1 to E-362; M-1 to H-361; M-1 to E-360; M-1 to V-359; M-1 to Q-358;
M-1 to V-357; M-1 to R-356; M-1 to P-355; M-1 to L-354; M-1 to E-353;
M-1 to L-352; M-1 to V-351; M-1 to G-350; M-1 to P-349; M-1 to D-348;
M-1 to S-347; M-1 to P-346; M-1 to Q-345; M-1 to S-344; M-1 to P-343;
M-1 to S-342; M-1 to L-341; M-1 to V-340; M-1 to Q-339; M-1 to G-338;
M-1 to R-337; M-1 to Q-336; M-1 to T-335; M-1 to W-334; M-1 to S-333;
M-1 to L-332; M-1 to R-331; M-1 to A-330; M-1 to P-329; M-1 to P-328;
M-1 to S-327; M-1 to S-326; M-1 to H-325; M-1 to T-324; M-1 to V-323;
M-1 to C-322; M-1 to V-321; M-1 to L-320; M-1 to C-319; M-1 to L-318;
M-1 to S-317; M-1 to Q-316; M-1 to G-315; M-1 to E-314; M-1 to L-313;
M-1 to V-312; M-1 to P-311; M-1 to L-310; M-1 to S-309; M-1 to T-308;
M-1 to G-307; M-1 to N-306; M-1 to G-305; M-1 to L-304; M-1 to N-303;
M-1 to E-302; M-1 to L-301; M-1 to V-300; M-1 to T-299; M-1 to R-298;
M-1 to N-297; M-1 to A-296; M-1 to Q-295; M-1 to S-294; M-1 to V-293;
M-1 to M-292; M-1 to V-291; M-1 to R-290; M-1 to L-289; M-1 to N-288;
M-1 to E-287; M-1 to P-286; M-1 to P-285; M-1 to Y-284; M-1 to Q-283;
M-1 to V-282; M-1 to S-281; M-1 to L-280; M-1 to D-279; M-1 to L-278;
M-1 to A-277; M-1 to R-276; M-1 to Q-275; M-1 to Q-274; M-1 to S-273;
M-1 to G-272; M-1 to L-271; M-1 to R-270; M-1 to N-269; M-1 to E-268;
M-1 to A-267; M-1 to R-266; M-1 to C-265; M-1 to T-264; M-1 to Y-263;
M-1 to R-262; M-1 to G-261; M-1 to S-260; M-1 to D-259; M-1 to G-258;
M-1 to A-257; M-1 to K-256; M-1 to V-255; M-1 to G-254; M-1 to P-253;
M-1 to L-252; M-1 to E-251; M-1 to L-250; M-1 to G-249; M-1 to L-248;
M-1 to P-247; M-1 to R-246; M-1 to P-245; M-1 to G-244; M-1 to W-243;
M-1 to P-242; M-1 to H-241; M-1 to S-240; M-1 to S-239; M-1 to S-238;
M-1 to L-237; M-1 to V-236; M-1 to R-235; M-1 to N-234; M-1 to Q-233;
M-1 to L-232; M-1 to V-231; M-1 to W-230; M-1 to S-229; M-1 to L-228;
M-1 to T-227; M-1 to
A-226; M-1 to P-225; M-1 to P-224; M-1 to Q-223;
M-1 to S-222; M-1 to D-221; M-1 to A-220; M-1 to A-219; M-1 to C-218;
M-1 to L-217; M-1 to L-216; M-1 to R-215; M-1 to L-214; M-1 to F-213;
M-1 to Q-212; M-1 to G-211; M-1 to K-210; M-1 to Q-209; M-1 to A-208;
M-1 to E-207; M-1 to L-206; M-1 to Y-205; M-1 to P-204; M-1 to V-203;
M-1 to N-202; M-1 to G-201; M-1 to Q-200; M-1 to P-199; M-1 to Q-198;
M-1 to P-197; M-1 to E-196; M-1 to L-195; M-1 to A-194; M-1 to P-193;
M-1 to T-192; M-1 to N-191; M-1 to D-190; M-1 to R-189; M-1 to S-188;
M-1 to I-187; M-1 to S-186; M-1 to I-185; M-1 to V-184; M-1 to L-183;
M-1 to D-182; M-1 to R-181; M-1 to P-180; M-1 to A-179; M-1 to Y-178;
M-1 to A-177; M-1 to V-176; M-1 to R-175; M-1 to L-174; M-1 to R-173;
M-1 to V-172; M-1 to T-171; M-1 to R-170; M-1 to Q-169; M-1 to A-168;
M-1 to S-167; M-1 to V-166; M-1 to G-165; M-1 to K-164; M-1 to R-163;
M-1 to S-162; M-1 to F-161; M-1 to D-160; M-1 to V-159; M-1 to H-158;
M-1 to C-157; M-1 to T-156; M-1 to L-155; M-1 to D-154; M-1 to T-153;
M-1 to N-152; M-1 to H-151; M-1 to D-150; M-1 to Q-149; M-1 to P-148;
M-1 to R-147; M-1 to P-146; M-1 to T-145; M-1 to F-144; M-1 to S-143;
M-1 to L-142; M-1 to V-141; M-1 to T-140; M-1 to V-139; M-1 to K-138;
M-1 to L-137; M-1 to F-136; M-1 to F-135; M-1 to G-134; M-1 to D-133;
M-1 to N-132; M-1 to M-131; M-1 to F-130; M-1 to N-129; M-1 to Y-128;
M-1 to R-127; M-1 to V-126; M-1 to Y-125; M-1 to S-124; M-1 to G-123;
M-1 to R-122; M-1 to E-121; M-1 to V-120; M-1 to R-119; M-1 to F-118;
M-1 to F-117; M-1 to Y-116; M-1 to Q-115; M-1 to S-114; M-1 to E-113;
M-1 to D-112; M-1 to Q-111; M-1 to M-110; M-1 to Q-109; M-1 to A-108;
M-1 to D-107; M-1 to R-106; M-1 to I-105; M-1 to V-104; M-1 to L-103;
M-1 to S-102; M-1 to C-101; M-1 to N-100; M-1 to G-99; M-1 to K-98;
M-1 to A-97; M-1 to P-96; M-1 to D-95; M-1 to G-94; M-1 to T-93;
M-1 to L-92; M-1 to Q-91; M-1 to F-90; M-1 to R-89; M-1 to G-88;
M-1 to R-87; M-1 to T-86; M-1 to S-85; M-1 to M-84; M-1 to E-83;
M-1 to V-82; M-1 to E-81; M-1 to R-80; M-1 to S-79; M-1 to Q-78;
M-1 to H-77; M-1 to N-76; M-1 to T-75; M-1 to A-74; M-1 to V-73;
M-1 to P-72; M-1 to A-71; M-1 to G-70; M-1 to K-69; M-1 to T-68;
M-1 to T-67; M-1 to E-66; M-1 to T-65; M-1 to V-64; M-1 to A-63;
M-1 to K-62;
M-1 to F-61; M-1 to W-60; M-1 to Y-59; M-1 to G-58; M-1 to Y-57;
M-1 to A-56; M-1 to P-55; M-1 to T-54; M-1 to S-53; M-1 to G-52;
M-1 to T-51; M-1 to W-50; M-1 to D-49; M-1 to Q-48; M-1 to R-47;
M-1 to P-46; M-1 to Y-45; M-1 to S-44; M-1 to F-43; M-1 to S-42;
M-1 to C-41; M-1 to P-40; M-1 to V-39; M-1 to S-38; M-1 to I-37;
M-1 to D-36; M-1 to C-35; M-1 to A-34; M-1 to E-33; M-1 to P-32;
M-1 to V-31; M-1 to M-30; M-1 to V-29; M-1 to S-28; M-1 to E-27;
M-1 to Q-26; M-1 to V-25; M-1 to R-24; M-1 to I-23; M-1 to W-22;
M-1 to F-21; M-1 to R-20; M-1 to G-19; M-1 to D-18; M-1 to M-17;
M-1 to A-16; M-1 to Q-15; M-1 to S-14; M-1 to G-13; M-1 to G-12;
M-1 to L-11; M-1 to L-10; M-1 to S-9; M-1 to S-8; M-1 to L-7; of SEQ ID NO:33.
Polypeptides encoded by these polynucleotides are also encompassed by the invention.
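
The C-terminal deletion series recited above follows a single rule: each member spans residue 1 through residue m1, with m1 running from one less than the full length down to 7. By way of illustration only, that rule can be sketched in Python as follows. The function name and the ten-residue placeholder sequence are assumptions introduced for this sketch; the placeholder is not the sequence of SEQ ID NO:33.

```python
def c_terminal_deletions(sequence, min_end=7):
    """Yield (label, fragment) pairs for residues 1..m of `sequence`.

    m runs from len(sequence) - 1 down to min_end, mirroring the
    "M-1 to X-m" notation of the list above (1-based positions).
    """
    for m in range(len(sequence) - 1, min_end - 1, -1):
        # Label uses the first residue and the new C-terminal residue.
        label = f"{sequence[0]}-1 to {sequence[m - 1]}-{m}"
        yield label, sequence[:m]


if __name__ == "__main__":
    toy_sequence = "MKTAYIAKQR"  # hypothetical placeholder, not SEQ ID NO:33
    for label, fragment in c_terminal_deletions(toy_sequence):
        print(label, fragment)
```

For a 639-residue polypeptide with m1 from 638 down to 7, the generator yields 632 fragments, one for each entry in the list above.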

In addition, the invention provides nucleic acid molecules having nucleotide 
sequences related to extensive portions of SEQ ID NO: 15 which have been determined 
from the following related cDNA genes: HTOFA26R (SEQ ID NO:93), HWAEM43R 
(SEQ ID NO:94), HDPMQ69R (SEQ ID NO:95), HDPGA09RA (SEQ ID NO:96), 
HEOMH10R (SEQ ID NO:97), and HFKCT73F (SEQ ID NO:98). 

The polypeptide of this gene has been determined to have a transmembrane 
domain at about amino acid position 496 - 512 of the amino acid sequence referenced in 
Table XIII for this gene. Moreover, a cytoplasmic tail encompassing amino acids 513 to 
639 of this protein has also been determined. Based upon these characteristics, it is
believed that the protein product of this gene shares structural features with type Ia
membrane proteins.

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of the following diseases and conditions which 
include, but are not limited to, disorders of the immune system, in particular the 
immunodiagnosis of acute leukemias. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful to provide immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the immune system, expression of this gene at
significantly higher or lower levels is detected in certain tissues or cell types (e.g., 
immune, cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healthy tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO:33
as residues: Pro-46 to Gly-52, Asn-76 to Val-82, Ser-85 to Phe-90, Gly-94 to Asn-100,
Gln-111 to Tyr-116, Pro-146 to Leu-155, Ser-188 to Asn-202, Ser-240 to Arg-246,
Gly-258 to Tyr-263, Ala-267 to Arg-276, Ser-326 to Arg-331, Ser-333 to Gln-339,
Pro-343 to Asp-348, Glu-426 to Asp-432, Pro-517 to His-533, Ala-550 to Pro-565,
Gly-569 to Gln-582, Pro-589 to Glu-606, Gly-616 to Ala-623, Met-625 to Ala-631.

CD33 monoclonal antibodies (MoAB) are important in the immunodiagnosis of
AML. CD33 MoABs have been used in preliminary therapeutic trials for purging bone
marrow of AML patients, either before transplantation or for diseases resistant to
chemotherapy. To prevent AML patients in remission from suffering relapse, or due to
the lack of an appropriate allogeneic bone marrow donor, a method is necessary for
purging leukemia cells from the autografts of patients with advanced AML. The
invention provides such a method, in which bone marrow is obtained from an AML
patient by, for example, percutaneous aspiration from the posterior iliac crest; bone
marrow mononuclear cells are isolated by Ficoll-Hypaque density gradient
centrifugation and incubated with an anti-CD33-like protein MoAB, for example, 3-5
times for 15-30 min. at 4-6 degrees C, followed by incubation with rabbit complement at
about 37 degrees C for 30 minutes. The patient is then subjected to myeloablative
chemotherapy, followed by reinfusion of the treated autologous bone marrow according
to standard techniques. By the invention, clonogenic tumor cells are depleted from the
bone marrow while sparing hematopoietic cells necessary for engraftment. In a further
embodiment, the invention provides an in vivo method for selectively killing or inhibiting
growth of tumor cells expressing CD33-like protein antigen of the present invention. The
method involves administering to the patient an effective amount of an antagonist to
inhibit the CD33-like protein receptor signaling pathway. By the invention,
administering such an antagonist of the CD33-like protein to a patient may also be useful
for treating inflammatory diseases including arthritis and colitis. Antagonists for use in
the present invention include polyclonal and monoclonal antibodies raised against the
CD33-like protein or a fragment thereof, antisense molecules which control gene
expression through antisense DNA or RNA or through triple-helix formation, proteins or
other compounds which bind the CD33-like protein domains, or soluble forms of the
CD33-like protein, such as protein fragments including the extracellular region from the
full length receptor, which antagonize CD33-like protein mediated signaling by
competing with the cell surface CD33-like protein for binding to CD33 receptor ligands.

Many polynucleotide sequences, such as EST sequences, are publicly available
and accessible through sequence databases. Some of these sequences are related to SEQ
ID NO:15 and may have been publicly available prior to conception of the present
invention. Preferably, such related polynucleotides are specifically excluded from the
scope of the present invention. To list every related sequence is cumbersome.
Accordingly, preferably excluded from the present invention are one or more
polynucleotides comprising a nucleotide sequence described by the general formula of
a-b, where a is any integer between 1 and 2281 of SEQ ID NO:15, b is an integer of 15 to
2295, where both a and b correspond to the positions of nucleotide residues shown in
SEQ ID NO:15, and where b is greater than or equal to a + 14.
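
The general formula a-b above defines a family of subsequences by three constraints: a lies between 1 and 2281, b lies between 15 and 2295, and b is at least a + 14, so that every fragment is at least 15 nucleotides long. By way of illustration only, these constraints can be checked and counted as sketched below. The function names are hypothetical, the bounds are treated as inclusive, and the total length of 2295 is taken from the stated upper bound on b; these are assumptions of the sketch rather than statements of the specification.

```python
SEQ_LENGTH = 2295  # assumed total length, taken from the upper bound on b
MIN_OFFSET = 14    # b >= a + 14, i.e. fragments of at least 15 nucleotides


def is_excluded_fragment(a, b, seq_length=SEQ_LENGTH):
    """Return True if positions a..b (1-based, inclusive) satisfy the formula."""
    return (
        1 <= a <= seq_length - MIN_OFFSET      # a between 1 and 2281
        and MIN_OFFSET + 1 <= b <= seq_length  # b between 15 and 2295
        and b >= a + MIN_OFFSET
    )


def count_excluded_fragments(seq_length=SEQ_LENGTH):
    """Count every (a, b) pair permitted by the formula."""
    return sum(
        seq_length - (a + MIN_OFFSET) + 1
        for a in range(1, seq_length - MIN_OFFSET + 1)
    )


if __name__ == "__main__":
    print(is_excluded_fragment(1, 15))    # shortest fragment at the 5' end
    print(is_excluded_fragment(100, 105)) # too short to satisfy b >= a + 14
    print(count_excluded_fragments())
```

The count illustrates why the specification describes listing every related sequence as cumbersome.
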
FEATURES OF PROTEIN ENCODED BY GENE NO: 6


This invention relates to newly identified polynucleotides, polypeptides encoded
by such polynucleotides, the use of such polynucleotides and polypeptides, as well as the
production of such polynucleotides and polypeptides. The polypeptide of the present
invention has been putatively identified as a CD33 homolog derived from a human
primary dendritic cell cDNA library. More particularly, the polypeptide of the present
invention has been putatively identified as a human siglec homolog, sometimes hereafter
referred to as "CD33-like 3" and/or "siglec 7". The invention also relates to inhibiting
the action of such polypeptides.

The siglecs (sialic acid binding Ig-like lectins) are type 1 membrane proteins that
constitute a distinct subset of the Ig superfamily, characterised by their sequence
similarities and abilities to bind sialic acids in glycoproteins and glycolipids (Crocker,
et al., Glycobiology:8 (1998)). Members of the Ig superfamily of proteins are
defined as molecules that share domains of sequence similarity with the variable or
constant domains of antibodies.

Many Ig superfamily proteins consist of multiple tandem Ig-like domains
connected to other domains, such as Fn-III repeat domains (Vaughn, D.E., and P.J.
Bjorkman, Neuron, 16:261-73 (1996)). At the primary structural level, traditional Ig-like
domains can be identified by the presence of two cysteine residues separated by
approximately 55-75 amino acid residues, and an "invariant" tryptophan residue located
10-15 residues C-terminal to the first of the two conserved cysteine residues. The two
conserved cysteine residues are thought to be involved in disulfide bonding to form the
folded Ig structures (Vaughn, D.E., (1996)).
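
By way of illustration only, the primary-structure criteria just described (two cysteines roughly 55-75 residues apart, with a tryptophan 10-15 residues C-terminal to the first cysteine) can be expressed as a simple scan over a one-letter amino acid sequence. The sketch below is a rough heuristic under stated assumptions: the function name is hypothetical, "separated by" is interpreted as the difference between the two cysteine positions, and the test sequence is an artificial placeholder rather than any sequence of the invention. It is not a substitute for profile-based domain prediction.

```python
def find_ig_like_candidates(seq, min_gap=55, max_gap=75, w_min=10, w_max=15):
    """Return (cys1, trp, cys2) 1-based positions matching the heuristic.

    cys2 - cys1 must lie in [min_gap, max_gap], and a tryptophan must lie
    w_min..w_max residues C-terminal to cys1.
    """
    hits = []
    last = len(seq) - 1
    for i, residue in enumerate(seq):
        if residue != "C":
            continue
        # Look for the "invariant" tryptophan downstream of the first cysteine.
        trp_positions = [
            k for k in range(i + w_min, min(i + w_max, last) + 1) if seq[k] == "W"
        ]
        if not trp_positions:
            continue
        # Look for a second cysteine at the expected spacing.
        for j in range(i + min_gap, min(i + max_gap, last) + 1):
            if seq[j] == "C":
                hits.append((i + 1, trp_positions[0] + 1, j + 1))
    return hits


if __name__ == "__main__":
    # Artificial placeholder: Cys at 6, Trp at 18, Cys at 66.
    toy = "A" * 5 + "C" + "A" * 11 + "W" + "A" * 47 + "C" + "A" * 5
    print(find_ig_like_candidates(toy))
```
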

Ig-like domains further share a common folding pattern, that of a sandwich or
fold structure of two β-sheets consisting of antiparallel β-strands containing 5-10 amino
acids (Huang, Z., et al., Biopolymers, 43:367-82 (1997)). Ig-like domains are divided,
based upon sequence and structural similarities, into four classifications known as C1,
C2, I and V-like domains.

The functional determinants of the Ig-like domains are presented on the faces of
β-sheets or the loop regions of the Ig-fold. Accordingly, protein-protein interactions can
occur either between the faces of the β-sheets, or the loop regions of the Ig-fold (Huang,
Z., (1998)). These Ig-like domains are involved in mediating a diversity of biological 
functions such as intermolecular binding and protein-protein homophilic or heterophilic 
interactions. 

Thus, Ig-like domains play an integral role in facilitating the activities of proteins
of the Ig superfamily. In mammals, the siglec group currently comprises
sialoadhesin/siglec-1, CD22/siglec-2, CD33/siglec-3, myelin associated glycoprotein
(MAG/siglec-4), and siglecs-5, -6 and -7 (Crocker, P.R., et al., EMBO J., 13:4490-503
(1994); Sgroi, D., et al., J. Biol. Chem., 268:7011-18 (1993); Freeman, D.S., et al.,
Blood, 85:2005-12 (1995); Kelm, S., et al., Curr. Biol., 4:965-72 (1994); Cornish, A.L.,
et al., Blood, 92:2123-132 (1998); Patel, N., et al., J. Biol. Chem., 274:22729-738
(1999); Nicol, G., et al., J. Biol. Chem., In Press (1999)). Siglec-7 has also recently
been characterised independently as the NK receptor p75/AIRM1 (Falco, M., et al.,
J. Exp. Med., 190:793-802 (1999)). In addition, the gene encoding another siglec-like
sequence, OBBP-like protein, has been reported, but there is no information on its
binding activity (Yousef, G.M., et al., Biochem. Biophys. Res. Commun., In Press
(1999)).

Each of these proteins has an extracellular region made up of a membrane-distal
V-set domain followed by varying numbers of C2 set domains which range from 16 in
sialoadhesin to 1 in CD33. In the cases of sialoadhesin, CD22, MAG and CD33, the
sialic acid binding site has been mapped to the V-set domain and for sialoadhesin it has
been further characterised at the molecular level by X-ray crystallography (Nath, D.,
et al., J. Biol. Chem., 270:26184-91 (1995); van der Merwe, P.A., et al., J. Biol. Chem.,
271:9273-80 (1996); Tang, S., et al., J. Cell Biol., 138:1355-66 (1997); Taylor, V.C., et
al., J. Biol. Chem., 274:11505-12 (1999); May, A.P., et al., Molecular Cell, 1:719-28
(1998)).

Apart from MAG and SMP that are found exclusively in the nervous system, all 
siglecs characterised to date are expressed on discrete subsets of hemopoietic cells and 
can provide useful lineage-restricted markers. Thus, CD22 is present only on mature B 
cells, sialoadhesin is on macrophage subsets, CD33 is a marker of early committed 
myeloid progenitor cells, siglec-5 is expressed by monocytes and mature neutrophils, 
siglec-6 is on B cells and siglec-7 is expressed by NK cells and monocytes (Dorken, B., 
et al., J. Immunology, 136:4470-79 (1986); Crocker, P.R., et al., J. Exp. Med., 164:1862- 
75 (1986); Peiper, S.C., et al., In Leukocyte Typing IV. Oxford University Press, Oxford. 
814-16 (1989); Cornish, A.L., et al., Blood, 92:2123-132 (1998); Patel, N., et al., J Biol.
Chem, 274:22729-738 (1999); Nicol, G., et al., J Biol. Chem, In Press (1999)). These 
expression patterns indicate discrete functions amongst hemopoietic cell subsets, but 
apart from CD22, a well-characterised negative regulator of B cell activation (reviewed
in Cyster, J.G. and C.C. Goodnow, Immunity, 6:509-17 (1997)), the biological functions 
of siglecs expressed in the hemopoietic system are unknown. Proposed functions include 
cell-cell interactions through recognition of sialylated glycoconjugates on other cells. 
However, a number of studies have also shown that cell-cell adhesion mediated by 
siglecs can be modulated by cis-interactions with sialic acids present in the host plasma 
membrane. This is particularly striking for CD22, CD33 and siglec-5, whose binding 
activities can be greatly increased if host cells are pretreated with sialidase to remove the 
cis-competing sialic acids (Freeman, D.S., et al., Blood, 85:2005-12 (1995); Cornish, 
A.L., et al., Blood, 92:2123-132 (1998); Sgroi, D., et al., P.N.A.S., 92:4026-30 (1995)). 

Besides potential roles in cellular interactions, there is growing evidence that, 
similar to CD22, the more recently characterised siglecs are involved in signalling 
functions. The cytoplasmic tails of CD33 and siglecs-5, -6 and -7 have two
well-conserved tyrosine-based motifs that are similar to well-characterised signaling
motifs in other leukocyte receptors (Gergely, J., et al., Immun. Lett., 68:3-15 (1999)).
Where studied, both tyrosine residues can be phosphorylated by src-like kinase(s) and,
in the case of the membrane proximal tyrosine, this leads to subsequent recruitment of
the tyrosine phosphatases, SHP-1 and SHP-2 (Falco, M., et al., J. Exp. Med., 190:793-802
(1999); Taylor, V.C., et al., J. Biol. Chem., 274:11505-12 (1999)).

Thus there exists a clear need for identifying and exploiting novel members of the siglec 
family of immunoglobulin proteins. Although structurally related, such proteins may 
possess diverse and multifaceted functions in a variety of cell and tissue types. The 
inventive purified siglec proteins are research tools useful for the identification, 
characterization and purification of cell signaling molecules. Furthermore, the 
identification of new siglecs permits the development of a range of derivatives, agonists 
and antagonists at the nucleic acid and protein levels which in turn have applications in 
the treatment and diagnosis of a range of conditions such as cancer, inflammation, 
neurological disorders and immunological disorders, amongst many other conditions. 
The polypeptide of the present invention has been putatively identified as a member of 
the siglec family and has been termed CD33-like 3. This identification has been made as 
a result of amino acid sequence homology to the human CD33L1 (See Genbank Accession 
No. gi|2913995). 

Figures 16A-B show the nucleotide (SEQ ID NO:16) and deduced amino acid 
sequence (SEQ ID NO:34) of CD33-like 3. Predicted amino acids from about 1 to about 
18 constitute the predicted signal peptide (amino acid residues from about 1 to about 18 
in SEQ ID NO:34) and are represented by the underlined amino acid regions; and amino 
acids from about 360 to about 376 constitute the predicted transmembrane domain 
(amino acids from about 360 to about 376 in SEQ ID NO:34) and are represented by the 
double underlined amino acids. 

Figure 17 shows the regions of similarity between the amino acid sequences of 
the CD33-like 3 protein (SEQ ID NO:34) and the human CD33L1 protein (SEQ ID 
NO:99). 

Figure 18 shows an analysis of the CD33-like 3 amino acid sequence. Alpha, 
beta, turn and coil regions; hydrophilicity and hydrophobicity; amphipathic regions; 
flexible regions; antigenic index and surface probability are shown. 

A polynucleotide encoding a polypeptide of the present invention is obtained 
from human NK cells, T-cells, primary dendritic cells, placenta, spleen, primary breast 
cancer, gall bladder, apoptotic T-cells, macrophage, and chronic lymphocytic leukemia 
spleen. The polynucleotide of this invention was discovered in a human primary 
dendritic cell cDNA library. 

As shown in Figures 16A-B, CD33-like 3 has a transmembrane domain (the 
transmembrane domain comprises amino acids from about 360 to about 376 of SEQ ID 
NO:34, which correspond to amino acids from about 360 to about 376 of Figures 16A-B). 
The polynucleotide contains an open reading frame encoding the CD33-like 3 
polypeptide of 467 amino acids. CD33-like 3 exhibits a high degree of homology at the 
amino acid level to the human CD33L1 (as shown in Figure 17). 

The present invention provides isolated nucleic acid molecules comprising a 
polynucleotide encoding the CD33-like 3 polypeptide having the amino acid sequence 
shown in Figures 16A-B (SEQ ID NO:34). The nucleotide sequence shown in Figures 
16A-B (SEQ ID NO: 16) was obtained by sequencing a cloned cDNA (HDPUW68), 
which was deposited on November 17 at the American Type Culture Collection, and 
given Accession Number 203484. 

The present invention is further directed to fragments of the isolated nucleic acid 
molecules described herein. By a fragment of an isolated DNA molecule having the 
nucleotide sequence of the deposited cDNA or the nucleotide sequence shown in SEQ ID 
NO: 16 is intended DNA fragments at least about 15 nt, and more preferably at least about 
20 nt, still more preferably at least about 30 nt, and even more preferably, at least about 
40 nt in length which are useful as diagnostic probes and primers as discussed herein. Of 
course, larger fragments 50-1500 nt in length are also useful according to the present 
invention, as are fragments corresponding to most, if not all, of the nucleotide sequence 
of the deposited cDNA or as shown in SEQ ID NO: 16. By a fragment at least 20 nt in 
length, for example, is intended fragments which include 20 or more contiguous bases 
from the nucleotide sequence of the deposited cDNA or the nucleotide sequence as 
shown in SEQ ID NO: 16. In this context "about" includes the particularly recited size, 
larger or smaller by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at both 
termini. Representative examples of CD33-like 3 polynucleotide fragments of the 
invention include, for example, fragments that comprise, or alternatively, consist of, a 
sequence from about nucleotide 1 to about 50, from about 51 to about 100, from about 
101 to about 150, from about 151 to about 200, from about 201 to about 250, from about 
251 to about 300, from about 301 to about 350, from about 351 to about 400, from about 
401 to about 450, from about 451 to about 500, from about 501 to about 550, from about 
551 to about 600, from about 601 to about 650, from about 651 to about 700, from about 
701 to about 750, from about 751 to about 800, from about 801 to about 850, from about 
851 to about 900, from about 901 to about 950, from about 951 to about 1000, from 
about 1001 to about 1050, from about 1051 to about 1100, from about 1101 to about 
1150, from about 1151 to about 1200, from about 1201 to about 1250, from about 1251 
to about 1300, from about 1301 to about 1350, from about 1351 to about 1400, from 
about 1401 to about 1450, from about 1451 to about 1500, from about 1501 to about 
1550, from about 1551 to about 1600, from about 1601 to about 1650, from about 1651 
to about 1700, from about 1701 to about 1748 of SEQ ID NO:16, or the complementary 
strand thereto, or the cDNA contained in the deposited gene. In this context "about" 
includes the particularly recited ranges, larger or smaller by several (5, 4, 3, 2, or 1) 
nucleotides, at either terminus or at both termini. 
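
The representative fragment ranges recited above follow a regular pattern: consecutive 50-nucleotide windows across the 1748 nucleotides of SEQ ID NO:16, with "about" allowing each terminus to move by up to 5 nucleotides. As an illustrative sketch only (the function names, the window-size parameter and the clamping of ranges to the ends of the sequence are assumptions made for this example and are not part of the disclosure), the ranges can be enumerated in Python as follows:

```python
from typing import Iterator, List, Tuple

SEQ_LENGTH = 1748  # length of SEQ ID NO:16 implied by the last recited range
WINDOW = 50        # width of each recited range
ABOUT = 5          # "about": up to 5 nucleotides at either terminus


def fragment_windows(length: int = SEQ_LENGTH,
                     window: int = WINDOW) -> List[Tuple[int, int]]:
    """Return 1-based, inclusive (start, end) ranges: 1-50, 51-100, ..."""
    return [(s, min(s + window - 1, length))
            for s in range(1, length + 1, window)]


def about_variants(start: int, end: int,
                   length: int = SEQ_LENGTH,
                   slack: int = ABOUT) -> Iterator[Tuple[int, int]]:
    """Yield every range whose termini lie within `slack` of (start, end).

    Ranges are clamped to 1..length (an assumption of this sketch).
    """
    for s in range(max(1, start - slack), min(length, start + slack) + 1):
        for e in range(max(1, end - slack), min(length, end + slack) + 1):
            if s <= e:
                yield (s, e)


windows = fragment_windows()
print(windows[0], windows[-1], len(windows))
```

Here `fragment_windows()` reproduces the recited series from 1-50 through 1701-1748, and `about_variants(51, 100)` lists the ranges covered when "about" is applied to a single recited fragment.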

SUBSTITUTE SHEET (RULE 26) 


WO 00/29435 PCT/US99/25031 

90 

Preferred nucleic acid fragments of the present invention include nucleic acid 
molecules encoding a member selected from the group: a polypeptide comprising, or 
alternatively consisting of, the transmembrane domain (amino acid residues from about 
360 to about 376 in Figures 16A-B; amino acids from about 360 to about 376 in SEQ ID 
NO:34). Since the locations of these domains have been predicted by computer analysis, 
one of ordinary skill would appreciate that the amino acid residues constituting these 
domains may vary slightly (e.g., by about 1 to 15 amino acid residues) depending on the 
criteria used to define each domain. 

In additional embodiments, the polynucleotides of the invention encode functional 
attributes of CD33-like 3. 

Preferred embodiments of the invention in this regard include fragments that 
comprise alpha-helix and alpha-helix forming regions ("alpha-regions"), beta-sheet and 
beta-sheet forming regions ("beta-regions"), turn and turn-forming regions ("turn- 
regions"), coil and coil-forming regions ("coil-regions"), hydrophilic regions, 
hydrophobic regions, alpha amphipathic regions, beta amphipathic regions, flexible 
regions, surface-forming regions and high antigenic index regions of CD33-like 3. The 
data representing the structural or functional attributes of CD33-like 3 set forth in Figure 
18 and/or Table VI, as described above, was generated using the various modules and 
algorithms of the DNA*STAR set on default parameters. In a preferred embodiment, the 
data presented in columns VIII, IX, XIII, and XIV of Table VI can be used to determine 
regions of CD33-like 3 which exhibit a high degree of potential for antigenicity. Regions 
of high antigenicity are determined from the data presented in columns VIII, IX, XIII, 
and/or XIV by choosing values which represent regions of the polypeptide which are 
likely to be exposed on the surface of the polypeptide in an environment in which antigen 
recognition may occur in the process of initiation of an immune response. 

Certain preferred regions in these regards are set out in Figure 18, but may, as 
shown in Table VI, be represented or identified by using tabular representations of the 
data presented in Figure 18. The DNA*STAR computer algorithm used to generate 
Figure 18 (set on the original default parameters) was used to present the data in Figure 
18 in a tabular format (See Table VI). The tabular format of the data in Figure 18 is used 
to easily determine specific boundaries of a preferred region. The above-mentioned 
preferred regions set out in Figure 18 and in Table VI include, but are not limited to, 
regions of the aforementioned types identified by analysis of the amino acid sequence set 
out in Figures 16A-B. As set out in Figure 18 and in Table VI, such preferred regions 
include Garnier-Robson alpha-regions, beta-regions, turn-regions, and coil-regions, 
Chou-Fasman alpha-regions, beta-regions, and turn-regions, Kyte-Doolittle hydrophilic 
regions and Hopp-Woods hydrophobic regions, Eisenberg alpha- and beta-amphipathic 
regions, Karplus-Schulz flexible regions, Jameson-Wolf regions of high antigenic index 
and Emini surface-forming regions. 

Even if deletion of one or more amino acids from the 
N-terminus of a protein results in modification or loss of one or more biological 
functions of the protein, other functional activities (e.g., biological activities, ability to 
multimerize, modulate cellular interaction, or signalling pathways, etc.) may still be 
retained. For example, the ability of shortened CD33-like 3 muteins to induce and/or 
bind to antibodies which recognize the complete or mature forms of the polypeptides 
generally will be retained when less than the majority of the residues of the complete or 
mature polypeptide are removed from the N-terminus. Whether a particular polypeptide 
lacking N-terminal residues of a complete polypeptide retains such immunologic 
activities can readily be determined by routine methods described herein and otherwise 
known in the art. It is not unlikely that a CD33-like 3 mutein with a large number of 
deleted N-terminal amino acid residues may retain some biological or immunogenic 
activities. In fact, peptides composed of as few as six CD33-like 3 amino acid residues 
may often evoke an immune response. 

Accordingly, the present invention further provides polypeptides having one or 
more residues deleted from the amino terminus of the CD33-like 3 amino acid sequence 
shown in Figures 16A-B, up to the glutamic acid residue at position number 462, and 
polynucleotides encoding such polypeptides. In particular, the present invention 
provides polypeptides comprising the amino acid sequence of residues n1-467 of Figures 
16A-B, where n1 is an integer from 2 to 462 corresponding to the position of the amino 
acid residue in Figures 16A-B (which is identical to the sequence shown as SEQ ID 
NO:34). In another embodiment, N-terminal deletions of the CD33-like 3 polypeptide 
can be described by the general formula n2-467, where n2 is a number from 2 to 462, 
corresponding to the position of amino acid identified in Figures 16A-B. N-terminal 
deletions of the CD33-like 3 polypeptide of the invention shown as SEQ ID NO:34 
include polypeptides comprising the amino acid sequence of residues: L-2 to K-467; 
L-3 to K-467; L-4 to K-467; L-5 to K-467; L-6 to K-467; L-7 to K-467; P-8 to K-467; 
L-9 to K-467; L-10 to K-467; W-11 to K-467; G-12 to K-467; R-13 to K-467; 
E-14 to K-467; R-15 to K-467; V-16 to K-467; E-17 to K-467; G-18 to K-467; 
Q-19 to K-467; K-20 to K-467; S-21 to K-467; N-22 to K-467; R-23 to K-467; 
K-24 to K-467; D-25 to K-467; 
Y-26 to K-467; S-27 to K-467; L-28 to K-467; T-29 to K-467; M-30 to K-467; 
Q-31 to K-467; S-32 to K-467; S-33 to K-467; V-34 to K-467; T-35 to K-467; 
V-36 to K-467; Q-37 to K-467; E-38 to K-467; G-39 to K-467; M-40 to K-467; 
C-41 to K-467; V-42 to K-467; H-43 to K-467; V-44 to K-467; R-45 to K-467; 
C-46 to K-467; S-47 to K-467; F-48 to K-467; S-49 to K-467; Y-50 to K-467; 
P-51 to K-467; V-52 to K-467; D-53 to K-467; S-54 to K-467; Q-55 to K-467; 
T-56 to K-467; D-57 to K-467; S-58 to K-467; D-59 to K-467; P-60 to K-467; 
V-61 to K-467; H-62 to K-467; G-63 to K-467; Y-64 to K-467; W-65 to K-467; 
F-66 to K-467; R-67 to K-467; A-68 to K-467; G-69 to K-467; N-70 to K-467; 
D-71 to K-467; I-72 to K-467; S-73 to K-467; W-74 to K-467; K-75 to K-467; 
A-76 to K-467; P-77 to K-467; V-78 to K-467; A-79 to K-467; T-80 to K-467; 
N-81 to K-467; N-82 to K-467; P-83 to K-467; A-84 to K-467; W-85 to K-467; 
A-86 to K-467; V-87 to K-467; Q-88 to K-467; E-89 to K-467; E-90 to K-467; 
T-91 to K-467; R-92 to K-467; D-93 to K-467; R-94 to K-467; F-95 to K-467; 
H-96 to K-467; L-97 to K-467; L-98 to K-467; G-99 to K-467; D-100 to K-467; 
P-101 to K-467; Q-102 to K-467; T-103 to K-467; K-104 to K-467; N-105 to K-467; 
C-106 to K-467; T-107 to K-467; L-108 to K-467; S-109 to K-467; I-110 to K-467; 
R-111 to K-467; D-112 to K-467; A-113 to K-467; R-114 to K-467; M-115 to K-467; 
S-116 to K-467; D-117 to K-467; A-118 to K-467; G-119 to K-467; R-120 to K-467; 
Y-121 to K-467; F-122 to K-467; F-123 to K-467; 
R-124 to K-467; M-125 to K-467; E-126 to K-467; K-127 to K-467; G-128 to K-467; 
N-129 to K-467; I-130 to K-467; K-131 to K-467; W-132 to K-467; N-133 to K-467; 
Y-134 to K-467; K-135 to K-467; Y-136 to K-467; D-137 to K-467; Q-138 to K-467; 
L-139 to K-467; S-140 to K-467; V-141 to K-467; N-142 to K-467; V-143 to K-467; 
T-144 to K-467; A-145 to K-467; L-146 to K-467; T-147 to K-467; H-148 to K-467; 
R-149 to K-467; P-150 to K-467; N-151 to K-467; I-152 to K-467; L-153 to K-467; 
I-154 to K-467; P-155 to K-467; G-156 to K-467; T-157 to K-467; L-158 to K-467; 
E-159 to K-467; S-160 to K-467; G-161 to K-467; C-162 to K-467; F-163 to K-467; 
Q-164 to K-467; N-165 to K-467; L-166 to K-467; T-167 to K-467; C-168 to K-467; 
S-169 to K-467; V-170 to K-467; P-171 to K-467; W-172 to K-467; A-173 to K-467; 
C-174 to K-467; E-175 to K-467; Q-176 to K-467; G-177 to K-467; T-178 to K-467; 
P-179 to K-467; P-180 to K-467; M-181 to K-467; I-182 to K-467; S-183 to K-467; 
W-184 to K-467; M-185 to K-467; G-186 to K-467; T-187 to K-467; S-188 to K-467; 
V-189 to K-467; S-190 to K-467; P-191 to K-467; L-192 to K-467; H-193 to K-467; 
P-194 to K-467; S-195 to K-467; 
T-196 to K-467; T-197 to K-467; R-198 to K-467; S-199 to K-467; S-200 to K-467; 
V-201 to K-467; L-202 to K-467; T-203 to K-467; L-204 to K-467; I-205 to K-467; 
P-206 to K-467; Q-207 to K-467; P-208 to K-467; Q-209 to K-467; H-210 to K-467; 
H-211 to K-467; G-212 to K-467; T-213 to K-467; S-214 to K-467; L-215 to K-467; 
T-216 to K-467; C-217 to K-467; Q-218 to K-467; V-219 to K-467; T-220 to K-467; 
L-221 to K-467; P-222 to K-467; G-223 to K-467; A-224 to K-467; G-225 to K-467; 
V-226 to K-467; T-227 to K-467; T-228 to K-467; N-229 to K-467; R-230 to K-467; 
T-231 to K-467; I-232 to K-467; Q-233 to K-467; L-234 to K-467; N-235 to K-467; 
V-236 to K-467; S-237 to K-467; Y-238 to K-467; P-239 to K-467; P-240 to K-467; 
Q-241 to K-467; N-242 to K-467; L-243 to K-467; T-244 to K-467; V-245 to K-467; 
T-246 to K-467; V-247 to K-467; F-248 to K-467; Q-249 to K-467; G-250 to K-467; 
E-251 to K-467; G-252 to K-467; T-253 to K-467; A-254 to K-467; S-255 to K-467; 
T-256 to K-467; A-257 to K-467; L-258 to K-467; G-259 to K-467; N-260 to K-467; 
S-261 to K-467; S-262 to K-467; S-263 to K-467; L-264 to K-467; S-265 to K-467; 
V-266 to K-467; L-267 to K-467; 
E-268 to K-467; G-269 to K-467; Q-270 to K-467; S-271 to K-467; L-272 to K-467; 
R-273 to K-467; L-274 to K-467; V-275 to K-467; C-276 to K-467; A-277 to K-467; 
V-278 to K-467; D-279 to K-467; S-280 to K-467; N-281 to K-467; P-282 to K-467; 
P-283 to K-467; A-284 to K-467; R-285 to K-467; L-286 to K-467; S-287 to K-467; 
W-288 to K-467; T-289 to K-467; W-290 to K-467; R-291 to K-467; S-292 to K-467; 
L-293 to K-467; T-294 to K-467; L-295 to K-467; Y-296 to K-467; P-297 to K-467; 
S-298 to K-467; Q-299 to K-467; P-300 to K-467; S-301 to K-467; N-302 to K-467; 
P-303 to K-467; L-304 to K-467; V-305 to K-467; L-306 to K-467; E-307 to K-467; 
L-308 to K-467; Q-309 to K-467; V-310 to K-467; H-311 to K-467; L-312 to K-467; 
G-313 to K-467; D-314 to K-467; E-315 to K-467; G-316 to K-467; E-317 to K-467; 
F-318 to K-467; T-319 to K-467; C-320 to K-467; R-321 to K-467; A-322 to K-467; 
Q-323 to K-467; N-324 to K-467; S-325 to K-467; L-326 to K-467; G-327 to K-467; 
S-328 to K-467; Q-329 to K-467; 
H-330 to K-467; V-331 to K-467; S-332 to K-467; L-333 to K-467; N-334 to K-467; 
L-335 to K-467; S-336 to K-467; L-337 to K-467; Q-338 to K-467; Q-339 to K-467; 
E-340 to K-467; Y-341 to K-467; T-342 to K-467; G-343 to K-467; K-344 to K-467; 
M-345 to K-467; R-346 to K-467; P-347 to K-467; V-348 to K-467; S-349 to K-467; 
G-350 to K-467; V-351 to K-467; L-352 to K-467; L-353 to K-467; G-354 to K-467; 
A-355 to K-467; V-356 to K-467; G-357 to K-467; G-358 to K-467; A-359 to K-467; 
G-360 to K-467; A-361 to K-467; T-362 to K-467; A-363 to K-467; L-364 to K-467; 
V-365 to K-467; F-366 to K-467; L-367 to K-467; S-368 to K-467; F-369 to K-467; 
C-370 to K-467; 
V-371 to K-467; I-372 to K-467; F-373 to K-467; I-374 to K-467; V-375 to K-467; 
V-376 to K-467; R-377 to K-467; S-378 to K-467; C-379 to K-467; R-380 to K-467; 
K-381 to K-467; K-382 to K-467; S-383 to K-467; A-384 to K-467; R-385 to K-467; 
P-386 to K-467; A-387 to K-467; A-388 to K-467; D-389 to K-467; V-390 to K-467; 
G-391 to K-467; D-392 to K-467; I-393 to K-467; G-394 to K-467; M-395 to K-467; 
K-396 to K-467; D-397 to K-467; A-398 to K-467; N-399 to K-467; T-400 to K-467; 
I-401 to K-467; R-402 to K-467; G-403 to K-467; S-404 to K-467; A-405 to K-467; 
S-406 to K-467; Q-407 to K-467; G-408 to K-467; N-409 to K-467; L-410 to K-467; 
T-411 to K-467; E-412 to K-467; S-413 to K-467; W-414 to K-467; A-415 to K-467; 
D-416 to K-467; D-417 to K-467; N-418 to K-467; P-419 to K-467; R-420 to K-467; 
H-421 to K-467; H-422 to K-467; G-423 to K-467; E-424 to K-467; A-425 to K-467; 
A-426 to K-467; H-427 to K-467; S-428 to K-467; S-429 to K-467; G-430 to K-467; 
E-431 to K-467; E-432 to K-467; 
R-433 to K-467; E-434 to K-467; I-435 to K-467; Q-436 to K-467; Y-437 to K-467; 
A-438 to K-467; P-439 to K-467; L-440 to K-467; S-441 to K-467; F-442 to K-467; 
H-443 to K-467; K-444 to K-467; G-445 to K-467; E-446 to K-467; P-447 to K-467; 
Q-448 to K-467; D-449 to K-467; L-450 to K-467; S-451 to K-467; G-452 to K-467; 
Q-453 to K-467; E-454 to K-467; A-455 to K-467; T-456 to K-467; N-457 to K-467; 
N-458 to K-467; E-459 to K-467; Y-460 to K-467; S-461 to K-467; and E-462 to K-467 
of SEQ ID NO:34. Polypeptides encoded by these polynucleotides are also encompassed 
by the invention. 
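
The N-terminal deletion series above is fully determined by the formula n1-467 with n1 running from 2 to 462. A minimal Python sketch, in which the function name and the returned tuple layout are illustrative assumptions rather than anything defined in this specification, enumerates the same residue ranges and confirms that the shortest recited fragment retains six residues:

```python
FULL_LENGTH = 467  # residues in the CD33-like 3 polypeptide (SEQ ID NO:34)


def n_terminal_deletions(first_n=2, last_n=462, length=FULL_LENGTH):
    """Return (n, length, retained) for each fragment n-467.

    Positions are 1-based and inclusive, as in the text; `retained` is the
    number of residues left after removing residues 1 to n-1.
    """
    return [(n, length, length - n + 1) for n in range(first_n, last_n + 1)]


deletions = n_terminal_deletions()
print(len(deletions))   # 461 recited fragments
print(deletions[0])     # (2, 467, 466): only residue 1 removed
print(deletions[-1])    # (462, 467, 6): six residues retained
```

The six-residue lower bound matches the statement above that peptides of as few as six CD33-like 3 residues may evoke an immune response.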

Also as mentioned above, even if deletion of one or more amino acids from the 
C-terminus of a protein results in modification or loss of one or more biological 
functions of the protein, other functional activities may still be retained. For example, the 
ability of the shortened CD33-like 3 mutein to induce and/or bind to antibodies which 
recognize the complete or mature forms of the polypeptide generally will be retained 
when less than the majority of the residues of the complete or mature polypeptide are 
removed from the C-terminus. Whether a particular polypeptide lacking C-terminal 
residues of a complete polypeptide retains such immunologic activities can readily be 
determined by routine methods described herein and otherwise known in the art. It is not 
unlikely that a CD33-like 3 mutein with a large number of deleted C-terminal amino acid 
residues may retain some biological or immunogenic activities. In fact, peptides 
composed of as few as six CD33-like 3 amino acid residues may often evoke an immune 
response. 

Accordingly, the present invention further provides polypeptides having one or 
more residues deleted from the carboxy terminus of the amino acid sequence of the 
CD33-like 3 polypeptide shown in Figures 16A-B, up to the leucine residue at position 
number 6, and polynucleotides encoding such polypeptides. In particular, the present 
invention provides polypeptides comprising the amino acid sequence of residues 1-m1 of 
Figures 16A-B, where m1 is an integer from 6 to 467 corresponding to the position of the 
amino acid residue in Figures 16A-B. 

Moreover, the invention provides polynucleotides encoding polypeptides comprising, or 
alternatively consisting of, C-terminal deletions of the CD33-like 3 polypeptide of the 
invention shown as SEQ ID NO:34, including polypeptides comprising the amino acid 
sequence of residues: M-1 to P-466; M-1 to I-465; M-1 to K-464; M-1 to I-463; 
M-1 to E-462; M-1 to S-461; M-1 to Y-460; M-1 to E-459; M-1 to N-458; M-1 to N-457; 
M-1 to T-456; M-1 to A-455; M-1 to E-454; M-1 to Q-453; M-1 to G-452; M-1 to S-451; 
M-1 to L-450; M-1 to D-449; M-1 to Q-448; M-1 to P-447; M-1 to E-446; M-1 to G-445; 
M-1 to K-444; M-1 to H-443; M-1 to F-442; M-1 to S-441; M-1 to L-440; M-1 to P-439; 
M-1 to A-438; M-1 to Y-437; M-1 to Q-436; M-1 to I-435; M-1 to E-434; M-1 to R-433; 
M-1 to E-432; M-1 to E-431; M-1 to G-430; M-1 to S-429; M-1 to S-428; M-1 to H-427; 
M-1 to A-426; M-1 to A-425; M-1 to L-424; M-1 to G-423; M-1 to H-422; M-1 to H-421; 
M-1 to R-420; M-1 to P-419; M-1 to N-418; M-1 to D-417; M-1 to D-416; M-1 to A-415; 
M-1 to W-414; M-1 to S-413; M-1 to E-412; M-1 to T-411; M-1 to L-410; M-1 to N-409; 
M-1 to G-408; M-1 to Q-407; M-1 to S-406; M-1 to A-405; M-1 to S-404; M-1 to G-403; 
M-1 to R-402; M-1 to I-401; M-1 to T-400; M-1 to N-399; M-1 to A-398; M-1 to D-397; 
M-1 to K-396; M-1 to M-395; M-1 to G-394; M-1 to I-393; M-1 to D-392; M-1 to G-391; 
M-1 to V-390; M-1 to D-389; M-1 to A-388; M-1 to A-387; M-1 to P-386; M-1 to R-385; 
M-1 to A-384; M-1 to S-383; M-1 to K-382; M-1 to K-381; 
M-1 to R-380; M-1 to C-379; M-1 to S-378; M-1 to R-377; M-1 to V-376; M-1 to V-375; 
M-1 to I-374; M-1 to F-373; M-1 to I-372; M-1 to V-371; M-1 to C-370; M-1 to F-369; 
M-1 to S-368; M-1 to L-367; M-1 to F-366; M-1 to V-365; M-1 to L-364; M-1 to A-363; 
M-1 to T-362; M-1 to A-361; M-1 to G-360; M-1 to A-359; M-1 to G-358; M-1 to G-357; 
M-1 to V-356; M-1 to A-355; M-1 to G-354; M-1 to L-353; M-1 to L-352; M-1 to V-351; 
M-1 to G-350; M-1 to S-349; M-1 to V-348; M-1 to P-347; M-1 to R-346; M-1 to M-345; 
M-1 to K-344; M-1 to G-343; M-1 to T-342; M-1 to Y-341; M-1 to E-340; M-1 to Q-339; 
M-1 to Q-338; M-1 to L-337; M-1 to S-336; M-1 to L-335; M-1 to N-334; 
M-1 to L-333; M-1 to S-332; M-1 to V-331; M-1 to H-330; M-1 to Q-329; M-1 to S-328; 
M-1 to G-327; M-1 to L-326; M-1 to S-325; M-1 to N-324; M-1 to Q-323; M-1 to A-322; 
M-1 to R-321; M-1 to C-320; M-1 to T-319; M-1 to F-318; M-1 to E-317; M-1 to G-316; 
M-1 to E-315; M-1 to D-314; M-1 to G-313; M-1 to L-312; M-1 to H-311; M-1 to V-310; 
M-1 to Q-309; M-1 to L-308; M-1 to E-307; M-1 to L-306; M-1 to V-305; M-1 to L-304; 
M-1 to P-303; M-1 to N-302; M-1 to S-301; M-1 to P-300; M-1 to Q-299; M-1 to S-298; 
M-1 to P-297; M-1 to Y-296; M-1 to L-295; M-1 to T-294; M-1 to L-293; M-1 to S-292; 
M-1 to R-291; M-1 to W-290; M-1 to T-289; M-1 to W-288; M-1 to S-287; M-1 to L-286; 
M-1 to R-285; M-1 to A-284; M-1 to P-283; M-1 to P-282; M-1 to N-281; M-1 to S-280; 
M-1 to D-279; M-1 to V-278; M-1 to A-277; M-1 to C-276; M-1 to V-275; M-1 to L-274; 
M-1 to R-273; M-1 to L-272; M-1 to S-271; M-1 to Q-270; M-1 to G-269; M-1 to E-268; 
M-1 to L-267; M-1 to V-266; M-1 to S-265; M-1 to L-264; M-1 to S-263; M-1 to S-262; 
M-1 to S-261; M-1 to N-260; M-1 to G-259; M-1 to L-258; M-1 to A-257; M-1 to T-256; 
M-1 to S-255; M-1 to A-254; M-1 to T-253; M-1 to G-252; M-1 to E-251; M-1 to G-250; 
M-1 to Q-249; M-1 to F-248; M-1 to V-247; M-1 to T-246; M-1 to V-245; M-1 to T-244; 
M-1 to L-243; M-1 to N-242; M-1 to Q-241; M-1 to P-240; M-1 to P-239; M-1 to Y-238; 
M-1 to S-237; M-1 to V-236; M-1 to N-235; M-1 to L-234; M-1 to Q-233; M-1 to I-232; 
M-1 to T-231; M-1 to R-230; M-1 to N-229; M-1 to T-228; M-1 to T-227; M-1 to V-226; 
M-1 to G-225; M-1 to A-224; M-1 to G-223; M-1 to P-222; M-1 to L-221; M-1 to T-220; 
M-1 to V-219; M-1 to Q-218; M-1 to C-217; M-1 to T-216; M-1 to L-215; M-1 to S-214; 
M-1 to T-213; M-1 to G-212; M-1 to H-211; M-1 to H-210; M-1 to Q-209; M-1 to P-208; 
M-1 to Q-207; M-1 to P-206; M-1 to I-205; M-1 to L-204; M-1 to T-203; M-1 to L-202; 
M-1 to V-201; M-1 to S-200; M-1 to S-199; M-1 to R-198; M-1 to T-197; M-1 to T-196; 
M-1 to S-195; M-1 to P-194; M-1 to H-193; M-1 to L-192; M-1 to P-191; M-1 to S-190; 
M-1 to V-189; M-1 to S-188; M-1 to T-187; M-1 to G-186; M-1 to M-185; M-1 to W-184; 
M-1 to S-183; M-1 to I-182; M-1 to M-181; M-1 to P-180; M-1 to P-179; M-1 to T-178; 
M-1 to G-177; M-1 to Q-176; M-1 to E-175; M-1 to C-174; M-1 to A-173; M-1 to W-172; 
M-1 to P-171; M-1 to V-170; M-1 to S-169; M-1 to C-168; M-1 to T-167; 
M-1 to L-166; M-1 to N-165; M-1 to Q-164; M-1 to F-163; M-1 to C-162; M-1 to G-161; 
M-1 to S-160; M-1 to E-159; M-1 to L-158; M-1 to T-157; M-1 to G-156; M-1 to P-155; 
M-1 to I-154; M-1 to L-153; M-1 to I-152; M-1 to N-151; M-1 to P-150; M-1 to R-149; 
M-1 to H-148; M-1 to T-147; M-1 to L-146; M-1 to A-145; M-1 to T-144; M-1 to V-143; 
M-1 to N-142; M-1 to V-141; M-1 to S-140; M-1 to L-139; M-1 to Q-138; M-1 to D-137; 
M-1 to Y-136; M-1 to K-135; M-1 to Y-134; M-1 to N-133; M-1 to W-132; M-1 to K-131; 
M-1 to I-130; M-1 to N-129; M-1 to G-128; M-1 to K-127; M-1 to E-126; M-1 to M-125; 
M-1 to R-124; M-1 to F-123; M-1 to F-122; M-1 to Y-121; M-1 to R-120; M-1 to G-119; 
M-1 to A-118; M-1 to D-117; M-1 to S-116; M-1 to M-115; M-1 to R-114; M-1 to A-113; 
M-1 to D-112; M-1 to R-111; M-1 to I-110; M-1 to S-109; M-1 to L-108; M-1 to T-107; 
M-1 to C-106; M-1 to N-105; M-1 to K-104; M-1 to T-103; M-1 to Q-102; 
M-1 to P-101; M-1 to D-100; M-1 to G-99; M-1 to L-98; M-1 to L-97; M-1 to H-96; 
M-1 to F-95; M-1 to R-94; M-1 to D-93; M-1 to R-92; M-1 to T-91; M-1 to E-90; 
M-1 to E-89; M-1 to Q-88; M-1 to V-87; M-1 to A-86; M-1 to W-85; M-1 to A-84; 
M-1 to P-83; 
M-1 to N-82; M-1 to N-81; M-1 to T-80; M-1 to A-79; M-1 to V-78; M-1 to P-77; 
M-1 to A-76; M-1 to K-75; M-1 to W-74; M-1 to S-73; M-1 to I-72; M-1 to D-71; 
M-1 to N-70; M-1 to G-69; M-1 to A-68; M-1 to R-67; M-1 to F-66; M-1 to W-65; 
M-1 to Y-64; M-1 to G-63; M-1 to H-62; M-1 to V-61; M-1 to P-60; M-1 to D-59; 
M-1 to S-58; M-1 to D-57; M-1 to T-56; M-1 to Q-55; M-1 to S-54; M-1 to D-53; 
M-1 to V-52; M-1 to P-51; M-1 to Y-50; M-1 to S-49; M-1 to F-48; M-1 to S-47; 
M-1 to C-46; M-1 to R-45; M-1 to V-44; M-1 to H-43; M-1 to V-42; M-1 to C-41; 
M-1 to M-40; M-1 to G-39; M-1 to E-38; 
M-1 to Q-37; M-1 to V-36; M-1 to T-35; M-1 to V-34; M-1 to S-33; M-1 to S-32; 
M-1 to Q-31; M-1 to M-30; M-1 to T-29; M-1 to L-28; M-1 to S-27; M-1 to Y-26; 
M-1 to D-25; M-1 to K-24; M-1 to R-23; M-1 to N-22; M-1 to S-21; M-1 to K-20; 
M-1 to Q-19; M-1 to G-18; M-1 to E-17; M-1 to V-16; M-1 to R-15; M-1 to E-14; 
M-1 to R-13; M-1 to G-12; M-1 to W-11; M-1 to L-10; M-1 to L-9; M-1 to P-8; 
M-1 to L-7; and M-1 to L-6 of SEQ ID NO:34. Polypeptides encoded by these 
polynucleotides are also encompassed by the invention. 

In addition, the invention provides nucleic acid molecules having nucleotide 
sequences related to extensive portions of SEQ ID NO: 16 which have been determined 
from the following related cDNA genes: HGBAY02R (SEQ ID NO: 100) and 
HLYBY62R (SEQ ID NO: 101). 

Based on the sequence similarity to the human CD33L1, the translation product of 
this gene is expected to share at least some biological activities with CD33 proteins, and 
specifically myeloid modulatory proteins and/or siglec proteins. Such activities are 
known in the art, some of which are described elsewhere herein. 

Specifically, polynucleotides and polypeptides of the invention are also useful for 
modulating the differentiation of normal and malignant cells, modulating the 
proliferation and/or differentiation of cancer and neoplastic cells, and modulating the 
immune response. Polynucleotides and polypeptides of the invention may represent a 
diagnostic marker for hematopoietic and immune diseases and/or disorders. The 
full-length protein should be a secreted protein, based upon homology to the CD33 family. 
Therefore, it is secreted into serum, urine, or feces and thus its levels are assayable from 
patient samples. Assuming specific expression levels are reflective of the presence of 
immune disorders, this protein would provide a convenient diagnostic for early detection. 
In addition, expression of this gene product may also be linked to the progression of 
immune diseases, and therefore may itself actually represent a therapeutic or therapeutic 
target for the treatment of cancer. 

Polynucleotides and polypeptides of the invention may play an important role in 
the pathogenesis of human cancers and cellular transformation, particularly those of the 
immune and hematopoietic systems. Polynucleotides and polypeptides of the invention 
may also be involved in the pathogenesis of developmental abnormalities based upon their 
potential effects on proliferation and differentiation of cells and tissue cell types. Due to 
the potential proliferating and differentiating activity of said polynucleotides and 
polypeptides, the invention is useful as a therapeutic agent in inducing tissue 
regeneration and for treating inflammatory conditions (e.g., inflammatory bowel syndrome, 
diverticulitis, etc.). Moreover, the invention is useful in modulating the immune 
response to aberrant polypeptides, as may exist in rapidly proliferating cells and tissue 
cell types, particularly in adenocarcinoma cells, and other cancers. 

This gene is expressed predominantly on NK cells, and to a lesser extent on T- 
cells. Therefore, polynucleotides and polypeptides of the invention are useful as reagents 
for differential identification of the tissue(s) or cell type(s) present in a biological sample 
and for diagnosis of the following diseases and conditions which include, but are not 
limited to, immune disorders and cancer, as well as the immunodiagnosis of acute 
leukemias. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful to provide immunological probes for differential identification of the tissue(s) or 
cell type(s). For a number of disorders of the above tissues or cells, particularly of the 
immune system, and breast tissue, expression of this gene at significantly higher or 
lower levels is detected in certain tissues or cell types (e.g., immune, cancerous and 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic epitopes 
shown in SEQ ID NO:34 as residues: Gly-12 to Tyr-26, Val-52 to Asp-59, Gln-88 to 
Asp-93, Arg-124 to Asn-129, His-193 to Arg-198, Gln-207 to Thr-213, Gln-338 to Arg-346, 
Ser-378 to Ala-384, Ser-413 to Arg-420, Ser-428 to Glu-434, His-443 to Ser-451, and 
Glu-454 to Ser-461. Polynucleotides encoding said polypeptides are also provided. 
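
As a quick consistency check on the epitope coordinates listed above, the following Python sketch computes each epitope's length and flags any overlap with the predicted signal peptide (about residues 1 to 18) or the predicted transmembrane domain (about residues 360 to 376) described for Figures 16A-B. The data structures and function names are assumptions made for illustration; only the residue numbers come from the text.

```python
# Epitope boundaries (1-based, inclusive) as recited for SEQ ID NO:34.
EPITOPES = [
    (12, 26), (52, 59), (88, 93), (124, 129), (193, 198), (207, 213),
    (338, 346), (378, 384), (413, 420), (428, 434), (443, 451), (454, 461),
]
# Predicted domains; the text gives these as "about" positions.
PREDICTED_DOMAINS = {
    "signal peptide": (1, 18),
    "transmembrane domain": (360, 376),
}


def overlaps(a, b):
    """True if two 1-based inclusive ranges share at least one residue."""
    return a[0] <= b[1] and b[0] <= a[1]


def annotate(epitopes=EPITOPES, domains=PREDICTED_DOMAINS):
    """Return (start, end, length, [overlapping domain names]) per epitope."""
    rows = []
    for start, end in epitopes:
        hits = [name for name, span in domains.items()
                if overlaps((start, end), span)]
        rows.append((start, end, end - start + 1, hits))
    return rows


for row in annotate():
    print(row)
```

With these coordinates, only the first epitope (Gly-12 to Tyr-26) overlaps the predicted signal peptide and none overlaps the predicted transmembrane domain. Because both domain boundaries are approximate predictions, the check is indicative only.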

The tissue distribution in NK cells, in combination with the homology to the siglec 
family of proteins, indicates the protein product of this gene is useful for the diagnosis 
and treatment of a variety of immune system disorders. NK cells are bone-marrow 
derived granular lymphocytes that play an important role in natural immunity to 
infectious diseases and have the capacity to kill certain virally-infected cells and tumor 
cells that have down-regulated MHC Class-I antigen expression. The killing and 
proinflammatory activities of NK cells are regulated through a variety of cell surface 
receptors that can mediate either activating or inhibitory signals. The best understood 
receptors are those that recognize MHC Class I molecules at the cell surface and deliver 
a negative signal, thereby protecting normal host cells from cytotoxicity. These 
receptors can belong either to the C-type lectin superfamily or the Ig superfamily, 
although in humans the majority are members of the Ig superfamily known as killer cell 
Ig-like receptors (KIRs). Representative uses are described in the "Immune Activity" and 
"infectious disease" sections below, in Examples 11, 13, 14, 16, 18, 19, 20, and 27, and 
elsewhere herein. Briefly, the expression indicates a role in regulating the proliferation, 
survival, differentiation, and/or activation of hematopoietic cell lineages, including blood 
stem cells. Involvement in the regulation of cytokine production, antigen presentation, 
or other processes indicates a usefulness for treatment of cancer (e.g. by boosting 
immune responses). Expression in cells of lymphoid origin indicates the natural gene 
product is involved in immune functions. Therefore it would also be useful as an agent 
for immunological disorders including arthritis, asthma, immunodeficiency diseases 
such as AIDS, leukemia, rheumatoid arthritis, granulomatous disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, such 
as T-cell mediated cytotoxicity; immune reactions to transplanted organs and tissues, 
such as host-versus-graft and graft-versus-host diseases, or autoimmunity disorders, such 
as autoimmune infertility, lens tissue injury, demyelination, systemic lupus 
erythematosus, drug-induced hemolytic anemia, rheumatoid arthritis, Sjogren's disease, 
and scleroderma. Moreover, the protein may represent a secreted factor that influences 
the differentiation or behavior of other blood cells, or that recruits hematopoietic cells 
to sites of injury. Thus, this gene product is thought to be useful in the expansion of 
stem cells and committed progenitors of various blood lineages, and in the differentiation 
and/or proliferation of various cell types. Based upon the tissue distribution of this 
protein, antagonists directed against this protein are useful in blocking the activity of this 
protein. Accordingly, preferred are antibodies which specifically bind a portion of the 
translation product of this gene. 

Also provided is a kit for detecting tumors in which expression of this protein 
occurs. Such a kit comprises in one embodiment an antibody specific for the translation 
product of this gene bound to a solid support. Also provided is a method of detecting 
these tumors in an individual which comprises a step of contacting an antibody specific 
for the translation product of this gene to a bodily fluid from the individual, preferably 
serum, and ascertaining whether antibody binds to an antigen found in the bodily fluid. 
Preferably the antibody is bound to a solid support and the bodily fluid is serum. The 
above embodiments, as well as other treatments and diagnostic tests (kits and methods), 
are more particularly described elsewhere herein. Furthermore, the protein may also be 
used to determine biological activity, to raise antibodies, as a tissue marker, to isolate 
cognate ligands or receptors, and to identify agents that modulate their interactions, in 
addition to its use as a nutritional supplement. The protein, as well as antibodies directed 
against the protein, may show utility as a tumor marker and/or immunotherapy target for 
the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO: 16 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer from 1 to 1734 of SEQ ID NO: 16, b is an integer from 15 to 
1748, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO: 16, and where b is greater than or equal to a + 14. 
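
The a-b exclusion formula above can be restated as a simple range test. The following is only an illustrative sketch: the function and constant names are hypothetical and not part of the specification, and the numeric bounds are the ones recited above for SEQ ID NO: 16.

```python
# Bounds recited in the text for SEQ ID NO: 16 (names are illustrative only).
A_MIN, A_MAX = 1, 1734
B_MIN, B_MAX = 15, 1748
MIN_OFFSET = 14  # b >= a + 14, i.e. a span of at least 15 nucleotides


def is_excluded_range(a: int, b: int) -> bool:
    """Return True if positions a..b satisfy the recited a-b formula."""
    return (
        A_MIN <= a <= A_MAX
        and B_MIN <= b <= B_MAX
        and b >= a + MIN_OFFSET
    )


def count_excluded_ranges() -> int:
    """Count every (a, b) pair that satisfies the formula."""
    total = 0
    for a in range(A_MIN, A_MAX + 1):
        lowest_b = max(B_MIN, a + MIN_OFFSET)
        if lowest_b <= B_MAX:
            total += B_MAX - lowest_b + 1
    return total
```

Under these bounds the shortest span covered by the formula is 15 nucleotides (for example a = 1, b = 15), and the longest is the full range from position 1 to position 1748.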


FEATURES OF PROTEIN ENCODED BY GENE NO: 7 

This invention relates to newly identified polynucleotides, polypeptides encoded 
by such polynucleotides, the use of such polynucleotides and polypeptides, as well as the 
production of such polynucleotides and polypeptides. The polypeptide of the present 
invention has been putatively identified as a human integrin alpha 11 homolog derived 
from a human osteoblast II cDNA library. More particularly, the polypeptide of the 
present invention has been putatively identified as a human integrin alpha 11-subunit 
homolog, sometimes hereafter referred to as "integrin alpha 11", "integrin alpha 11-subunit", 
"α11", "α11-subunit", and/or "Integrin α11-subunit". The invention also relates 
to inhibiting the action of such polypeptides. 

The integrins are a large family of cell adhesion molecules consisting of noncovalently 
associated αβ heterodimers. 

We have cloned and sequenced a novel human integrin α-subunit cDNA, 
designated α11. The α11 cDNA encodes a protein with a 22 amino acid signal peptide, a 
large 1120 residue extracellular domain that contains an I-domain of 207 residues and is 
linked by a transmembrane domain to a short cytoplasmic domain of 24 amino acids. The 
deduced α11 protein shows the typical structural features of integrin α-subunits and is 
similar to a distinct group of α-subunits from collagen-binding integrins. However, it 
differs from most integrin α-chains by an incompletely preserved cytoplasmic GFFKR 
motif. 

The human ITGA11 gene was located to bands q22.3-23 on chromosome 15, and 
its transcripts were found predominantly in bone and cartilage as well as in cardiac and 
skeletal muscle. Expression of the 5.5 kilobase α11 mRNA was also detectable in ovary 
and small intestine. 

All vertebrate cells express members of the integrin family of cell adhesion molecules, 
which mediate cellular adhesion to other cells and to the extracellular substratum as well 
as cell migration, and participate in important physiologic processes from signal 
transduction to cell proliferation and differentiation {Hynes, 92; Springer, 92}. 

Integrins are structurally homologous heterodimeric type-I membrane 
glycoproteins formed by the noncovalent association of one of eight β-subunits with one 
of the 17 different α-subunits described to date, resulting in at least 22 different αβ 
complexes. Their binding specificities for cellular and extracellular ligands are 
determined by both subunits and are dynamically regulated in a cell-type-specific mode 
by the cellular environment as well as by the developmental and activation state of the 
cell {Diamond and Springer, 94}. In integrin α-subunits, the aminoterminal region of the 
large extracellular domain consists of a seven-fold repeated structure which is predicted 
to fold into a β-propeller domain {Corbi et al., 1987; Springer, 1997}. The three or four 
C-terminal repeats contain putative divalent cation binding motifs that are thought to be 
important for ligand binding and subunit association {Diamond and Springer, 94}. The 
α1, α2, α10, αD, αE, αL, αM and αX-subunits contain an approximately 200 amino acid 
I-domain inserted between the second and third repeat that is not present in other α-chains 
{Larson et al., 1989}. Several isolated I-domains have been shown to independently bind 
the ligands of the parent integrin heterodimer {Kamata and Takada, 1994; Randi and 
Hogg, 1994}. The α3, α5-8, αIIb and αV-subunits are proteolytically processed at a 
conserved site into disulphide-linked heavy and light chains, while the α4-subunit is 
cleaved at a more aminoterminal site into two fragments that remain noncovalently 
associated {Hemler et al., 90}. Additional α-subunit variants are generated by alternative 
splicing of primary transcripts {Ziober et al., 93; Delwel et al., 95; Leung et al., 98}. 

The extracellular domains of α-integrin subunits are connected by a single 
spanning transmembrane domain to short, diverse cytoplasmic domains whose only 
conserved feature is a membrane-proximal KXGFF(K/R)R motif {Sastry and Horwitz, 
1993}. The cytoplasmic domains have been implicated in the cell-type-specific 
modulation of integrin affinity states {Williams et al., 1994}. 

The polypeptide of the present invention has been putatively identified as a 
member of the integrin family and has been termed integrin alpha 11 subunit ("α11"). 
This identification has been made as a result of amino acid sequence homology to the 
human integrin alpha 1 subunit (See Genbank Accession No. gi|346210). 

Figures 19A-F show the nucleotide (SEQ ID NO: 17) and deduced amino acid 
sequence (SEQ ID NO:35) of α11. Predicted amino acids from about 1 to about 22 
constitute the predicted signal peptide (amino acid residues from about 1 to about 22 in 
SEQ ID NO:35) and are represented by the underlined amino acid regions; amino acids 
from about 666 to about 682, and/or amino acids from about 1145 to about 1161 
constitute the predicted transmembrane domains (amino acids from about 666 to about 
682, and/or amino acids from about 1145 to about 1161 in SEQ ID NO:35) and are 
represented by the double underlined amino acids; and amino acids from about 64 to 
about 96 constitute the predicted immunoglobulin and major histocompatibility complex 
protein domain (amino acids from about 64 to about 96 in SEQ ID NO:35) and are 
represented by the bold amino acids. 

Figure 20 shows the regions of similarity between the amino acid 
sequences of the integrin alpha 11 subunit (α11) protein (SEQ ID NO:35) and the human 
integrin alpha 1 subunit (SEQ ID NO: 103). 

Figure 21 shows an analysis of the integrin alpha 11 subunit (α11) amino acid 
sequence. Alpha, beta, turn and coil regions; hydrophilicity and hydrophobicity; 
amphipathic regions; flexible regions; antigenic index and surface probability are shown. 

A polynucleotide encoding a polypeptide of the present invention is obtained 
from human ovary, small intestine, fetal heart, fetal brain, large intestine, osteoblasts, 
human trabecular bone cells, mesangial cells, adipocytes, osteosarcoma, 
chondrosarcoma, breast cancer cells, and bone marrow tissues and cells. The 
polynucleotide of this invention was discovered in a human osteoblast II cDNA library. 
Its translation product has homology to the characteristic immunoglobulin and major 
histocompatibility complex protein domain of integrin family members. As shown in 
Figures 19A-F, α11 has transmembrane domains (the transmembrane domains comprise 
amino acids 666 - 682 and/or 1145 - 1161 of SEQ ID NO:35, which correspond to 
amino acids 666 - 682 and/or 1145 - 1161 of Figures 19A-F) with strong conservation 
between other members of the integrin family. The polynucleotide contains an open 
reading frame encoding the α11 polypeptide of 1189 amino acids. The present invention 
exhibits a high degree of homology at the amino acid level to the human integrin alpha 1 
subunit (as shown in Figure 20). 

Preferred polypeptides of the invention comprise the following amino acid 
sequence: TNGYQKTGDVYKCPVIHGNCTKLNLGRVTLSNV (SEQ ID NO: 102). 
Polynucleotides encoding these polypeptides are also provided. 

The present invention provides isolated nucleic acid molecules comprising a 
polynucleotide encoding the α11 polypeptide having the amino acid sequence shown in 
Figures 19A-F (SEQ ID NO:35). The nucleotide sequence shown in Figures 19A-F 
(SEQ ID NO:35) was obtained by sequencing a cloned cDNA (HOHBY69), which was 
deposited on November 17 at the American Type Culture Collection, and given 
Accession Number 203484. 

The present invention is further directed to fragments of the isolated nucleic acid 
molecules described herein. By a fragment of an isolated DNA molecule having the 
nucleotide sequence of the deposited cDNA or the nucleotide sequence shown in SEQ ID 
NO:17 is intended DNA fragments at least about 15 nt, and more preferably at least about 
20 nt, still more preferably at least about 30 nt, and even more preferably, at least about 
40 nt in length which are useful as diagnostic probes and primers as discussed herein. Of 
course, larger fragments 50-1500 nt in length are also useful according to the present 
invention, as are fragments corresponding to most, if not all, of the nucleotide sequence 
of the deposited cDNA or as shown in SEQ ID NO: 17. By a fragment at least 20 nt in 
length, for example, is intended fragments which include 20 or more contiguous bases 
from the nucleotide sequence of the deposited cDNA or the nucleotide sequence as 
shown in SEQ ID NO: 17. In this context "about" includes the particularly recited size, 
larger or smaller by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at both 
termini. Representative examples of α11 polynucleotide fragments of the invention 
include, for example, fragments that comprise, or alternatively, consist of, a sequence 
from about nucleotide 1 to about 50, from about 51 to about 100, from about 101 to 
about 150, from about 151 to about 200, from about 201 to about 250, from about 251 to 
about 300, from about 301 to about 350, from about 351 to about 400, from about 401 to 
about 450, from about 451 to about 500, from about 501 to about 550, from about 551 to 
about 600, from about 601 to about 650, from about 651 to about 700, from about 701 to 
about 750, from about 751 to about 800, from about 801 to about 850, from about 851 to 
about 900, from about 901 to about 950, from about 951 to about 1000, from about 1001 
to about 1050, from about 1051 to about 1100, from about 1101 to about 1150, from 
about 1151 to about 1200, from about 1201 to about 1250, from about 1251 to about 
1300, from about 1301 to about 1350, from about 1351 to about 1400, from about 1401 
to about 1450, from about 1451 to about 1500, from about 1501 to about 1550, from 
about 1551 to about 1600, from about 1601 to about 1650, from about 1651 to about 
1700, from about 1701 to about 1750, from about 1751 to about 1800, from about 1801 
to about 1850, from about 1851 to about 1900, from about 1901 to about 1950, from 
about 1951 to about 2000, from about 2001 to about 2050, from about 2051 to about 
2100, from about 2101 to about 2150, from about 2151 to about 2200, from about 2201 
to about 2250, from about 2251 to about 2300, from about 2301 to about 2350, from 
about 2351 to about 2400, from about 2401 to about 2450, from about 2451 to about 
2500, from about 2501 to about 2550, from about 2551 to about 2600, from about 2601 
to about 2650, from about 2651 to about 2700, from about 2701 to about 2750, from 
about 2751 to about 2800, from about 2801 to about 2850, from about 2851 to about 
2900, from about 2901 to about 2950, from about 2951 to about 3000, from about 3001 
to about 3050, from about 3051 to about 3100, from about 3101 to about 3150, from 
about 3151 to about 3200, from about 3201 to about 3250, from about 3251 to about 
3300, from about 3301 to about 3350, from about 3351 to about 3400, from about 3401 
to about 3450, from about 3451 to about 3500, from about 3501 to about 3550, from 
about 3551 to about 3600, from about 3601 to about 3650, from about 3651 to about 
3700, from about 3701 to about 3750, from about 3751 to about 3800, from about 3801 
to about 3850, from about 3851 to about 3900, from about 3901 to about 3950, from 
about 3951 to about 4000, from about 4001 to about 4050, from about 4051 to about 
4100, from about 4101 to about 4150, from about 4151 to about 4200, from about 4201 
to about 4250, from about 4251 to about 4300, from about 4301 to about 4350, from 
about 4351 to about 4400, from about 4401 to about 4450, from about 4451 to about 
4500, from about 4501 to about 4550, from about 4551 to about 4600, from about 4601 
to about 4650, from about 4651 to about 4700, from about 4701 to about 4750, from 
about 4751 to about 4800, from about 4801 to about 4850, from about 4851 to about 
4900, from about 4901 to about 4950, from about 4951 to about 4995, from 
about 1 to about 236, from about 144 to about 188, from about 231 to about 276 of SEQ 
ID NO: 17, or the complementary strand thereto, or the cDNA contained in the deposited 
gene. In this context "about" includes the particularly recited ranges, larger or smaller by 
several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at both termini. 
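
The consecutive 50-nucleotide ranges recited above follow a regular tiling, which can be sketched as below. This is an illustration only: the function names are hypothetical, the value 4995 is simply the last position recited in the list above, and the tolerance of 5 reflects the stated meaning of "about".

```python
from typing import List, Tuple

ABOUT_TOLERANCE = 5  # "about": recited value, larger or smaller by up to 5 nt


def fragment_windows(total_length: int, window: int = 50) -> List[Tuple[int, int]]:
    """Return 1-based, inclusive (start, end) windows tiling 1..total_length."""
    if total_length < 1 or window < 1:
        raise ValueError("total_length and window must be positive")
    return [
        (start, min(start + window - 1, total_length))
        for start in range(1, total_length + 1, window)
    ]


def is_about(position: int, recited: int, tolerance: int = ABOUT_TOLERANCE) -> bool:
    """True if a terminus lies within the tolerance of a recited position."""
    return abs(position - recited) <= tolerance


# Illustrative usage: 4995 is the final position recited in the list above.
windows = fragment_windows(4995)
```

Here `windows` begins with (1, 50) and (51, 100), and the final window is truncated at the last recited position rather than padded to a full 50 nucleotides.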

Preferred nucleic acid fragments of the present invention include nucleic acid 
molecules encoding a member selected from the group: a polypeptide comprising or 
alternatively, consisting of, any one of the transmembrane domains (amino acid residues 
from about 666 to about 682 and/or 1145 to about 1161 in Figures 19A-F; amino acids 
from about 666 to about 682 and/or 1145 to about 1161 in SEQ ID NO:35), in addition to 
the immunoglobulin and major histocompatibility complex protein domain (amino acid 
residues from about 64 to about 96 in Figures 19A-F; amino acids from about 64 to 
about 96 in SEQ ID NO:35). Since the location of these domains has been predicted by 
computer analysis, one of ordinary skill would appreciate that the amino acid residues 
constituting these domains may vary slightly (e.g., by about 1 to 15 amino acid residues) 
depending on the criteria used to define each domain. In additional embodiments, the 
polynucleotides of the invention encode functional attributes of α11. 

Preferred embodiments of the invention in this regard include fragments that 
comprise alpha-helix and alpha-helix forming regions ("alpha-regions"), beta-sheet and 
beta-sheet forming regions ("beta-regions"), turn and turn-forming regions ("turn- 
regions"), coil and coil-forming regions ("coil-regions"), hydrophilic regions, 
hydrophobic regions, alpha amphipathic regions, beta amphipathic regions, flexible 
regions, surface-forming regions and high antigenic index regions of the present 
invention. 

The data representing the structural or functional attributes of α11 set forth in 
Figure 21 and/or Table VII, as described above, was generated using the various modules 
and algorithms of the DNA*STAR set on default parameters. In a preferred 
embodiment, the data presented in columns VIII, IX, XIII, and XIV of Table VII can be 
used to determine regions of α11 which exhibit a high degree of potential for 
antigenicity. Regions of high antigenicity are determined from the data presented in 
columns VIII, IX, XIII, and/or XIV by choosing values which represent regions of the 
polypeptide which are likely to be exposed on the surface of the polypeptide in an 
environment in which antigen recognition may occur in the process of initiation of an 
immune response. 
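
As a rough illustration of how a per-residue profile can be scanned for stretches likely to be surface-exposed, the sketch below computes a sliding-window average of the published Kyte-Doolittle hydropathy scale over the peptide of SEQ ID NO: 102. It is not the DNA*STAR analysis referred to above and does not reproduce Table VII; the window size of 7, the threshold of 0.0, and all function names are assumptions chosen for the example.

```python
from typing import List, Tuple

# Kyte-Doolittle hydropathy scale (negative values are hydrophilic).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(sequence: str, window: int = 7) -> List[float]:
    """Mean hydropathy of every full window along the sequence."""
    if not 1 <= window <= len(sequence):
        raise ValueError("window must be between 1 and the sequence length")
    scores = [KYTE_DOOLITTLE[residue] for residue in sequence]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(scores) - window + 1)
    ]


def hydrophilic_windows(
    sequence: str, window: int = 7, threshold: float = 0.0
) -> List[Tuple[int, int, float]]:
    """Return (start, end, score) for windows scoring below the threshold.

    Positions are 1-based and relative to the supplied sequence.
    """
    profile = hydropathy_profile(sequence, window)
    return [
        (i + 1, i + window, score)
        for i, score in enumerate(profile)
        if score < threshold
    ]


# Peptide recited in the specification as SEQ ID NO: 102.
PEPTIDE = "TNGYQKTGDVYKCPVIHGNCTKLNLGRVTLSNV"
profile = hydropathy_profile(PEPTIDE)
candidates = hydrophilic_windows(PEPTIDE)
```

A low mean hydropathy is only one indicator of exposure; the antigenic index and surface probability columns named above combine further parameters that this sketch does not model.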

Certain preferred regions in these regards are set out in Figure 21, but may, as 
shown in Table VII, be represented or identified by using tabular representations of the 
data presented in Figure 21. The DNA*STAR computer algorithm used to generate 
Figure 21 (set on the original default parameters) was used to present the data in Figure 
21 in a tabular format (See Table VII). The tabular format of the data in Figure 21 is 
used to easily determine specific boundaries of a preferred region. The above-mentioned 
preferred regions set out in Figure 21 and in Table VII include, but are not limited to, 
regions of the aforementioned types identified by analysis of the amino acid sequence set 
out in Figures 19A-F. As set out in Figure 21 and in Table VII, such preferred regions 
include Garnier-Robson alpha-regions, beta-regions, turn-regions, and coil-regions, 
Chou-Fasman alpha-regions, beta-regions, and turn-regions, Kyte-Doolittle hydrophilic 
regions and Hopp-Woods hydrophobic regions, Eisenberg alpha- and beta-amphipathic 
regions, Karplus-Schulz flexible regions, Jameson-Wolf regions of high antigenic index 
and Emini surface-forming regions. 

Even if deletion of one or more amino acids from the 
N-terminus of a protein results in modification or loss of one or more biological 
functions of the protein, other functional activities (e.g., biological activities, ability to 
multimerize, etc.) may still be retained. For example, the ability of shortened α11 
muteins to induce and/or bind to antibodies which recognize the complete or mature 
forms of the polypeptides generally will be retained when less than the majority of the 
residues of the complete or mature polypeptide are removed from the N-terminus. 
Whether a particular polypeptide lacking N-terminal residues of a complete polypeptide 
retains such immunologic activities can readily be determined by routine methods 
described herein and otherwise known in the art. It is not unlikely that an α11 mutein 
with a large number of deleted N-terminal amino acid residues may retain some 
biological or immunogenic activities. In fact, peptides composed of as few as six α11 
amino acid residues may often evoke an immune response. 
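
A series of N-terminal deletions that always leaves at least six residues can be enumerated mechanically. The sketch below is illustrative only: the function name and parameters are hypothetical, the labels follow the "residue-position to residue-position" notation used in this specification, and the short peptide of SEQ ID NO: 102 (residues 64 to 96 of SEQ ID NO:35) stands in for the full-length sequence.

```python
from typing import List


def n_terminal_deletions(
    sequence: str, first_position: int = 1, min_remaining: int = 6
) -> List[str]:
    """Label each N-terminal deletion leaving at least min_remaining residues.

    first_position is the numbering of the first residue of `sequence`
    within the parent protein.
    """
    if min_remaining < 1:
        raise ValueError("min_remaining must be positive")
    last_position = first_position + len(sequence) - 1
    last_residue = sequence[-1]
    labels = []
    # Deleting `deleted` residues makes index `deleted` the new N-terminus.
    for deleted in range(1, len(sequence) - min_remaining + 1):
        position = first_position + deleted
        labels.append(
            f"{sequence[deleted]}-{position} to {last_residue}-{last_position}"
        )
    return labels


# Illustrative input: the peptide of SEQ ID NO: 102, numbered from residue 64.
PEPTIDE = "TNGYQKTGDVYKCPVIHGNCTKLNLGRVTLSNV"
labels = n_terminal_deletions(PEPTIDE, first_position=64)
```

For this 33-residue peptide the series runs from "N-65 to V-96" to "V-91 to V-96". Applied to a sequence of 1189 residues numbered from 1, the same rule gives new N-termini from position 2 through position 1184.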

Accordingly, the present invention further provides polypeptides having one or 
more residues deleted from the amino terminus of the α11 amino acid sequence shown in 
Figures 19A-F, up to the threonine residue at position number 1184, and polynucleotides 
encoding such polypeptides. In particular, the present invention provides polypeptides 
comprising the amino acid sequence of residues n1-1189 of Figures 19A-F, where n1 is 
an integer from 2 to 1184 corresponding to the position of the amino acid residue in 
Figures 19A-F (which is identical to the sequence shown as SEQ ID NO:35). In another 
embodiment, N-terminal deletions of the α11 polypeptide can be described by the general 
formula n2-1189, where n2 is a number from 2 to 1184, corresponding to the position of 
amino acid identified in Figure 19. N-terminal deletions of the α11 polypeptide of the 
invention shown as SEQ ID NO:35 include polypeptides comprising the amino acid 
sequence of residues: 
D-2 to E-1189; L-3 to E-1189; P-4 to E-1189; R-5 to E-1189; G-6 to E-1189; 
L-7 to E-1189; V-8 to E-1189; V-9 to E-1189; A-10 to E-1189; W-11 to E-1189; 
A-12 to E-1189; L-13 to E-1189; S-14 to E-1189; L-15 to E-1189; W-16 to E-1189; 
P-17 to E-1189; G-18 to E-1189; F-19 to E-1189; T-20 to E-1189; D-21 to E-1189; 
T-22 to E-1189; F-23 to E-1189; N-24 to E-1189; M-25 to E-1189; D-26 to E-1189; 
T-27 to E-1189; R-28 to E-1189; K-29 to E-1189; P-30 to E-1189; R-31 to E-1189; 
V-32 to E-1189; I-33 to E-1189; P-34 to E-1189; G-35 to E-1189; S-36 to E-1189; 
R-37 to E-1189; T-38 to E-1189; A-39 to E-1189; F-40 to E-1189; F-41 to E-1189; 
G-42 to E-1189; Y-43 to E-1189; T-44 to E-1189; V-45 to E-1189; Q-46 to E-1189; 
Q-47 to E-1189; H-48 to E-1189; D-49 to E-1189; I-50 to E-1189; S-51 to E-1189; 
G-52 to E-1189; N-53 to E-1189; K-54 to E-1189; W-55 to E-1189; L-56 to E-1189; 
V-57 to E-1189; V-58 to E-1189; G-59 to E-1189; A-60 to E-1189; P-61 to E-1189; 
L-62 to E-1189; E-63 to E-1189; T-64 to E-1189; N-65 to E-1189; G-66 to E-1189; 
Y-67 to E-1189; Q-68 to E-1189; K-69 to E-1189; T-70 to E-1189; G-71 to E-1189; 
D-72 to E-1189; V-73 to E-1189; Y-74 to E-1189; K-75 to E-1189; C-76 to E-1189; 
P-77 to E-1189; V-78 to E-1189; I-79 to E-1189; H-80 to E-1189; G-81 to E-1189; 
N-82 to E-1189; C-83 to E-1189; T-84 to E-1189; K-85 to E-1189; L-86 to E-1189; 
N-87 to E-1189; L-88 to E-1189; G-89 to E-1189; R-90 to E-1189; V-91 to E-1189; 
T-92 to E-1189; L-93 to E-1189; S-94 to E-1189; N-95 to E-1189; V-96 to E-1189; 
S-97 to E-1189; E-98 to E-1189; R-99 to E-1189; K-100 to E-1189; D-101 to E-1189; 
N-102 to E-1189; M-103 to E-1189; R-104 to E-1189; L-105 to E-1189; G-106 to E-1189; 
L-107 to E-1189; S-108 to E-1189; L-109 to E-1189; A-110 to E-1189; T-111 to E-1189; 
N-112 to E-1189; P-113 to E-1189; K-114 to E-1189; D-115 to E-1189; N-116 to E-1189; 
S-117 to E-1189; F-118 to E-1189; L-119 to E-1189; A-120 to E-1189; C-121 to E-1189; 
S-122 to E-1189; P-123 to E-1189; L-124 to E-1189; W-125 to E-1189; S-126 to E-1189; 
H-127 to E-1189; E-128 to E-1189; C-129 to E-1189; G-130 to E-1189; S-131 to E-1189; 
S-132 to E-1189; Y-133 to E-1189; Y-134 to E-1189; T-135 to E-1189; T-136 to E-1189; 
G-137 to E-1189; M-138 to E-1189; C-139 to E-1189; S-140 to E-1189; R-141 to E-1189; 
V-142 to E-1189; N-143 to E-1189; S-144 to E-1189; N-145 to E-1189; F-146 to E-1189; 
R-147 to E-1189; F-148 to E-1189; S-149 to E-1189; K-150 to E-1189; T-151 to E-1189; 
V-152 to E-1189; A-153 to E-1189; P-154 to E-1189; A-155 to E-1189; L-156 to E-1189; 
Q-157 to E-1189; R-158 to E-1189; C-159 to E-1189; Q-160 to E-1189; T-161 to E-1189; 
Y-162 to E-1189; M-163 to E-1189; D-164 to E-1189; I-165 to E-1189; V-166 to E-1189; 
I-167 to E-1189; V-168 to E-1189; L-169 to E-1189; D-170 to E-1189; G-171 to E-1189; 
S-172 to E-1189; N-173 to E-1189; S-174 to E-1189; I-175 to E-1189; Y-176 to E-1189; 
P-177 to E-1189; W-178 to E-1189; V-179 to E-1189; E-180 to E-1189; V-181 to E-1189; 
Q-182 to E-1189; H-183 to E-1189; F-184 to E-1189; L-185 to E-1189; I-186 to E-1189; 
N-187 to E-1189; I-188 to E-1189; L-189 to E-1189; K-190 to E-1189; K-191 to E-1189; 
F-192 to E-1189; Y-193 to E-1189; I-194 to E-1189; G-195 to E-1189; P-196 to E-1189; 
G-197 to E-1189; Q-198 to E-1189; I-199 to E-1189; Q-200 to E-1189; V-201 to E-1189; 
G-202 to E-1189; V-203 to E-1189; V-204 to E-1189; Q-205 to E-1189; 
Y-206 to E-1189; G-207 to E-1189; E-208 to E-1189; D-209 to E-1189; V-210 to E-1189; 
V-211 to E-1189; H-212 to E-1189; E-213 to E-1189; F-214 to E-1189; H-215 to E-1189; 
L-216 to E-1189; N-217 to E-1189; D-218 to E-1189; Y-219 to E-1189; R-220 to E-1189; 
S-221 to E-1189; V-222 to E-1189; K-223 to E-1189; D-224 to E-1189; V-225 to E-1189; 
V-226 to E-1189; E-227 to E-1189; A-228 to E-1189; A-229 to E-1189; S-230 to E-1189; 
H-231 to E-1189; I-232 to E-1189; E-233 to E-1189; Q-234 to E-1189; R-235 to E-1189; 
G-236 to E-1189; G-237 to E-1189; T-238 to E-1189; E-239 to E-1189; T-240 to E-1189; 
R-241 to E-1189; T-242 to E-1189; A-243 to E-1189; F-244 to E-1189; G-245 to E-1189; 
I-246 to E-1189; E-247 to E-1189; F-248 to E-1189; A-249 to E-1189; R-250 to E-1189; 
S-251 to E-1189; E-252 to E-1189; A-253 to E-1189; F-254 to E-1189; Q-255 to E-1189; 
K-256 to E-1189; G-257 to E-1189; G-258 to E-1189; R-259 to E-1189; K-260 to E-1189; 
G-261 to E-1189; A-262 to E-1189; K-263 to E-1189; K-264 to E-1189; V-265 to E-1189; 
M-266 to E-1189; I-267 to E-1189; V-268 to E-1189; I-269 to E-1189; T-270 to E-1189; 
D-271 to E-1189; G-272 to E-1189; E-273 to E-1189; S-274 to E-1189; H-275 to E-1189; 
D-276 to E-1189; S-277 to E-1189; P-278 to E-1189; D-279 to E-1189; L-280 to E-1189; 
E-281 to E-1189; K-282 to E-1189; V-283 to E-1189; I-284 to E-1189; Q-285 to E-1189; 
Q-286 to E-1189; S-287 to E-1189; E-288 to E-1189; R-289 to E-1189; D-290 to E-1189; 
N-291 to E-1189; V-292 to E-1189; T-293 to E-1189; 
R-294 to E-1189; Y-295 to E-1189; A-296 to E-1189; V-297 to E-1189; A-298 to E-1189; 
V-299 to E-1189; L-300 to E-1189; G-301 to E-1189; Y-302 to E-1189; Y-303 to E-1189; 
N-304 to E-1189; R-305 to E-1189; R-306 to E-1189; G-307 to E-1189; I-308 to E-1189; 
N-309 to E-1189; P-310 to E-1189; E-311 to E-1189; T-312 to E-1189; F-313 to E-1189; 
L-314 to E-1189; N-315 to E-1189; E-316 to E-1189; I-317 to E-1189; K-318 to E-1189; 
Y-319 to E-1189; I-320 to E-1189; A-321 to E-1189; S-322 to E-1189; D-323 to E-1189; 
P-324 to E-1189; D-325 to E-1189; D-326 to E-1189; K-327 to E-1189; H-328 to E-1189; 
F-329 to E-1189; F-330 to E-1189; N-331 to E-1189; V-332 to E-1189; T-333 to E-1189; 
D-334 to E-1189; E-335 to E-1189; A-336 to E-1189; A-337 to E-1189; L-338 to E-1189; 
K-339 to E-1189; D-340 to E-1189; I-341 to E-1189; V-342 to E-1189; D-343 to E-1189; 
A-344 to E-1189; L-345 to E-1189; G-346 to E-1189; D-347 to E-1189; R-348 to E-1189; 
I-349 to E-1189; F-350 to E-1189; S-351 to E-1189; L-352 to E-1189; E-353 to E-1189; 
G-354 to E-1189; T-355 to E-1189; N-356 to E-1189; K-357 to E-1189; N-358 to E-1189; 
E-359 to E-1189; T-360 to E-1189; S-361 to E-1189; F-362 to E-1189; G-363 to E-1189; 
L-364 to E-1189; E-365 to E-1189; M-366 to E-1189; S-367 to E-1189; Q-368 to E-1189; 
T-369 to E-1189; G-370 to E-1189; F-371 to E-1189; S-372 to E-1189; S-373 to E-1189; 
H-374 to E-1189; V-375 to E-1189; V-376 to E-1189; E-377 to E-1189; D-378 to E-1189; 
G-379 to E-1189; V-380 to E-1189; L-381 to E-1189; L-382 to E-1189; G-383 to E-1189; 
A-384 to E-1189; V-385 to E-1189; G-386 to E-1189; 
A-387 to E-1189; Y-388 to E-1189; D-389 to E-1189; W-390 to E-1189; N-391 to E-1189; 
G-392 to E-1189; A-393 to E-1189; V-394 to E-1189; L-395 to E-1189; K-396 to E-1189; 
E-397 to E-1189; T-398 to E-1189; S-399 to E-1189; A-400 to E-1189; G-401 to E-1189; 
K-402 to E-1189; V-403 to E-1189; I-404 to E-1189; P-405 to E-1189; L-406 to E-1189; 
R-407 to E-1189; E-408 to E-1189; S-409 to E-1189; Y-410 to E-1189; L-411 to E-1189; 
K-412 to E-1189; E-413 to E-1189; F-414 to E-1189; P-415 to E-1189; E-416 to E-1189; 
E-417 to E-1189; L-418 to E-1189; K-419 to E-1189; N-420 to E-1189; H-421 to E-1189; 
G-422 to E-1189; A-423 to E-1189; Y-424 to E-1189; L-425 to E-1189; G-426 to E-1189; 
Y-427 to E-1189; T-428 to E-1189; V-429 to E-1189; T-430 to E-1189; 
S-431 to E-1189; V-432 to E-1189; V-433 to E-1189; S-434 to E-1189; S-435 to E-1189; 
R-436 to E-1189; Q-437 to E-1189; G-438 to E-1189; R-439 to E-1189; V-440 to E-1189; 
Y-441 to E-1189; V-442 to E-1189; A-443 to E-1189; G-444 to E-1189; A-445 to E-1189; 
P-446 to E-1189; R-447 to E-1189; F-448 to E-1189; N-449 to E-1189; H-450 to E-1189; 
T-451 to E-1189; G-452 to E-1189; K-453 to E-1189; V-454 to E-1189; I-455 to E-1189; 
L-456 to E-1189; F-457 to E-1189; T-458 to E-1189; M-459 to E-1189; H-460 to E-1189; 
N-461 to E-1189; N-462 to E-1189; R-463 to E-1189; S-464 to E-1189; L-465 to E-1189; 
T-466 to E-1189; I-467 to E-1189; H-468 to E-1189; Q-469 to E-1189; A-470 to E-1189; 
M-471 to E-1189; R-472 to E-1189; G-473 to E-1189; Q-474 to E-1189; Q-475 to E-1189; 
I-476 to E-1189; G-477 to E-1189; S-478 to E-1189; Y-479 to E-1189; F-480 to E-1189; 
G-481 to E-1189; S-482 to E-1189; E-483 to E-1189; I-484 to E-1189; T-485 to E-1189; 
S-486 to E-1189; V-487 to E-1189; D-488 to E-1189; I-489 to E-1189; D-490 to E-1189; 
G-491 to E-1189; D-492 to E-1189; G-493 to E-1189; V-494 to E-1189; T-495 to E-1189; 
D-496 to E-1189; V-497 to E-1189; L-498 to E-1189; L-499 to E-1189; V-500 to E-1189; 
G-501 to E-1189; A-502 to E-1189; P-503 to E-1189; M-504 to E-1189; Y-505 to E-1189; 
F-506 to E-1189; N-507 to E-1189; E-508 to E-1189; G-509 to E-1189; R-510 to E-1189; 
E-511 to E-1189; R-512 to E-1189; G-513 to E-1189; K-514 to E-1189; V-515 to E-1189; 
Y-516 to E-1189; V-517 to E-1189; Y-518 to E-1189; E-519 to E-1189; L-520 to E-1189; 
R-521 to E-1189; Q-522 to E-1189; N-523 to E-1189; R-524 to E-1189; F-525 to E-1189; 
V-526 to E-1189; Y-527 to E-1189; N-528 to E-1189; G-529 to E-1189; T-530 to E-1189; 
L-531 to E-1189; K-532 to E-1189; D-533 to E-1189; 
S-534 to E-1189; H-535 to E-1189; S-536 to E-1189; Y-537 to E-1189; Q-538 to E-
1189; N-539 to E-1189; A-540 to E-1189; R-541 to E-1189; F-542 to E-1189; G-543 to E-
1189; S-544 to E-1189; S-545 to E-1189; I-546 to E-1189; A-547 to E-1189; S-548 to E-
1189; V-549 to E-1189; R-550 to E-1189; D-551 to E-1189; L-552 to E-1189; N-553
to E-1189; Q-554 to E-1189; D-555 to E-1189; S-556 to E-1189; Y-557 to E-1189; N-558
to E-1189; D-559 to E-1189; V-560 to E-1189; V-561 to E-1189; V-562 to E-1189; G-
563 to E-1189; A-564 to E-1189; P-565 to E-1189; L-566 to E-1189; E-567 to E-1189;
D-568 to E-1189; N-569 to E-1189; H-570 to E-1189; A-571 to E-1189; G-572 to E-1189;
A-573 to E-1189; I-574 to E-1189; Y-575 to E-1189; I-576 to E-1189; F-577 to E-1189;
H-578 to E-1189; G-579 to E-1189; F-580 to E-1189; R-581 to E-1189; G-582 to E-
1189; S-583 to E-1189; I-584 to E-1189; L-585 to E-1189; K-586 to E-1189; T-587 to E-
1189; P-588 to E-1189; K-589 to E-1189; Q-590 to E-1189; R-591 to E-1189; I-592 to E-
1189; T-593 to E-1189; A-594 to E-1189; S-595 to E-1189; E-596 to E-1189; L-597 to E-
1189; A-598 to E-1189; T-599 to E-1189; G-600 to E-1189; L-601 to E-1189; Q-602 to
E-1189; Y-603 to E-1189; F-604 to E-1189; G-605 to E-1189; C-606 to E-1189; S-607 to
E-1189; I-608 to E-1189; H-609 to E-1189; G-610 to E-1189; Q-611 to E-1189; L-612 to
E-1189; D-613 to E-1189; L-614 to E-1189; N-615 to E-1189; E-616 to E-1189; D-617 to
E-1189; G-618 to E-1189; L-619 to E-1189; I-620 to E-1189; D-621 to E-1189; L-622 to
E-1189; A-623 to E-1189; V-624 to E-1189; G-625 to E-1189; A-626 to E-1189; L-627 to
E-1189; G-628 to E-1189; N-629 to E-1189; A-630 to E-1189; V-631 to E-1189; I-632 to
E-1189; L-633 to E-1189; W-634 to E-1189; S-635 to E-1189; R-636 to E-1189; P-637 to
E-1189; V-638 to E-1189; V-639 to E-1189; Q-640 to E-1189; I-641 to E-1189; N-642 to
E-1189; A-643 to E-1189; S-644 to E-1189; L-645 to E-1189; H-646 to E-1189; F-647 to
E-1189; E-648 to E-1189; P-649 to E-1189; S-650 to E-1189; K-651 to E-1189; I-652
to E-1189; N-653 to E-1189; I-654 to E-1189; F-655 to E-1189; H-656 to E-1189; R-657
to E-1189; D-658 to E-1189; C-659 to E-1189; K-660 to E-1189; R-661 to E-1189; S-662
to E-1189; G-663 to E-1189; R-664 to E-1189; D-665 to E-1189; A-666 to E-1189; T-
667 to E-1189; C-668 to E-1189; L-669 to E-1189; A-670 to E-1189; A-671 to E-1189; F-
672 to E-1189; L-673 to E-1189; C-674 to E-1189; F-675 to E-1189; T-676 to E-1189; P-
677 to E-1189; I-678 to E-1189; F-679 to E-1189; L-680 to E-1189; A-681 to E-1189; P-
682 to E-1189; H-683 to E-1189; F-684 to E-1189; Q-685 to E-1189; T-686 to E-1189; T-
687 to E-1189; T-688 to E-1189; V-689 to E-1189; G-690 to E-1189; I-691 to E-1189; R-
692 to E-1189; Y-693 to E-1189; N-694 to E-1189; A-695 to E-1189; T-696 to E-
1189; M-697 to E-1189; D-698 to E-1189; E-699 to E-1189; R-700 to E-1189; R-701 to
E-1189; Y-702 to E-1189; T-703 to E-1189; P-704 to E-1189; R-705 to E-1189; A-706 to
E-1189; H-707 to E-1189; L-708 to E-1189; D-709 to E-1189; E-710 to E-1189; G-711
to E-1189; G-712 to E-1189; D-713 to E-1189; R-714 to E-1189; F-715 to E-1189; T-716
to E-1189; N-717 to E-1189; R-718 to E-1189; A-719 to E-1189; V-720 to E-1189; L-
721 to E-1189; L-722 to E-1189; S-723 to E-1189; S-724 to E-1189; G-725 to E-1189; Q-
726 to E-1189; E-727 to E-1189; L-728 to E-1189; C-729 to E-1189; E-730 to E-1189; R-
731 to E-1189; I-732 to E-1189; N-733 to E-1189; F-734 to E-1189; H-735 to E-1189; V-
736 to E-1189; L-737 to E-1189; D-738 to E-1189; T-739 to E-1189; A-740 to E-1189; D-
741 to E-1189; Y-742 to E-1189; V-743 to E-1189; K-744 to E-1189; P-745 to E-1189;
V-746 to E-1189; T-747 to E-1189; F-748 to E-1189; S-749 to E-1189; V-750 to E-1189;
E-751 to E-1189; Y-752 to E-1189; S-753 to E-1189; L-754 to E-1189; E-755 to E-1189;
D-756 to E-1189; P-757 to E-1189; D-758 to E-1189; H-759 to E-1189; G-760 to E-
1189; P-761 to E-1189; M-762 to E-1189; L-763 to E-1189; D-764 to E-1189; D-765 to
E-1189; G-766 to E-1189; W-767 to E-1189; P-768 to E-1189; T-769 to E-1189; T-770 to
E-1189; L-771 to E-1189; R-772 to E-1189; V-773 to E-1189; S-774 to E-1189; V-775 to
E-1189; P-776 to E-1189; F-777 to E-1189; W-778 to E-1189; N-779 to E-1189; G-780
to E-1189; C-781 to E-1189; N-782 to E-1189; E-783 to E-1189; D-784 to E-1189; E-785
to E-1189; H-786 to E-1189; C-787 to E-1189; V-788 to E-1189; P-789 to E-1189; D-790
to E-1189; L-791 to E-1189; V-792 to E-1189; L-793 to E-1189; D-794 to E-1189; A-795
to E-1189; R-796 to E-1189; S-797 to E-1189; D-798 to E-1189; L-799 to E-1189; P-800
to E-1189; T-801 to E-1189; A-802 to E-1189; M-803 to E-1189; E-804 to E-1189; Y-
805 to E-1189; C-806 to E-1189; Q-807 to E-1189; R-808 to E-1189; V-809 to E-1189;
L-810 to E-1189; R-811 to E-1189; K-812 to E-1189; P-813 to E-1189; A-814 to E-1189;
Q-815 to E-1189; D-816 to E-1189; C-817 to E-1189; S-818 to E-1189; A-819 to E-1189;
Y-820 to E-1189; T-821 to E-1189; L-822 to E-1189; S-823 to E-1189; F-824 to E-1189;
D-825 to E-1189; T-826 to E-1189; T-827 to E-1189; V-828 to E-1189; F-829 to E-1189;
I-830 to E-1189; I-831 to E-1189; E-832 to E-1189; S-833 to E-1189; T-834 to E-1189;
R-835 to E-1189; Q-836 to E-1189; R-837 to E-1189; V-838 to E-1189; A-839 to E-
1189; V-840 to E-1189; E-841 to E-1189; A-842 to E-1189; T-843 to E-1189; L-844 to E-
1189; E-845 to E-1189; N-846 to E-1189; R-847 to E-1189; G-848 to E-1189; E-849 to
E-1189; N-850 to E-1189; A-851 to E-1189; Y-852 to E-1189; S-853 to E-1189; T-854 to
E-1189; V-855 to E-1189; L-856 to E-1189; N-857 to E-1189; I-858 to E-1189; S-859 to
E-1189; Q-860 to E-1189; S-861 to E-1189; A-862 to E-1189; N-863 to E-1189; L-864 to
E-1189; Q-865 to E-1189; F-866 to E-1189; A-867 to E-1189; S-868 to E-1189; L-869
to E-1189; I-870 to E-1189; Q-871 to E-1189; K-872 to E-1189; E-873 to E-1189; D-874
to E-1189; S-875 to E-1189; D-876 to E-1189; G-877 to E-1189; S-878 to E-1189; I-879
to E-1189; E-880 to E-1189; C-881 to E-1189; V-882 to E-1189; N-883 to E-1189; E-
884 to E-1189; E-885 to E-1189; R-886 to E-1189; R-887 to E-1189; L-888 to E-1189; Q-
889 to E-1189; K-890 to E-1189; Q-891 to E-1189; V-892 to E-1189; C-893 to E-1189;
N-894 to E-1189; V-895 to E-1189; S-896 to E-1189; Y-897 to E-1189; P-898 to E-
1189; F-899 to E-1189; F-900 to E-1189; R-901 to E-1189; A-902 to E-1189; K-903 to E-
1189; A-904 to E-1189; K-905 to E-1189; V-906 to E-1189; A-907 to E-1189; F-908 to
E-1189; R-909 to E-1189; L-910 to E-1189; D-911 to E-1189; F-912 to E-1189; E-913
to E-1189; F-914 to E-1189; S-915 to E-1189; K-916 to E-1189; S-917 to E-1189; I-918
to E-1189; F-919 to E-1189; L-920 to E-1189; H-921 to E-1189; H-922 to E-1189; L-923
to E-1189; E-924 to E-1189; I-925 to E-1189; E-926 to E-1189; L-927 to E-1189; A-
928 to E-1189; A-929 to E-1189; G-930 to E-1189; S-931 to E-1189; D-932 to E-1189; S-
933 to E-1189; N-934 to E-1189; E-935 to E-1189; R-936 to E-1189; D-937 to E-1189;
S-938 to E-1189; T-939 to E-1189; K-940 to E-1189; E-941 to E-1189; D-942 to E-
1189; N-943 to E-1189; V-944 to E-1189; A-945 to E-1189; P-946 to E-1189; L-947 to E-
1189; R-948 to E-1189; F-949 to E-1189; H-950 to E-1189; L-951 to E-1189; K-952 to
E-1189; Y-953 to E-1189; E-954 to E-1189; A-955 to E-1189; D-956 to E-1189; V-957
to E-1189; L-958 to E-1189; F-959 to E-1189; T-960 to E-1189; R-961 to E-1189; S-962
to E-1189; S-963 to E-1189; S-964 to E-1189; L-965 to E-1189; S-966 to E-1189; H-967
to E-1189; Y-968 to E-1189; E-969 to E-1189; V-970 to E-1189; K-971 to E-1189; L-
972 to E-1189; N-973 to E-1189; S-974 to E-1189; S-975 to E-1189; L-976 to E-1189; E-
977 to E-1189; R-978 to E-1189; Y-979 to E-1189; D-980 to E-1189; G-981 to E-1189; I-
982 to E-1189; G-983 to E-1189; P-984 to E-1189; P-985 to E-1189; F-986 to E-1189; S-
987 to E-1189; C-988 to E-1189; I-989 to E-1189; F-990 to E-1189; R-991 to E-1189; I-
992 to E-1189; Q-993 to E-1189; N-994 to E-1189; L-995 to E-1189; G-996 to E-1189;
L-997 to E-1189; F-998 to E-1189; P-999 to E-1189; I-1000 to E-1189; H-1001 to E-
1189; G-1002 to E-1189; I-1003 to E-1189; M-1004 to E-1189; M-1005 to E-1189; K-
1006 to E-1189; I-1007 to E-1189; T-1008 to E-1189; I-1009 to E-1189; P-1010 to E-
1189; I-1011 to E-1189; A-1012 to E-1189; T-1013 to E-1189; R-1014 to E-1189; S-1015
to E-1189; G-1016 to E-1189; N-1017 to E-1189; R-1018 to E-1189; L-1019 to E-1189;
L-1020 to E-1189; K-1021 to E-1189; L-1022 to E-1189; R-1023 to E-1189; D-1024 to
E-1189; F-1025 to E-1189; L-1026 to E-1189; T-1027 to E-1189; D-1028 to E-1189; E-
1029 to E-1189; V-1030 to E-1189; A-1031 to E-1189; N-1032 to E-1189; T-1033 to E-
1189; S-1034 to E-1189; C-1035 to E-1189; N-1036 to E-1189; I-1037 to E-1189; W-
1038 to E-1189; G-1039 to E-1189; N-1040 to E-1189; S-1041 to E-1189; T-1042 to E-
1189; E-1043 to E-1189; Y-1044 to E-1189; R-1045 to E-1189; P-1046 to E-1189; T-
1047 to E-1189; P-1048 to E-1189; V-1049 to E-1189; E-1050 to E-1189; E-1051 to E-
1189; D-1052 to E-1189; L-1053 to E-1189; R-1054 to E-1189; R-1055 to E-1189; A-
1056 to E-1189; P-1057 to E-1189; Q-1058 to E-1189; L-1059 to E-1189; N-1060 to E-
1189; H-1061 to E-1189; S-1062 to E-1189; N-1063 to E-1189; S-1064 to E-1189; D-
1065 to E-1189; V-1066 to E-1189; V-1067 to E-1189; S-1068 to E-1189; I-1069 to E-
1189; N-1070 to E-1189; C-1071 to E-1189; N-1072 to E-1189; I-1073 to E-1189; R-
1074 to E-1189; L-1075 to E-1189; V-1076 to E-1189; P-1077 to E-1189; N-1078 to E-
1189; Q-1079 to E-1189; E-1080 to E-1189; I-1081 to E-1189; N-1082 to E-1189; F-
1083 to E-1189; H-1084 to E-1189; L-1085 to E-1189; L-1086 to E-1189; G-1087 to E-
1189; N-1088 to E-1189; L-1089 to E-1189; W-1090 to E-1189; L-1091 to E-1189; R-
1092 to E-1189; S-1093 to E-1189; L-1094 to E-1189; K-1095 to E-1189; A-1096 to E-
1189; L-1097 to E-1189; K-1098 to E-1189; Y-1099 to E-1189; K-1100 to E-1189; S-
1101 to E-1189; M-1102 to E-1189; K-1103 to E-1189; I-1104 to E-1189; M-1105 to E-
1189; V-1106 to E-1189; N-1107 to E-1189; A-1108 to E-1189; A-1109 to E-1189; L-
1110 to E-1189; Q-1111 to E-1189; R-1112 to E-1189; Q-1113 to E-1189; F-1114 to E-
1189; H-1115 to E-1189; S-1116 to E-1189; P-1117 to E-1189; F-1118 to E-1189; I-1119
to E-1189; F-1120 to E-1189; R-1121 to E-1189; E-1122 to E-1189; E-1123 to E-1189; D-
1124 to E-1189; P-1125 to E-1189; S-1126 to E-1189; R-1127 to E-1189; Q-1128 to E-
1189; I-1129 to E-1189; V-1130 to E-1189; F-1131 to E-1189; E-1132 to E-1189; I-1133
to E-1189; S-1134 to E-1189; K-1135 to E-1189; Q-1136 to E-1189; E-1137 to E-
1189; D-1138 to E-1189; W-1139 to E-1189; Q-1140 to E-1189; V-1141 to E-1189; P-
1142 to E-1189; I-1143 to E-1189; W-1144 to E-1189; I-1145 to E-1189; I-1146 to E-
1189; V-1147 to E-1189; G-1148 to E-1189; S-1149 to E-1189; T-1150 to E-1189; L-
1151 to E-1189; G-1152 to E-1189; G-1153 to E-1189; L-1154 to E-1189; L-1155 to E-
1189; L-1156 to E-1189; L-1157 to E-1189; A-1158 to E-1189; L-1159 to E-1189; L-
1160 to E-1189; V-1161 to E-1189; L-1162 to E-1189; A-1163 to E-1189; L-1164 to E-
1189; W-1165 to E-1189; K-1166 to E-1189; L-1167 to E-1189; G-1168 to E-1189; F-
1169 to E-1189; F-1170 to E-1189; R-1171 to E-1189; S-1172 to E-1189; A-1173 to E-
1189; R-1174 to E-1189; R-1175 to E-1189; R-1176 to E-1189; R-1177 to E-1189; E-
1178 to E-1189; P-1179 to E-1189; G-1180 to E-1189; L-1181 to E-1189; D-1182 to E-
1189; P-1183 to E-1189; T-1184 to E-1189; of SEQ ID NO:35. Polypeptides encoded by
these polynucleotides are also encompassed by the invention.

Also as mentioned above, even if deletion of one or more amino acids from the
C-terminus of a protein results in modification or loss of one or more biological
functions of the protein, other functional activities (e.g., biological activities such as the
ability to elicit mitogenic activity or induce differentiation of normal or malignant cells,
ability to multimerize, etc.) may still be retained. For example, the ability of the shortened a11
mutein to induce and/or bind to antibodies which recognize the complete or mature forms
of the polypeptide generally will be retained when fewer than the majority of the residues
of the complete or mature polypeptide are removed from the C-terminus. Whether a
particular polypeptide lacking C-terminal residues of a complete polypeptide retains such
immunologic activities can readily be determined by routine methods described herein
and otherwise known in the art. It is not unlikely that an a11 mutein with a large number
of deleted C-terminal amino acid residues may retain some biological or immunogenic
activities. In fact, peptides composed of as few as six a11 amino acid residues may often
evoke an immune response.

Accordingly, the present invention further provides polypeptides having one or
more residues deleted from the carboxy terminus of the amino acid sequence of the a11
polypeptide shown in Figures 19A-F, up to the glycine residue at position number 6, and
polynucleotides encoding such polypeptides. In particular, the present invention
provides polypeptides comprising the amino acid sequence of residues 1-m1 of Figures
19A-F, where m1 is an integer from 6 to 1189 corresponding to the position of the amino
acid residue in Figures 19A-F. Moreover, the invention provides polynucleotides
encoding polypeptides comprising, or alternatively consisting of, the amino acid
sequence of C-terminal deletions of the a11 polypeptide of the invention shown as SEQ
ID NO:35, which include polypeptides comprising the amino acid sequence of residues: M-1 to
L-1188; M-1 to V-1187; M-1 to K-1186; M-1 to P-1185; M-1 to T-1184; M-1 to P-1183;
M-1 to D-1182; M-1 to L-1181; M-1 to G-1180; M-1 to P-1179; M-1 to E-1178; M-1 to
R-1177; M-1 to R-1176; M-1 to R-1175; M-1 to R-1174; M-1 to A-1173; M-1 to S-1172;
M-1 to R-1171; M-1 to F-1170; M-1 to F-1169; M-1 to G-1168; M-1 to L-1167; M-1 to
K-1166; M-1 to W-1165; M-1 to L-1164; M-1 to A-1163; M-1 to L-1162; M-1 to V-
1161; M-1 to L-1160; M-1 to L-1159; M-1 to A-1158; M-1 to L-1157; M-1 to L-1156;
M-1 to L-1155; M-1 to L-1154; M-1 to G-1153; M-1 to G-1152; M-1 to L-1151; M-1 to
T-1150; M-1 to S-1149; M-1 to G-1148; M-1 to V-1147; M-1 to I-1146; M-1 to I-1145;
M-1 to W-1144; M-1 to I-1143; M-1 to P-1142; M-1 to V-1141; M-1 to Q-1140; M-1
to W-1139; M-1 to D-1138; M-1 to E-1137; M-1 to Q-1136; M-1 to K-1135; M-1 to S-
1134; M-1 to I-1133; M-1 to E-1132; M-1 to F-1131; M-1 to V-1130; M-1 to I-1129; M-
1 to Q-1128; M-1 to R-1127; M-1 to S-1126; M-1 to P-1125; M-1 to D-1124; M-1 to E-
1123; M-1 to E-1122; M-1 to R-1121; M-1 to F-1120; M-1 to I-1119; M-1 to F-1118; M-
1 to P-1117; M-1 to S-1116; M-1 to H-1115; M-1 to F-1114; M-1 to Q-1113; M-1 to R-
1112; M-1 to Q-1111; M-1 to L-1110; M-1 to A-1109; M-1 to A-1108; M-1 to N-1107;
M-1 to V-1106; M-1 to M-1105; M-1 to I-1104; M-1 to K-1103; M-1 to M-1102; M-1 to
S-1101; M-1 to K-1100; M-1 to Y-1099; M-1 to K-1098; M-1 to L-1097; M-1 to A-1096;
M-1 to K-1095; M-1 to L-1094; M-1 to S-1093; M-1 to R-1092; M-1 to L-1091; M-1 to
W-1090; M-1 to L-1089; M-1 to N-1088; M-1 to G-1087; M-1 to L-1086; M-1 to L-
1085; M-1 to H-1084; M-1 to F-1083; M-1 to N-1082; M-1 to I-1081; M-1 to E-1080;
M-1 to Q-1079; M-1 to N-1078; M-1 to P-1077; M-1 to V-1076; M-1 to L-1075; M-1 to
R-1074; M-1 to I-1073; M-1 to N-1072; M-1 to C-1071; M-1 to N-1070; M-1 to I-1069;
M-1 to S-1068; M-1 to V-1067; M-1 to V-1066; M-1 to D-1065; M-1 to S-1064; M-1 to
N-1063; M-1 to S-1062; M-1 to H-1061; M-1 to N-1060; M-1 to L-1059; M-1 to Q-1058;
M-1 to P-1057; M-1 to A-1056; M-1 to R-1055; M-1 to R-1054; M-1 to L-1053; M-1 to
D-1052; M-1 to E-1051; M-1 to E-1050; M-1 to V-1049; M-1 to P-1048; M-1 to T-1047;
M-1 to P-1046; M-1 to R-1045; M-1 to Y-1044; M-1 to E-1043; M-1 to T-1042; M-1 to
S-1041; M-1 to N-1040; M-1 to G-1039; M-1 to W-1038; M-1 to I-1037; M-1 to N-1036;
M-1 to C-1035; M-1 to S-1034; M-1 to T-1033; M-1 to N-1032; M-1 to A-1031; M-1 to
V-1030; M-1 to E-1029; M-1 to D-1028; M-1 to T-1027; M-1 to L-1026; M-1 to F-1025;
M-1 to D-1024; M-1 to R-1023; M-1 to L-1022; M-1 to K-1021; M-1 to L-1020; M-1
to L-1019; M-1 to R-1018; M-1 to N-1017; M-1 to G-1016; M-1 to S-1015; M-1 to R-
1014; M-1 to T-1013; M-1 to A-1012; M-1 to I-1011; M-1 to P-1010; M-1 to I-1009; M-
1 to T-1008; M-1 to I-1007; M-1 to K-1006; M-1 to M-1005; M-1 to M-1004; M-1 to I-
1003; M-1 to G-1002; M-1 to H-1001; M-1 to I-1000; M-1 to P-999; M-1 to F-998; M-1
to L-997; M-1 to G-996; M-1 to L-995; M-1 to N-994; M-1 to Q-993; M-1 to I-992; M-1
to R-991; M-1 to F-990; M-1 to I-989; M-1 to C-988; M-1 to S-987; M-1 to F-986; M-1
to P-985; M-1 to P-984; M-1 to G-983; M-1 to I-982; M-1 to G-981; M-1 to D-980; M-1
to Y-979; M-1 to R-978; M-1 to E-977; M-1 to L-976; M-1 to S-975; M-1 to S-974; M-1
to N-973; M-1 to L-972; M-1 to K-971; M-1 to V-970; M-1 to E-969; M-1 to Y-968; M-1
to H-967; M-1 to S-966; M-1 to L-965; M-1 to S-964; M-1 to S-963; M-1 to S-962; M-1
to R-961; M-1 to T-960; M-1 to F-959; M-1 to L-958; M-1 to V-957; M-1 to D-956; M-1
to A-955; M-1 to E-954; M-1 to Y-953; M-1 to K-952; M-1 to L-951; M-1 to H-950; M-1
to F-949; M-1 to R-948; M-1 to L-947; M-1 to P-946; M-1 to A-945; M-1 to V-944; M-1
to N-943; M-1 to D-942; M-1 to E-941; M-1 to K-940; M-1 to T-939; M-1 to S-938; M-1
to D-937; M-1 to R-936; M-1 to E-935; M-1 to N-934; M-1 to S-933; M-1 to D-932; M-1
to S-931; M-1 to G-930; M-1 to A-929; M-1 to A-928; M-1 to L-927; M-1 to E-926; M-1
to I-925; M-1 to E-924; M-1 to L-923; M-1 to H-922; M-1 to H-921; M-1 to L-920; M-1
to F-919; M-1 to I-918; M-1 to S-917; M-1 to K-916; M-1 to S-915; M-1 to F-914; M-1
to E-913; M-1 to F-912; M-1 to D-911; M-1 to L-910; M-1 to R-909; M-1 to F-908; M-1
to A-907; M-1 to V-906; M-1 to K-905; M-1 to A-904; M-1 to K-903; M-1 to A-902; M-
1 to R-901; M-1 to F-900; M-1 to F-899; M-1 to P-898; M-1 to Y-897; M-1 to S-896; M-
1 to V-895; M-1 to N-894; M-1 to C-893; M-1 to V-892; M-1 to Q-891; M-1 to K-890;
M-1 to Q-889; M-1 to L-888; M-1 to R-887; M-1 to R-886; M-1 to E-885; M-1 to E-884;
M-1 to N-883; M-1 to V-882; M-1 to C-881; M-1 to E-880; M-1 to I-879; M-1 to S-878;
M-1 to G-877; M-1 to D-876; M-1 to S-875; M-1 to D-874; M-1 to E-873; M-1 to K-872;
M-1 to Q-871; M-1 to I-870; M-1 to L-869; M-1 to S-868; M-1 to A-867; M-1 to F-866;
M-1 to Q-865; M-1 to L-864; M-1 to N-863; M-1 to A-862; M-1 to S-861; M-1 to Q-860;
M-1 to S-859; M-1 to I-858; M-1 to N-857; M-1 to L-856; M-1 to V-855; M-1 to T-
854; M-1 to S-853; M-1 to Y-852; M-1 to A-851; M-1 to N-850; M-1 to E-849; M-1 to
G-848; M-1 to R-847; M-1 to N-846; M-1 to E-845; M-1 to L-844; M-1 to T-843; M-1 to
A-842; M-1 to E-841; M-1 to V-840; M-1 to A-839; M-1 to V-838; M-1 to R-837; M-1 to
Q-836; M-1 to R-835; M-1 to T-834; M-1 to S-833; M-1 to E-832; M-1 to I-831; M-1 to
I-830; M-1 to F-829; M-1 to V-828; M-1 to T-827; M-1 to T-826; M-1 to D-825; M-1 to
F-824; M-1 to S-823; M-1 to L-822; M-1 to T-821; M-1 to Y-820; M-1 to A-819; M-1 to
S-818; M-1 to C-817; M-1 to D-816; M-1 to Q-815; M-1 to A-814; M-1 to P-813; M-1 to
K-812; M-1 to R-811; M-1 to L-810; M-1 to V-809; M-1 to R-808; M-1 to Q-807; M-1 to
C-806; M-1 to Y-805; M-1 to E-804; M-1 to M-803; M-1 to A-802; M-1 to T-801; M-1
to P-800; M-1 to L-799; M-1 to D-798; M-1 to S-797; M-1 to R-796; M-1 to A-795; M-1
to D-794; M-1 to L-793; M-1 to V-792; M-1 to L-791; M-1 to D-790; M-1 to P-789; M-1
to V-788; M-1 to C-787; M-1 to H-786; M-1 to E-785; M-1 to D-784; M-1 to E-783; M-
1 to N-782; M-1 to C-781; M-1 to G-780; M-1 to N-779; M-1 to W-778; M-1 to F-777;
M-1 to P-776; M-1 to V-775; M-1 to S-774; M-1 to V-773; M-1 to R-772; M-1 to L-771;
M-1 to T-770; M-1 to T-769; M-1 to P-768; M-1 to W-767; M-1 to G-766; M-1 to D-765;
M-1 to D-764; M-1 to L-763; M-1 to M-762; M-1 to P-761; M-1 to G-760; M-1 to H-
759; M-1 to D-758; M-1 to P-757; M-1 to D-756; M-1 to E-755; M-1 to L-754; M-1 to S-
753; M-1 to Y-752; M-1 to E-751; M-1 to V-750; M-1 to S-749; M-1 to F-748; M-1 to
T-747; M-1 to V-746; M-1 to P-745; M-1 to K-744; M-1 to V-743; M-1 to Y-742; M-1 to
D-741; M-1 to A-740; M-1 to T-739; M-1 to D-738; M-1 to L-737; M-1 to V-736; M-1 to
H-735; M-1 to F-734; M-1 to N-733; M-1 to I-732; M-1 to R-731; M-1 to E-730; M-1 to
C-729; M-1 to L-728; M-1 to E-727; M-1 to Q-726; M-1 to G-725; M-1 to S-724; M-1 to
S-723; M-1 to L-722; M-1 to L-721; M-1 to V-720; M-1 to A-719; M-1 to R-718; M-1
to N-717; M-1 to T-716; M-1 to F-715; M-1 to R-714; M-1 to D-713; M-1 to G-712; M-1
to G-711; M-1 to E-710; M-1 to D-709; M-1 to L-708; M-1 to H-707; M-1 to A-706; M-1
to R-705; M-1 to P-704; M-1 to T-703; M-1 to Y-702; M-1 to R-701; M-1 to R-700; M-1
to E-699; M-1 to D-698; M-1 to M-697; M-1 to T-696; M-1 to A-695; M-1 to N-694; M-
1 to Y-693; M-1 to R-692; M-1 to I-691; M-1 to G-690; M-1 to V-689; M-1 to T-688; M-
1 to T-687; M-1 to T-686; M-1 to Q-685; M-1 to F-684; M-1 to H-683; M-1 to P-682; M-
1 to A-681; M-1 to L-680; M-1 to F-679; M-1 to I-678; M-1 to P-677; M-1 to T-676; M-
1 to F-675; M-1 to C-674; M-1 to L-673; M-1 to F-672; M-1 to A-671; M-1 to A-670; M-
1 to L-669; M-1 to C-668; M-1 to T-667; M-1 to A-666; M-1 to D-665; M-1 to R-664;
M-1 to G-663; M-1 to S-662; M-1 to R-661; M-1 to K-660; M-1 to C-659; M-1 to D-658;
M-1 to R-657; M-1 to H-656; M-1 to F-655; M-1 to I-654; M-1 to N-653; M-1 to I-652;
M-1 to K-651; M-1 to S-650; M-1 to P-649; M-1 to E-648; M-1 to F-647; M-1 to H-646;
M-1 to L-645; M-1 to S-644; M-1 to A-643; M-1 to N-642; M-1 to I-641; M-1 to Q-640;
M-1 to V-639; M-1 to V-638; M-1 to P-637; M-1 to R-636; M-1 to S-635; M-1 to W-634;
M-1 to L-633; M-1 to I-632; M-1 to V-631; M-1 to A-630; M-1 to N-629; M-1 to G-628;
M-1 to L-627; M-1 to A-626; M-1 to G-625; M-1 to V-624; M-1 to A-623; M-1 to L-622;
M-1 to D-621; M-1 to I-620; M-1 to L-619; M-1 to G-618; M-1 to D-617; M-1 to E-616;
M-1 to N-615; M-1 to L-614; M-1 to D-613; M-1 to L-612; M-1 to Q-611; M-1 to G-610;
M-1 to H-609; M-1 to I-608; M-1 to S-607; M-1 to C-606; M-1 to G-605; M-1 to F-604;
M-1 to Y-603; M-1 to Q-602; M-1 to L-601; M-1 to G-600; M-1 to T-599; M-1 to A-598;
M-1 to L-597; M-1 to E-596; M-1 to S-595; M-1 to A-594; M-1 to T-593; M-1 to I-592;
M-1 to R-591; M-1 to Q-590; M-1 to K-589; M-1 to P-588; M-1 to T-587; M-1 to K-
586; M-1 to L-585; M-1 to I-584; M-1 to S-583; M-1 to G-582; M-1 to R-581; M-1 to F-
580; M-1 to G-579; M-1 to H-578; M-1 to F-577; M-1 to I-576; M-1 to Y-575; M-1 to I-
574; M-1 to A-573; M-1 to G-572; M-1 to A-571; M-1 to H-570; M-1 to N-569; M-1 to
D-568; M-1 to E-567; M-1 to L-566; M-1 to P-565; M-1 to A-564; M-1 to G-563; M-1 to
V-562; M-1 to V-561; M-1 to V-560; M-1 to D-559; M-1 to N-558; M-1 to Y-557; M-1
to S-556; M-1 to D-555; M-1 to Q-554; M-1 to N-553; M-1 to L-552; M-1 to D-551; M-1
to R-550; M-1 to V-549; M-1 to S-548; M-1 to A-547; M-1 to I-546; M-1 to S-545; M-1
to S-544; M-1 to G-543; M-1 to F-542; M-1 to R-541; M-1 to A-540; M-1 to N-539; M-1
to Q-538; M-1 to Y-537; M-1 to S-536; M-1 to H-535; M-1 to S-534; M-1 to D-533; M-1
to K-532; M-1 to L-531; M-1 to T-530; M-1 to G-529; M-1 to N-528; M-1 to Y-527; M-1
to V-526; M-1 to F-525; M-1 to R-524; M-1 to N-523; M-1 to Q-522; M-1 to R-521; M-
1 to L-520; M-1 to E-519; M-1 to Y-518; M-1 to V-517; M-1 to Y-516; M-1 to V-515;
M-1 to K-514; M-1 to G-513; M-1 to R-512; M-1 to E-511; M-1 to R-510; M-1 to G-509;
M-1 to E-508; M-1 to N-507; M-1 to F-506; M-1 to Y-505; M-1 to M-504; M-1 to P-503;
M-1 to A-502; M-1 to G-501; M-1 to V-500; M-1 to L-499; M-1 to L-498; M-1 to V-497;
M-1 to D-496; M-1 to T-495; M-1 to V-494; M-1 to G-493; M-1 to D-492; M-1 to G-
491; M-1 to D-490; M-1 to I-489; M-1 to D-488; M-1 to V-487; M-1 to S-486; M-1 to T-
485; M-1 to I-484; M-1 to E-483; M-1 to S-482; M-1 to G-481; M-1 to F-480; M-1 to Y-
479; M-1 to S-478; M-1 to G-477; M-1 to I-476; M-1 to Q-475; M-1 to Q-474; M-1 to G-
473; M-1 to R-472; M-1 to M-471; M-1 to A-470; M-1 to Q-469; M-1 to H-468; M-1 to
I-467; M-1 to T-466; M-1 to L-465; M-1 to S-464; M-1 to R-463; M-1 to N-462; M-1 to
N-461; M-1 to H-460; M-1 to M-459; M-1 to T-458; M-1 to F-457; M-1 to L-456; M-1 to
I-455; M-1 to V-454; M-1 to K-453; M-1 to G-452; M-1 to T-451; M-1 to H-450; M-1 to
N-449; M-1 to F-448; M-1 to R-447; M-1 to P-446; M-1 to A-445; M-1 to G-444; M-1 to
A-443; M-1 to V-442; M-1 to Y-441; M-1 to V-440; M-1 to R-439; M-1 to G-438; M-1
to Q-437; M-1 to R-436; M-1 to S-435; M-1 to S-434; M-1 to V-433; M-1 to V-432; M-1
to S-431; M-1 to T-430; M-1 to V-429; M-1 to T-428; M-1 to Y-427; M-1 to G-426; M-1
to L-425; M-1 to Y-424; M-1 to A-423; M-1 to G-422; M-1 to H-421; M-1 to N-420; M-
1 to K-419; M-1 to L-418; M-1 to E-417; M-1 to E-416; M-1 to P-415; M-1 to F-414; M-
1 to E-413; M-1 to K-412; M-1 to L-411; M-1 to Y-410; M-1 to S-409; M-1 to E-408; M-
1 to R-407; M-1 to L-406; M-1 to P-405; M-1 to I-404; M-1 to V-403; M-1 to K-402; M-
1 to G-401; M-1 to A-400; M-1 to S-399; M-1 to T-398; M-1 to E-397; M-1 to K-396;
M-1 to L-395; M-1 to V-394; M-1 to A-393; M-1 to G-392; M-1 to N-391; M-1 to W-
390; M-1 to D-389; M-1 to Y-388; M-1 to A-387; M-1 to G-386; M-1 to V-385; M-1 to
A-384; M-1 to G-383; M-1 to L-382; M-1 to L-381; M-1 to V-380; M-1 to G-379; M-1 to
D-378; M-1 to E-377; M-1 to V-376; M-1 to V-375; M-1 to H-374; M-1 to S-373; M-1 to
S-372; M-1 to F-371; M-1 to G-370; M-1 to T-369; M-1 to Q-368; M-1 to S-367; M-1 to
M-366; M-1 to E-365; M-1 to L-364; M-1 to G-363; M-1 to F-362; M-1 to S-361; M-1 to
T-360; M-1 to E-359; M-1 to N-358; M-1 to K-357; M-1 to N-356; M-1 to T-355; M-1 to
G-354; M-1 to E-353; M-1 to L-352; M-1 to S-351; M-1 to F-350; M-1 to I-349; M-1 to
R-348; M-1 to D-347; M-1 to G-346; M-1 to L-345; M-1 to A-344; M-1 to D-343; M-1 to
V-342; M-1 to I-341; M-1 to D-340; M-1 to K-339; M-1 to L-338; M-1 to A-337; M-1 to
A-336; M-1 to E-335; M-1 to D-334; M-1 to T-333; M-1 to V-332; M-1 to N-331; M-1 to
F-330; M-1 to F-329; M-1 to H-328; M-1 to K-327; M-1 to D-326; M-1 to D-325; M-1 to
P-324; M-1 to D-323; M-1 to S-322; M-1 to A-321; M-1 to I-320; M-1 to Y-319; M-1 to
K-318; M-1 to I-317; M-1 to E-316; M-1 to N-315; M-1 to L-314; M-1 to F-313; M-1 to
T-312; M-1 to E-311; M-1 to P-310; M-1 to N-309; M-1 to I-308; M-1 to G-307; M-1 to
R-306; M-1 to R-305; M-1 to N-304; M-1 to Y-303; M-1 to Y-302; M-1 to G-301; M-1
to L-300; M-1 to V-299; M-1 to A-298; M-1 to V-297; M-1 to A-296; M-1 to Y-295; M-
1 to R-294; M-1 to T-293; M-1 to V-292; M-1 to N-291; M-1 to D-290; M-1 to R-289;
M-1 to E-288; M-1 to S-287; M-1 to Q-286; M-1 to Q-285; M-1 to I-284; M-1 to V-
283; M-1 to K-282; M-1 to E-281; M-1 to L-280; M-1 to D-279; M-1 to P-278; M-1 to S-
277; M-1 to D-276; M-1 to H-275; M-1 to S-274; M-1 to E-273; M-1 to G-272; M-1 to
D-271; M-1 to T-270; M-1 to I-269; M-1 to V-268; M-1 to I-267; M-1 to M-266; M-1 to
V-265; M-1 to K-264; M-1 to K-263; M-1 to A-262; M-1 to G-261; M-1 to K-260; M-1
to R-259; M-1 to G-258; M-1 to G-257; M-1 to K-256; M-1 to Q-255; M-1 to F-254; M-
1 to A-253; M-1 to E-252; M-1 to S-251; M-1 to R-250; M-1 to A-249; M-1 to F-248; M-
1 to E-247; M-1 to I-246; M-1 to G-245; M-1 to F-244; M-1 to A-243; M-1 to T-242; M-
1 to R-241; M-1 to T-240; M-1 to E-239; M-1 to T-238; M-1 to G-237; M-1 to G-236;
M-1 to R-235; M-1 to Q-234; M-1 to E-233; M-1 to I-232; M-1 to H-231; M-1 to S-230;
M-1 to A-229; M-1 to A-228; M-1 to E-227; M-1 to V-226; M-1 to V-225; M-1 to D-
224; M-1 to K-223; M-1 to V-222; M-1 to S-221; M-1 to R-220; M-1 to Y-219; M-1 to
D-218; M-1 to N-217; M-1 to L-216; M-1 to H-215; M-1 to F-214; M-1 to E-213; M-1 to
H-212; M-1 to V-211; M-1 to V-210; M-1 to D-209; M-1 to E-208; M-1 to G-207; M-1
to Y-206; M-1 to Q-205; M-1 to V-204; M-1 to V-203; M-1 to G-202; M-1 to V-201; M-
1 to Q-200; M-1 to I-199; M-1 to Q-198; M-1 to G-197; M-1 to P-196; M-1 to G-195; M-
1 to I-194; M-1 to Y-193; M-1 to F-192; M-1 to K-191; M-1 to K-190; M-1 to L-189; M-
1 to I-188; M-1 to N-187; M-1 to I-186; M-1 to L-185; M-1 to F-184; M-1 to H-183; M-
1 to Q-182; M-1 to V-181; M-1 to E-180; M-1 to V-179; M-1 to W-178; M-1 to P-177;
M-1 to Y-176; M-1 to I-175; M-1 to S-174; M-1 to N-173; M-1 to S-172; M-1 to G-171;
M-1 to D-170; M-1 to L-169; M-1 to V-168; M-1 to I-167; M-1 to V-166; M-1 to I-165;
M-1 to D-164; M-1 to M-163; M-1 to Y-162; M-1 to T-161; M-1 to Q-160; M-1 to C-
159; M-1 to R-158; M-1 to Q-157; M-1 to L-156; M-1 to A-155; M-1 to P-154; M-1 to
A-153; M-1 to V-152; M-1 to T-151; M-1 to K-150; M-1 to S-149; M-1 to F-148; M-1 to
R-147; M-1 to F-146; M-1 to N-145; M-1 to S-144; M-1 to N-143; M-1 to V-142; M-1 to
R-141; M-1 to S-140; M-1 to C-139; M-1 to M-138; M-1 to G-137; M-1 to T-136; M-1 to
T-135; M-1 to Y-134; M-1 to Y-133; M-1 to S-132; M-1 to S-131; M-1 to G-130; M-1 to
C-129; M-1 to E-128; M-1 to H-127; M-1 to S-126; M-1 to W-125; M-1 to L-124; M-1 to
P-123; M-1 to S-122; M-1 to C-121; M-1 to A-120; M-1 to L-119; M-1 to F-118; M-1 to
S-117; M-1 to N-116; M-1 to D-115; M-1 to K-114; M-1 to P-113; M-1 to N-112; M-1 to
T-111; M-1 to A-110; M-1 to L-109; M-1 to S-108; M-1 to L-107; M-1 to G-106; M-1 to
L-105; M-1 to R-104; M-1 to M-103; M-1 to N-102; M-1 to D-101; M-1 to K-100; M-1
to R-99; M-1 to E-98; M-1 to S-97; M-1 to V-96; M-1 to N-95; M-1 to S-94; M-1 to L-
93; M-1 to T-92; M-1 to V-91; M-1 to R-90; M-1 to G-89; M-1 to L-88; M-1 to N-87;
M-1 to L-86; M-1 to K-85; M-1 to T-84; M-1 to C-83; M-1 to N-82; M-1 to G-81; M-1 to
H-80; M-1 to I-79; M-1 to V-78; M-1 to P-77; M-1 to C-76; M-1 to K-75; M-1 to Y-74;
M-1 to V-73; M-1 to D-72; M-1 to G-71; M-1 to T-70; M-1 to K-69; M-1 to Q-68; M-1
to Y-67; M-1 to G-66; M-1 to N-65; M-1 to T-64; M-1 to E-63; M-1 to L-62; M-1 to P-
61; M-1 to A-60; M-1 to G-59; M-1 to V-58; M-1 to V-57; M-1 to L-56; M-1 to W-55;
M-1 to K-54; M-1 to N-53; M-1 to G-52; M-1 to S-51; M-1 to I-50; M-1 to D-49; M-1 to
H-48; M-1 to Q-47; M-1 to Q-46; M-1 to V-45; M-1 to T-44; M-1 to Y-43; M-1 to G-42;
M-1 to F-41; M-1 to F-40; M-1 to A-39; M-1 to T-38; M-1 to R-37; M-1 to S-36; M-1 to
G-35; M-1 to P-34; M-1 to I-33; M-1 to V-32; M-1 to R-31; M-1 to P-30; M-1 to K-29;
M-1 to R-28; M-1 to T-27; M-1 to D-26; M-1 to M-25; M-1 to N-24; M-1 to F-23; M-1
to T-22; M-1 to D-21; M-1 to T-20; M-1 to F-19; M-1 to G-18; M-1 to P-17; M-1 to W-
16; M-1 to L-15; M-1 to S-14; M-1 to L-13; M-1 to A-12; M-1 to W-11; M-1 to A-10; M-
1 to V-9; M-1 to V-8; M-1 to L-7; M-1 to G-6; of SEQ ID NO:35. Polypeptides encoded
by these polynucleotides are also encompassed by the invention.
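
The two enumerations above follow a simple rule: each N-terminal deletion runs from an internal residue to the C-terminal residue (E-1189), each C-terminal deletion runs from M-1 to an internal residue, and every listed fragment retains at least six residues. As an illustrative sketch only, the same labels can be generated programmatically. The function names and the ten-residue toy sequence below are hypothetical and are not taken from SEQ ID NO:35.

```python
def n_terminal_deletions(seq: str, min_len: int = 6) -> list[str]:
    """Labels 'X-n to Z-N' for fragments lacking 1 or more N-terminal residues.

    Positions are 1-based. The shortest fragment kept has min_len residues.
    """
    n = len(seq)
    if n == 0:
        return []
    last = f"{seq[-1]}-{n}"
    # Start positions 2 .. n - min_len + 1 leave at least min_len residues.
    return [f"{seq[i - 1]}-{i} to {last}" for i in range(2, n - min_len + 2)]


def c_terminal_deletions(seq: str, min_len: int = 6) -> list[str]:
    """Labels 'A-1 to X-m' for fragments lacking 1 or more C-terminal residues.

    End positions run from len(seq) - 1 down to min_len, matching the
    descending order used in the listing above.
    """
    n = len(seq)
    return [f"{seq[0]}-1 to {seq[m - 1]}-{m}" for m in range(n - 1, min_len - 1, -1)]


if __name__ == "__main__":
    toy = "MKTAYIAKQR"  # hypothetical 10-residue sequence, for illustration only
    print("; ".join(n_terminal_deletions(toy)))
    print("; ".join(c_terminal_deletions(toy)))
```

For a 1189-residue sequence with the default six-residue minimum, each function would yield 1183 labels: start positions 2 through 1184 for N-terminal deletions and end positions 1188 through 6 for C-terminal deletions.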

In addition, the invention provides nucleic acid molecules having nucleotide
sequences related to extensive portions of SEQ ID NO:17 which have been determined
from the following related cDNA genes: HEEAB54R (SEQ ID NO:104), HRDAF83R
(SEQ ID NO:105), HOUBC62R (SEQ ID NO:106), HCDBI19R (SEQ ID NO:107),
HOHCU94R (SEQ ID NO:108), HOACC13R (SEQ ID NO:109), HCDAP21R (SEQ ID
NO:110), HNHHA34R (SEQ ID NO:111), HOHEA75R (SEQ ID NO:112) and
HNGEL59R (SEQ ID NO:113).

Based on the sequence similarity to the human integrin alpha 1 subunit, the
translation product of this gene is expected to share at least some biological activities
with integrin proteins, and specifically the integrin alpha 1 protein. Such activities are
known in the art, some of which are described elsewhere herein.

Specifically, polynucleotides and polypeptides of the invention are also useful for
modulating the differentiation of normal and malignant cells, modulating the
proliferation and/or differentiation of cancer and neoplastic cells, and modulating the
immune response. Polynucleotides and polypeptides of the invention may represent a
diagnostic marker for hematopoietic and immune diseases and/or disorders. The
full-length protein should be a secreted protein, based upon homology to the integrin
family. Therefore, it is secreted into serum, urine, or feces, and thus its levels are
assayable from patient samples. Assuming specific expression levels are reflective of the
presence of immune disorders, this protein would provide a convenient diagnostic for
early detection. In addition, expression of this gene product may also be linked to the
progression of immune diseases, and therefore may itself actually represent a therapeutic
or therapeutic target for the treatment of cancer.

Polynucleotides and polypeptides of the invention may play an important role in
the pathogenesis of human cancers and cellular transformation, particularly those of the
immune and hematopoietic systems. Polynucleotides and polypeptides of the invention
may also be involved in the pathogenesis of developmental abnormalities based upon their
potential effects on proliferation and differentiation of cells and tissue cell types. Due to
the potential proliferating and differentiating activity of said polynucleotides and
polypeptides, the invention is useful as a therapeutic agent in inducing tissue
regeneration and for treating inflammatory conditions (e.g., inflammatory bowel syndrome,
diverticulitis, etc.). Moreover, the invention is useful in modulating the immune
response to aberrant polypeptides, as may exist in rapidly proliferating cells and tissue
cell types, particularly in adenocarcinoma cells, and other cancers.

Alternatively, the expression within cellular sources marked by proliferating cells 
indicates this protein may play a role in the regulation of cellular division, and may show 
utility in the diagnosis, treatment, and/or prevention of developmental diseases and 
disorders, including cancer, and other proliferative conditions. Representative uses are 
described in the "Hyperproliferative Disorders" and "Regeneration" sections below and 
elsewhere herein. Briefly, developmental tissues rely on decisions involving cell 
differentiation and/or apoptosis in pattern formation. 

Dysregulation of apoptosis can result in inappropriate suppression of cell death, 
as occurs in the development of some cancers, or in failure to control the extent of cell 
death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA).

Alternatively, this gene product is involved in the pattern of cellular proliferation
that accompanies early embryogenesis. Thus, aberrant expression of this gene product in
tissues, particularly adult tissues, may correlate with patterns of abnormal cellular
proliferation, such as found in various cancers. Because of potential roles in proliferation
and differentiation, this gene product may have applications in the adult for tissue
regeneration and the treatment of cancers. It may also act as a morphogen to control cell
and tissue type specification. Therefore, the polynucleotides and polypeptides of the
present invention are useful in treating, detecting, and/or preventing said disorders and
conditions, in addition to other types of degenerative conditions. Thus,
this protein may modulate apoptosis or tissue differentiation and is useful in the
detection, treatment, and/or prevention of degenerative or proliferative conditions and
diseases. The protein is useful in modulating the immune response to aberrant
polypeptides, as may exist in proliferating and cancerous cells and tissues. The protein
can also be used to gain new insight into the regulation of cellular growth and
proliferation. Furthermore, the protein may also be used to determine biological activity,
to raise antibodies, as a tissue marker, to isolate cognate ligands or receptors, and to
identify agents that modulate their interactions, in addition to its use as a nutritional
supplement. The protein, as well as antibodies directed against the protein, may show
utility as a tumor marker and/or immunotherapy target for the above listed tissues.

This gene is expressed almost exclusively in osteoblasts, human trabecular bone
cells, mesangial cells, adipocytes, and to a lesser extent in osteosarcoma,
chondrosarcoma, breast cancer cells, and bone marrow.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of the following diseases and conditions which
include, but are not limited to, disorders of the skeletal system, connective tissues, and
immune and hematopoietic diseases and/or disorders. Similarly, polypeptides and
antibodies directed to these polypeptides are useful to provide immunological probes for
differential identification of the tissue(s) or cell type(s). For a number of disorders of the
above tissues or cells, particularly of the connective tissue and skeletal system,
expression of this gene at significantly higher or lower levels is detected in certain
tissues or cell types (e.g., immune, hematopoietic, skeletal, bone, cartilage,
developmental, reproductive, secretory, and cancerous and wounded tissues) or bodily
fluids or cell types (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue from an
individual not having the disorder.

Preferred polypeptides of the present invention comprise immunogenic epitopes
shown in SEQ ID NO:35 as residues: Phe-23 to Arg-31, Leu-62 to Asp-72, Val-96 to
Asp-101, Thr-111 to Asn-116, Glu-128 to Thr-135, Val-142 to Ser-149, Asn-217 to
Val-222, Glu-233 to Arg-241, Gly-272 to Leu-280, Gln-286 to Thr-293, Tyr-303 to Ile-308,
Gly-354 to Thr-360, Glu-408 to Lys-419, Glu-508 to Lys-514, Arg-521 to Val-526,
Gly-529 to Phe-542, Asp-551 to Tyr-557, Thr-587 to Thr-593, His-656 to Asp-665, Met-697
to Arg-705, Asp-709 to Thr-716, Glu-755 to Gly-760, Asn-779 to His-786, Leu-810 to
Asp-816, Leu-844 to Ala-851, Gln-871 to Gly-877, Glu-884 to Gln-889, Ser-931 to
Asn-943, Ser-974 to Ile-982, Gly-1039 to Gln-1058, Arg-1121 to Arg-1127, Ser-1134 to
Trp-1139, Ser-1172 to Pro-1183. Polynucleotides encoding said polypeptides are also
provided.

The tissue distribution in osteoblasts and homology to integrin alpha subunit 10
indicate that the protein products of this gene are useful for the treatment of disorders
and conditions affecting the skeletal system, in particular osteoporosis, as well as
disorders afflicting connective tissues (e.g., arthritis, trauma, tendonitis, chondromalacia
and inflammation), such as in the diagnosis and treatment of various autoimmune
disorders such as rheumatoid arthritis, lupus, scleroderma, and dermatomyositis, as well
as dwarfism, spinal deformation, and specific joint abnormalities, as well as
chondrodysplasias (i.e., spondyloepiphyseal dysplasia congenita, familial osteoarthritis,
atelosteogenesis type II, metaphyseal chondrodysplasia type Schmid). Polynucleotides
and polypeptides corresponding to this gene are useful for the treatment and diagnosis of
hematopoietic related disorders such as anemia, pancytopenia, leukopenia,
thrombocytopenia or leukemia, since stromal cells are important in the production of cells
of hematopoietic lineages. Such a use is consistent with the observed homology to
integrin family members, in conjunction with the tissue distribution in bone
marrow cells. Integrins play pivotal roles in cell migration, inflammation, proliferation,
and cellular infiltration. Thus, the present invention is expected to share at least some of
these activities. Representative uses are described in the "Immune Activity" and
"Infectious Disease" sections below, in Examples 11, 13, 14, 16, 18, 19, 20, and 27, and
elsewhere herein. Briefly, the uses include bone marrow cell ex-vivo culture, bone
marrow transplantation, bone marrow reconstitution, and radiotherapy or chemotherapy of
neoplasia. The gene product may also be involved in lymphopoiesis; therefore, it can be
used in immune disorders such as infection, inflammation, allergy, immunodeficiency,
etc. In addition, this gene product may have commercial utility in the expansion of stem
cells and committed progenitors of various blood lineages, and in the differentiation
and/or proliferation of various cell types. Based upon the tissue distribution of this
protein, antagonists directed against this protein are useful in blocking the activity of this
protein. Accordingly, preferred are antibodies which specifically bind a portion of the
translation product of this gene.

Also provided is a kit for detecting tumors in which expression of this protein
occurs. Such a kit comprises in one embodiment an antibody specific for the translation
product of this gene bound to a solid support. Also provided is a method of detecting
these tumors in an individual which comprises a step of contacting an antibody specific
for the translation product of this gene to a bodily fluid from the individual, preferably
serum, and ascertaining whether the antibody binds to an antigen found in the bodily fluid.
Preferably the antibody is bound to a solid support and the bodily fluid is serum. The
above embodiments, as well as other treatments and diagnostic tests (kits and methods),
are more particularly described elsewhere herein. Furthermore, the protein may also be
used to determine biological activity, to raise antibodies, as a tissue marker, to isolate
cognate ligands or receptors, and to identify agents that modulate their interactions, in
addition to its use as a nutritional supplement. The protein, as well as antibodies directed
against the protein, may show utility as a tumor marker and/or immunotherapy target for
the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are
publicly available and accessible through sequence databases. Some of these sequences
are related to SEQ ID NO:17 and may have been publicly available prior to conception
of the present invention. Preferably, such related polynucleotides are specifically
excluded from the scope of the present invention. To list every related sequence is
cumbersome. Accordingly, preferably excluded from the present invention are one or
more polynucleotides comprising a nucleotide sequence described by the general
formula of a-b, where a is any integer between 1 and 4981 of SEQ ID NO:17, b is an
integer of 15 to 4995, where both a and b correspond to the positions of nucleotide
residues shown in SEQ ID NO:17, and where b is greater than or equal to a + 14.
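
The constraints of the a-b formula can be restated as a short check. The following is a minimal sketch, not part of the original disclosure: the function names are hypothetical, and the only values taken from the text are the bounds on a (1 to 4981), the bounds on b (15 to 4995), and the condition that b is at least a + 14, which together describe every contiguous stretch of at least 15 nucleotides.

```python
SEQ_LENGTH = 4995  # upper bound on b recited in the text for SEQ ID NO:17
MIN_FRAGMENT = 15  # shortest fragment implied by b >= a + 14


def satisfies_a_b_formula(a: int, b: int, seq_length: int = SEQ_LENGTH) -> bool:
    """Return True if positions a..b (1-based, inclusive) fit the a-b formula."""
    return (
        1 <= a <= seq_length - (MIN_FRAGMENT - 1)
        and MIN_FRAGMENT <= b <= seq_length
        and b >= a + (MIN_FRAGMENT - 1)
    )


def count_a_b_ranges(seq_length: int = SEQ_LENGTH) -> int:
    """Count all (a, b) pairs allowed by the formula, using a closed form."""
    n = seq_length - (MIN_FRAGMENT - 1)  # number of valid start positions
    return n * (n + 1) // 2 if n > 0 else 0


if __name__ == "__main__":
    print(satisfies_a_b_formula(1, 15))       # shortest fragment at the 5' end
    print(satisfies_a_b_formula(4981, 4995))  # shortest fragment at the 3' end
    print(count_a_b_ranges())
```

The count illustrates why the text describes listing every related sequence as cumbersome: the formula covers millions of distinct position ranges.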

FEATURES OF PROTEIN ENCODED BY GENE NO: 8 

The present invention relates to three novel peptidoglycan recognition binding
proteins expressed by keratinocytes, wound-healing tissues and chondrosarcoma tissue.
More specifically, isolated nucleic acid molecules are provided encoding a human
peptidoglycan recognition protein-related protein, sometimes referred to herein as
"human tag7" or "tag7" or "htag7". Further provided are vectors, host cells and
recombinant methods for producing the same. The invention also relates to both the
inhibition and enhancement of activities of the tag7 protein, polypeptides and diagnostic
methods for detecting tag7 gene expression.

Peptidoglycan, as well as lipopolysaccharide (LPS), is a surface component of
many bacteria which elicits a wide range of physiological and immune responses in
humans. Specifically, peptidoglycan has been shown to manifest itself clinically by
reproducing most of the symptoms of bacterial infection, including fever, acute-phase
response, inflammation, septic shock, leukocytosis, sleepiness, malaise, abscess
formation, and arthritis (see Dziarski et al., JBC, 273(15): 8680 (1998)). Furthermore,
the type of peptidoglycan (i.e., the specific stereoisomers or analogs of muramyl
dipeptide, N-acetylglucosaminyl-beta(1-4)-N-acetylmuramyl tetrapeptides, etc.) was
shown to elicit a broad range of activities, including exhibiting greater pyrogenicity,
inducing acute joint inflammation, stimulating macrophages, and causing hemorrhagic
necrosis at a primed site (See Kotani et al., Fed Proc, 45(11): 2534 (1986)). It has been
demonstrated in humans that a lipopolysaccharide binding protein exists that was
discovered as a trace plasma protein (See Schumann et al., Science, 249(4975): 1429
(1990)). It is thought that one of the modes of action by which this lipopolysaccharide
binding protein functions is by forming high-affinity complexes with lipopolysaccharide
that then bind to macrophages and monocytes, inducing the secretion of tumor necrosis
factor. Dziarski and Gupta (See Dziarski et al., JBC, 269(3): 2100 (1994)) demonstrated
that a 70 kDa receptor protein present on the surface of mouse lymphocytes served to
bind heparin, heparinoids, bacterial lipoteichoic acids, peptidoglycan, and
lipopolysaccharides. Recently, Dziarski et al. demonstrated that CD14, a
glycosylphosphatidylinositol-linked protein present on the surface of macrophages and
polymorphonuclear leukocytes, bound peptidoglycan and lipopolysaccharide.

Furthermore, the binding affinity of CD14 for lipopolysaccharide was
significantly increased in the presence of an LPS-binding protein present in plasma. It is
thought that the LPS-binding protein functions as a transfer molecule, whereby it binds
LPS and presents it to the CD14 receptor (See Dziarski et al., JBC, 273(15): 8680
(1998)). Yoshida et al. isolated a peptidoglycan binding protein from the hemolymph of
the silkworm, Bombyx mori, using column chromatography. This protein was found to
have a very specific affinity for peptidoglycan (See Yoshida et al., JBC, 271(23): 13854
(1996)).

Additionally, Kang et al. recently cloned a peptidoglycan binding protein from
the moth Trichoplusia ni. The peptidoglycan binding protein was shown to bind strongly
to insoluble peptidoglycan (See Kang et al., PNAS, 95(17): 10078 (1998)). In this study
the peptidoglycan binding protein was upregulated by a bacterial infection in T. ni. The
insect immune system is regarded as a model for innate immunity. Thus, Kang et al. were
able to clone both mouse and human homologs of the T. ni peptidoglycan binding protein.
All of these peptidoglycan binding proteins shared regions of homology, as well as four
conserved cysteine residues which may function in the tertiary structure of the protein,
possibly in helping to form binding domains. Given that peptidoglycan is an integral
component of bacterial cell walls, and that it induces many physiological responses from
cytokine secretion to inflammation and macrophage activation, it appears that this family
of proteins is a ubiquitous group involved in the binding and recognition of
peptidoglycan, the presentation of antigens (e.g., cell wall components, etc.), and the
activation of the immune system, such as the secretion of cytokines, such as TNF. TNF is
noted for its pro-inflammatory actions which result in tissue injury, such as induction of
procoagulant activity on vascular endothelial cells (Pober, J.S. et al., J. Immunol.
136:1680 (1986)), increased adherence of neutrophils and lymphocytes (Pober, J.S. et al.,
J. Immunol. 138:3319 (1987)), and stimulation of the release of platelet activating factor
from macrophages, neutrophils and vascular endothelial cells (Camussi, G. et al., J. Exp.
Med. 166:1390 (1987)).

Recent evidence implicates TNF in the pathogenesis of many infections (Cerami,
A. et al., Immunol. Today 9:28 (1988)), immune disorders, neoplastic pathology, e.g., in
cachexia accompanying some malignancies (Oliff, A. et al., Cell 50:555 (1987)), and in
autoimmune pathologies and graft-versus-host pathology (Piguet, P.-F. et al., J. Exp.
Med. 166:1280 (1987)). The association of TNF with cancer and infectious pathologies
is often related to the host's catabolic state. A major problem in cancer patients is weight
loss, usually associated with anorexia. The extensive wasting which results is known as
"cachexia" (Kern, K. A. et al., J. Parent. Enter. Nutr. 12:286-298 (1988)). Cachexia
includes progressive weight loss, anorexia, and persistent erosion of body mass in
response to a malignant growth. The cachectic state is thus associated with significant
morbidity and is responsible for the majority of cancer mortality.

A number of studies have suggested that TNF is an important mediator of the
cachexia in cancer, infectious pathology, and in other catabolic states. TNF is thought to
play a central role in the pathophysiological consequences of Gram-negative sepsis and
endotoxic shock (Michie, H.R. et al., Br. J. Surg. 76:670-671 (1989); Debets, J. M. H. et
al., Second Vienna Shock Forum, p. 463-466 (1989); Simpson, S. Q. et al., Crit. Care
Clin. 5:27-47 (1989)), including fever, malaise, anorexia, and cachexia. Endotoxin is a
potent monocyte/macrophage activator which stimulates production and secretion of
TNF (Kornbluth, S.K. et al., J. Immunol. 137:2585-2591 (1986)) and other cytokines.
Because TNF could mimic many biological effects of endotoxin, it was concluded to be a
central mediator responsible for the clinical manifestations of endotoxin-related illness.
TNF and other monocyte-derived cytokines mediate the metabolic and neurohormonal
responses to endotoxin (Michie, H.R. et al., N. Eng. J. Med. 318:1481-1486 (1988)).
Endotoxin administration to human volunteers produces acute illness with flu-like
symptoms including fever, tachycardia, increased metabolic rate and stress hormone
release (Revhaug, A. et al., Arch. Surg. 123:162-170 (1988)). Elevated levels of
circulating TNF have also been found in patients suffering from Gram-negative sepsis
(Waage, A. et al., Lancet 1:355-357 (1987); Hammerle, A.F. et al., Second Vienna Shock
Forum, p. 715-718 (1989); Debets, J. M. H. et al., Crit. Care Med. 17:489-497 (1989);
Calandra, T. et al., J. Infec. Dis. 161:982-987 (1990)). Passive immunotherapy directed
at neutralizing TNF may have a beneficial effect in Gram-negative sepsis and
endotoxemia, based on the increased TNF production and elevated TNF levels in these
pathology states, as discussed above.

Antibodies to a "modulator" material which was characterized as cachectin (later
found to be identical to TNF) were disclosed by Cerami et al. (EPO Patent Publication
0,212,489, March 4, 1987). Such antibodies were said to be useful in diagnostic
immunoassays and in therapy of shock in bacterial infections. Rubin et al. (EPO Patent
Publication 0,218,868, April 22, 1987) disclosed monoclonal antibodies to human TNF,
the hybridomas secreting such antibodies, methods of producing such antibodies, and the
use of such antibodies in immunoassay of TNF. Yone et al. (EPO Patent Publication
0,288,088, October 26, 1988) disclosed anti-TNF antibodies, including mAbs, and their
utility in immunoassay diagnosis of pathologies, in particular Kawasaki's pathology and
bacterial infection. The body fluids of patients with Kawasaki's pathology (infantile acute
febrile mucocutaneous lymph node syndrome; Kawasaki, T., Allergy 16:178 (1967);
Kawasaki, T., Shonica (Pediatrics) 26:935 (1985)) were said to contain elevated TNF
levels which were related to progress of the pathology (Yone et al., supra).

Accordingly, there is a need to provide molecules that are involved in
pathological conditions. Such novel proteins could be useful in augmenting the immune
system in such areas as immune recognition, antigen presentation, and immune system
activation. Antibodies or antagonists directed against these proteins are useful in reducing
or eliminating disorders associated with TNF and TNF-like cytokines, such as endotoxic
shock and auto-immune disorders, for example.

The polypeptide of the present invention has been putatively identified as a
member of the novel peptidoglycan recognition binding protein family and has been
termed human tag7. This identification has been made as a result of amino acid sequence
homology to the mouse tag7 (See Genbank Accession No. emb|CAA60133).

Figure 34 shows the nucleotide (SEQ ID NO:18) and deduced amino acid
sequence (SEQ ID NO:36) of htag7. Predicted amino acids from about 1 to about 21
constitute the predicted signal peptide (amino acid residues from about 1 to about 21 in
SEQ ID NO:36) and are represented by the underlined amino acid regions; and amino
acids from about 34 to about 117 constitute the predicted PGRP-like domain (amino
acids from about 34 to about 117 in SEQ ID NO:36) and are represented by the double
underlined amino acids.

Figure 35 shows the regions of similarity between the amino acid sequences of
the htag7 protein (SEQ ID NO:36) and the mouse tag7 protein (SEQ ID NO:114).

Figure 36 shows an analysis of the htag7 amino acid sequence. Alpha, beta, turn 
and coil regions; hydrophilicity and hydrophobicity; amphipathic regions; flexible 
regions; antigenic index and surface probability are shown. 

A polynucleotide encoding a polypeptide of the present invention is obtained
from human chondrosarcoma cells, bone marrow, and neutrophils. The polynucleotide of
this invention was discovered in a human chondrosarcoma cDNA library.

As shown in Figure 34, htag7 has a PGRP domain (the PGRP domain comprises
amino acids from about 34 to about 117 of SEQ ID NO:36, which correspond to amino
acids from about 34 to about 117 of Figure 34). The polynucleotide contains an open
reading frame encoding the htag7 polypeptide of 198 amino acids. htag7 exhibits a high
degree of homology at the amino acid level to the mouse tag7 (as shown in Figure 35).
The present invention provides isolated nucleic acid molecules comprising a
polynucleotide encoding the htag7 polypeptide having the amino acid sequence shown in
Figure 34 (SEQ ID NO:36). The nucleotide sequence shown in Figure 34 (SEQ ID
NO:18) was obtained by sequencing a cloned cDNA (HCDDP40), which was deposited
on November 17 at the American Type Culture Collection, and given Accession Number
203484.

The present invention is further directed to fragments of the isolated nucleic acid
molecules described herein. By a fragment of an isolated DNA molecule having the
nucleotide sequence of the deposited cDNA or the nucleotide sequence shown in SEQ ID
NO:18 is intended DNA fragments at least about 15 nt, and more preferably at least about
20 nt, still more preferably at least about 30 nt, and even more preferably, at least about
40 nt in length which are useful as diagnostic probes and primers as discussed herein. Of
course, larger fragments 50-1500 nt in length are also useful according to the present
invention, as are fragments corresponding to most, if not all, of the nucleotide sequence
of the deposited cDNA or as shown in SEQ ID NO:18. By a fragment at least 20 nt in
length, for example, is intended fragments which include 20 or more contiguous bases
from the nucleotide sequence of the deposited cDNA or the nucleotide sequence as
shown in SEQ ID NO:18. In this context "about" includes the particularly recited size,
larger or smaller by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at both
termini. Representative examples of htag7 polynucleotide fragments of the invention
include, for example, fragments that comprise, or alternatively, consist of, a sequence
from about nucleotide 1 to about 50, from about 51 to about 100, from about 101 to
about 150, from about 151 to about 200, from about 201 to about 250, from about 251 to
about 300, from about 301 to about 350, from about 351 to about 400, from about 401 to
about 450, from about 451 to about 500, from about 501 to about 550, from about 551 to
about 600, from about 601 to about 650, from about 651 to about 700, from about 701 to
about 726, and from about 130 to about 379 of SEQ ID NO:18, or the complementary
strand thereto, or the cDNA contained in the deposited clone. In this context "about"
includes the particularly recited ranges, larger or smaller by several (5, 4, 3, 2, or 1)
nucleotides, at either terminus or at both termini.
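
The consecutive fragment ranges recited above, and the tolerance attached to the word "about", can be sketched as follows. This is an illustrative sketch only: the function names are hypothetical, the end position 726 is taken from the last recited range rather than from the sequence listing itself, and the tolerance of five nucleotides reflects the "(5, 4, 3, 2, or 1)" wording.

```python
def fragment_windows(last_position: int = 726, step: int = 50) -> list[tuple[int, int]]:
    """Consecutive 1-based inclusive windows: (1, 50), (51, 100), ..., (701, 726)."""
    return [
        (start, min(start + step - 1, last_position))
        for start in range(1, last_position + 1, step)
    ]


def within_about(candidate: tuple[int, int], recited: tuple[int, int], slack: int = 5) -> bool:
    """True if each terminus of candidate lies within slack nt of the recited range."""
    return (
        abs(candidate[0] - recited[0]) <= slack
        and abs(candidate[1] - recited[1]) <= slack
    )


if __name__ == "__main__":
    for start, end in fragment_windows():
        print(f"from about {start} to about {end}")
```

Under these assumptions, a fragment spanning nucleotides 46 to 105 would fall within "about 51 to about 100", whereas one spanning 45 to 100 would not, since its 5' terminus differs by six nucleotides.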
Preferred nucleic acid fragments of the present invention include nucleic acid
molecules encoding a member selected from the group: a polypeptide comprising, or
alternatively, consisting of, the PGRP-like domain (amino acid residues from about 34 to
about 117 in Figure 34; amino acids from about 34 to about 117 in SEQ ID NO:36).
Since the locations of these domains have been predicted by computer analysis, one of
ordinary skill would appreciate that the amino acid residues constituting these domains
may vary slightly (e.g., by about 1 to 15 amino acid residues) depending on the criteria
used to define each domain. As indicated, nucleic acid molecules of the present invention
which encode an htag7 polypeptide may include, but are not limited to, those encoding the
amino acid sequence of the PGRP-like domain of the polypeptide, by itself; and the
coding sequence for the PGRP-like domain of the polypeptide and additional sequences,
such as a pre-, pro-, or prepro-protein sequence. In additional embodiments, the
polynucleotides of the invention encode functional attributes of htag7.

Preferred embodiments of the invention in this regard include fragments that
comprise alpha-helix and alpha-helix forming regions ("alpha-regions"), beta-sheet and
beta-sheet forming regions ("beta-regions"), turn and turn-forming regions
("turn-regions"), coil and coil-forming regions ("coil-regions"), hydrophilic regions,
hydrophobic regions, alpha amphipathic regions, beta amphipathic regions, flexible
regions, surface-forming regions and high antigenic index regions of htag7. The data
representing the structural or functional attributes of htag7 set forth in Figure 36 and/or
Table XII, as described above, was generated using the various modules and algorithms
of the DNA*STAR set on default parameters. In a preferred embodiment, the data
presented in columns VIII, IX, XIII, and XIV of Table XII can be used to determine
regions of htag7 which exhibit a high degree of potential for antigenicity. Regions of
high antigenicity are determined from the data presented in columns VIII, IX, XIII,
and/or XIV by choosing values which represent regions of the polypeptide which are
likely to be exposed on the surface of the polypeptide in an environment in which antigen
recognition may occur in the process of initiation of an immune response.

Certain preferred regions in these regards are set out in Figure 36, but may, as
shown in Table XII, be represented or identified by using tabular representations of the
data presented in Figure 36. The DNA*STAR computer algorithm used to generate
Figure 36 (set on the original default parameters) was used to present the data in Figure
36 in a tabular format (See Table XII). The tabular format of the data in Figure 36 is used
to easily determine specific boundaries of a preferred region. The above-mentioned
preferred regions set out in Figure 36 and in Table XII include, but are not limited to,
regions of the aforementioned types identified by analysis of the amino acid sequence set
out in Figure 34. As set out in Figure 36 and in Table XII, such preferred regions include
Garnier-Robson alpha-regions, beta-regions, turn-regions, and coil-regions,
Chou-Fasman alpha-regions, beta-regions, and turn-regions, Kyte-Doolittle hydrophilic
regions and Hopp-Woods hydrophobic regions, Eisenberg alpha- and beta-amphipathic
regions, Karplus-Schulz flexible regions, Jameson-Wolf regions of high antigenic index and
Emini surface-forming regions.

Even if deletion of one or more amino acids from the N-terminus
of a protein results in modification or loss of one or more biological functions
of the protein, other functional activities (e.g., biological activities, ability to
multimerize, modulate cellular interaction, or signalling pathways, etc.) may still be
retained. For example, the ability of shortened htag7 muteins to induce and/or bind to
antibodies which recognize the complete or mature forms of the polypeptides generally
will be retained when less than the majority of the residues of the complete or mature
polypeptide are removed from the N-terminus. Whether a particular polypeptide lacking
N-terminal residues of a complete polypeptide retains such immunologic activities can
readily be determined by routine methods described herein and otherwise known in the
art. It is not unlikely that an htag7 mutein with a large number of deleted N-terminal
amino acid residues may retain some biological or immunogenic activities. In fact,
peptides composed of as few as six htag7 amino acid residues may often evoke an
immune response.

Accordingly, the present invention further provides polypeptides having one or 
more residues deleted from the amino terminus of the htag7 amino acid sequence shown 
in Figure 34, up to the proline residue at position number 191 and polynucleotides 
encoding such polypeptides. In particular, the present invention provides polypeptides 
comprising the amino acid sequence of residues n1-196 of Figure 34, where n1 is an 
integer from 2 to 191 corresponding to the position of the amino acid residue in Figure 
34 (which is identical to the sequence shown as SEQ ID NO:36). In another embodiment, 
N-terminal deletions of the htag7 polypeptide can be described by the general formula 
n2-196, where n2 is a number from 2 to 191, corresponding to the position of the amino acid 
identified in Figure 34. N-terminal deletions of the htag7 polypeptide of the invention 
shown as SEQ ID NO:36 include polypeptides comprising the amino acid sequence of 
residues: S-2 to P-196; R-3 to P-196; R-4 to P-196; S-5 to P-196; M-6 to P-196; L-7 to 
P-196; L-8 to P-196; A-9 to P-196; W-10 to P-196; A-11 to P-196; L-12 to P-196; P-13 to 
P-196; S-14 to P-196; L-15 to P-196; L-16 to P-196; R-17 to P-196; L-18 to P-196; G-19 
to P-196; A-20 to P-196; A-21 to P-196; Q-22 to P-196; E-23 to P-196; T-24 to P-196; 
E-25 to P-196; D-26 to P-196; P-27 to P-196; A-28 to P-196; C-29 to P-196; C-30 to 
P-196; S-31 to P-196; P-32 to P-196; I-33 to P-196; V-34 to P-196; P-35 to P-196; R-36 
to P-196; N-37 to P-196; E-38 to P-196; W-39 to P-196; K-40 to P-196; A-41 to P-196; 
L-42 to P-196; A-43 to P-196; S-44 to P-196; E-45 to P-196; C-46 to P-196; A-47 to 
P-196; Q-48 to P-196; H-49 to P-196; L-50 to P-196; S-51 to P-196; L-52 to P-196; P-53 
to P-196; L-54 to P-196; R-55 to P-196; Y-56 to P-196; V-57 to P-196; V-58 to P-196; 
V-59 to P-196; S-60 to P-196; H-61 to P-196; T-62 to P-196; A-63 to P-196; G-64 to 
P-196; S-65 to P-196; S-66 to P-196; C-67 to P-196; N-68 to P-196; T-69 to P-196; P-70 
to P-196; A-71 to P-196; S-72 to P-196; C-73 to P-196; Q-74 to P-196; Q-75 to P-196; 
Q-76 to P-196; A-77 to P-196; R-78 to P-196; N-79 to P-196; V-80 to P-196; Q-81 to 
P-196; H-82 to P-196; Y-83 to P-196; H-84 to P-196; M-85 to P-196; K-86 to P-196; T-87 
to P-196; L-88 to P-196; G-89 to P-196; W-90 to P-196; C-91 to P-196; D-92 to P-196; 
V-93 to P-196; G-94 to P-196; Y-95 to P-196; N-96 to P-196; F-97 to P-196; L-98 to 
P-196; I-99 to P-196; G-100 to P-196; E-101 to P-196; D-102 to P-196; G-103 to P-196; 
L-104 to P-196; V-105 to P-196; Y-106 to P-196; E-107 to P-196; G-108 to P-196; R-109 
to P-196; G-110 to P-196; W-111 to P-196; N-112 to P-196; F-113 to P-196; T-114 to 
P-196; G-115 to P-196; A-116 to P-196; H-117 to P-196; S-118 to P-196; G-119 to P-196; 
H-120 to P-196; L-121 to P-196; W-122 to P-196; N-123 to P-196; P-124 to P-196; M-125 
to P-196; S-126 to P-196; I-127 to P-196; G-128 to P-196; I-129 to P-196; S-130 to 
P-196; F-131 to P-196; M-132 to P-196; G-133 to P-196; N-134 to P-196; Y-135 to P-196; 
M-136 to P-196; D-137 to P-196; R-138 to P-196; V-139 to P-196; P-140 to P-196; T-141 
to P-196; P-142 to P-196; Q-143 to P-196; A-144 to P-196; I-145 to P-196; R-146 to 
P-196; A-147 to P-196; A-148 to P-196; Q-149 to P-196; G-150 to P-196; L-151 to P-196; 
L-152 to P-196; A-153 to P-196; C-154 to P-196; G-155 to P-196; V-156 to P-196; A-157 
to P-196; Q-158 to P-196; G-159 to P-196; A-160 to P-196; L-161 to P-196; R-162 to 
P-196; S-163 to P-196; N-164 to P-196; Y-165 to P-196; V-166 to P-196; L-167 to P-196; 
K-168 to P-196; G-169 to P-196; H-170 to P-196; R-171 to P-196; D-172 to P-196; V-173 
to P-196; Q-174 to P-196; R-175 to P-196; T-176 to P-196; L-177 to P-196; S-178 to 
P-196; P-179 to P-196; G-180 to P-196; N-181 to P-196; Q-182 to P-196; L-183 to P-196; 
Y-184 to P-196; H-185 to P-196; L-186 to P-196; I-187 to P-196; Q-188 to P-196; N-189 
to P-196; W-190 to P-196; P-191 to P-196; of SEQ ID NO:36. Polypeptides 
encoded by these polynucleotides are also encompassed by the invention. 
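
The general formula n-196 above can be illustrated with a short sketch. The function name `n_terminal_deletions` and the 12-residue example string are hypothetical and serve only to show how the labels are formed; the example string is a truncated prefix of the residues enumerated above (with methionine assumed at position 1), so its end label differs from P-196. The default upper bound keeps at least six residues in every fragment, mirroring the range of 2 to 191 for a 196-residue sequence.

```python
def n_terminal_deletions(seq, first=2, last=None):
    """Return labels for N-terminal deletion fragments n..len(seq) (1-based).

    Each label names the first retained residue and the C-terminal residue,
    in the style "S-2 to P-196".
    """
    length = len(seq)
    if last is None:
        # Assumption: stop while six residues remain (196 - 5 = 191 in the text).
        last = length - 5
    if not (1 <= first <= last <= length):
        raise ValueError("deletion range does not fit the sequence")
    labels = []
    for n in range(first, last + 1):
        labels.append(f"{seq[n - 1]}-{n} to {seq[-1]}-{length}")
    return labels


# Hypothetical toy input: the first 12 residues from the list above.
toy = "MSRRSMLLAWAL"
for label in n_terminal_deletions(toy):
    print(label)
```

Applied to a full 196-residue sequence with the default bounds, the same loop would produce the 190 fragments from position 2 through position 191.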

Also as mentioned above, even if deletion of one or more amino acids from the 
C-terminus of a protein results in modification or loss of one or more biological 
functions of the protein, other functional activities (e.g., biological activities) may still 
be retained. For example, the ability of the shortened htag7 mutein to induce and/or bind 
to antibodies which recognize the complete or mature forms of the polypeptide generally 
will be retained when less than the majority of the residues of the complete or mature 
polypeptide are removed from the C-terminus. Whether a particular polypeptide lacking 
C-terminal residues of a complete polypeptide retains such immunologic activities can 
readily be determined by routine methods described herein and otherwise known in the 
art. It is not unlikely that an htag7 mutein with a large number of deleted C-terminal 
amino acid residues may retain some biological or immunogenic activities. In fact, 
peptides composed of as few as six htag7 amino acid residues may often evoke an 
immune response. 

Accordingly, the present invention further provides polypeptides having one or 
more residues deleted from the carboxy terminus of the amino acid sequence of the htag7 
polypeptide shown in Figure 34, up to the methionine residue at position number 6, and 
polynucleotides encoding such polypeptides. In particular, the present invention provides 
polypeptides comprising the amino acid sequence of residues 1-m1 of Figure 34, where 
m1 is an integer from 6 to 196 corresponding to the position of the amino acid residue in 
Figure 34. Moreover, the invention provides polynucleotides encoding polypeptides 
comprising, or alternatively consisting of, the amino acid sequence of C-terminal 
deletions of the htag7 polypeptide of the invention shown as SEQ ID NO:36, which include 
polypeptides comprising the amino acid sequence of residues: M-1 to S-195; M-1 to 
R-194; M-1 to Y-193; M-1 to H-192; M-1 to P-191; M-1 to W-190; M-1 to N-189; M-1 to 
Q-188; M-1 to I-187; M-1 to L-186; M-1 to H-185; M-1 to Y-184; M-1 to L-183; M-1 to 
Q-182; M-1 to N-181; M-1 to G-180; M-1 to P-179; M-1 to S-178; M-1 to L-177; M-1 to 
T-176; M-1 to R-175; M-1 to Q-174; M-1 to V-173; M-1 to D-172; M-1 to R-171; M-1 
to H-170; M-1 to G-169; M-1 to K-168; M-1 to L-167; M-1 to V-166; M-1 to Y-165; 
M-1 to N-164; M-1 to S-163; M-1 to R-162; M-1 to L-161; M-1 to A-160; M-1 to G-159; 
M-1 to Q-158; M-1 to A-157; M-1 to V-156; M-1 to G-155; M-1 to C-154; M-1 to 
A-153; M-1 to L-152; M-1 to L-151; M-1 to G-150; M-1 to Q-149; M-1 to A-148; M-1 to 
A-147; M-1 to R-146; M-1 to I-145; M-1 to A-144; M-1 to Q-143; M-1 to P-142; M-1 to 
T-141; M-1 to P-140; M-1 to V-139; M-1 to R-138; M-1 to D-137; M-1 to M-136; M-1 
to Y-135; M-1 to N-134; M-1 to G-133; M-1 to M-132; M-1 to F-131; M-1 to S-130; 
M-1 to I-129; M-1 to G-128; M-1 to I-127; M-1 to S-126; M-1 to M-125; M-1 to P-124; 
M-1 to N-123; M-1 to W-122; M-1 to L-121; M-1 to H-120; M-1 to G-119; M-1 to S-118; 
M-1 to H-117; M-1 to A-116; M-1 to G-115; M-1 to T-114; M-1 to F-113; M-1 to 
N-112; M-1 to W-111; M-1 to G-110; M-1 to R-109; M-1 to G-108; M-1 to E-107; M-1 to 
Y-106; M-1 to V-105; M-1 to L-104; M-1 to G-103; M-1 to D-102; M-1 to E-101; M-1 
to G-100; M-1 to I-99; M-1 to L-98; M-1 to F-97; M-1 to N-96; M-1 to Y-95; M-1 to 
G-94; M-1 to V-93; M-1 to D-92; M-1 to C-91; M-1 to W-90; M-1 to G-89; M-1 to L-88; 
M-1 to T-87; M-1 to K-86; M-1 to M-85; M-1 to H-84; M-1 to Y-83; M-1 to H-82; M-1 
to Q-81; M-1 to V-80; M-1 to N-79; M-1 to R-78; M-1 to A-77; M-1 to Q-76; M-1 to 
Q-75; M-1 to Q-74; M-1 to C-73; M-1 to S-72; M-1 to A-71; M-1 to P-70; M-1 to T-69; 
M-1 to N-68; M-1 to C-67; M-1 to S-66; M-1 to S-65; M-1 to G-64; M-1 to A-63; M-1 to 
T-62; M-1 to H-61; M-1 to S-60; M-1 to V-59; M-1 to V-58; M-1 to V-57; M-1 to Y-56; 
M-1 to R-55; M-1 to L-54; M-1 to P-53; M-1 to L-52; M-1 to S-51; M-1 to L-50; M-1 to 
H-49; M-1 to Q-48; M-1 to A-47; M-1 to C-46; M-1 to E-45; M-1 to S-44; M-1 to A-43; 
M-1 to L-42; M-1 to A-41; M-1 to K-40; M-1 to W-39; M-1 to E-38; M-1 to N-37; M-1 
to R-36; M-1 to P-35; M-1 to V-34; M-1 to I-33; M-1 to P-32; M-1 to S-31; M-1 to 
C-30; M-1 to C-29; M-1 to A-28; M-1 to P-27; M-1 to D-26; M-1 to E-25; M-1 to T-24; 
M-1 to E-23; M-1 to Q-22; M-1 to A-21; M-1 to A-20; M-1 to G-19; M-1 to L-18; M-1 to 
R-17; M-1 to L-16; M-1 to L-15; M-1 to S-14; M-1 to P-13; M-1 to L-12; M-1 to A-11; 
M-1 to W-10; M-1 to A-9; M-1 to L-8; M-1 to L-7; M-1 to M-6; of SEQ ID NO:36. 
Polypeptides encoded by these polynucleotides are also encompassed by the invention. 
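
As an illustration of the formula 1-m given above, the following sketch pairs each C-terminal deletion label with the fragment it denotes. The function name `c_terminal_deletions`, the `shortest` parameter and the toy input are hypothetical; the toy input is only the first 12 residues of the enumerated sequence (methionine assumed at position 1), not the full 196-residue polypeptide.

```python
def c_terminal_deletions(seq, shortest=6):
    """Return (label, fragment) pairs for fragments 1..m, longest first.

    m runs from len(seq) - 1 down to `shortest`, so every fragment keeps the
    N-terminal residue and loses at least one C-terminal residue.
    """
    length = len(seq)
    if not 1 <= shortest < length:
        raise ValueError("shortest fragment must be shorter than the sequence")
    pairs = []
    for m in range(length - 1, shortest - 1, -1):
        label = f"{seq[0]}-1 to {seq[m - 1]}-{m}"
        pairs.append((label, seq[:m]))
    return pairs


# Hypothetical toy input: a 12-residue prefix of the sequence listed above.
for label, fragment in c_terminal_deletions("MSRRSMLLAWAL"):
    print(label, fragment)
```

With a 196-residue input and `shortest=6`, the loop yields the 190 fragments from M-1 to S-195 down to M-1 to M-6.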

In addition, the invention provides nucleic acid molecules having nucleotide 
sequences related to extensive portions of SEQ ID NO:36 which have been determined 
from the following related cDNA genes: HBMTB79R (SEQ ID NO:115) and 
HCDDP40R (SEQ ID NO:116). 

Based on the sequence similarity to the mouse tag7 and the PGRP-like domain, 
the translation product of this gene is expected to share at least some biological activities 
with tag7 proteins, and specifically cytokine modulatory proteins. Such activities are 
known in the art, some of which are described elsewhere herein. Specifically, 
polynucleotides and polypeptides of the invention are also useful for modulating the 
differentiation of normal and malignant cells, modulating the proliferation and/or 
differentiation of cancer and neoplastic cells, and modulating the immune response. 
Polynucleotides and polypeptides of the invention may represent a diagnostic marker for 
hematopoietic and immune diseases and/or disorders. The full-length protein should be a 
secreted protein, based upon homology to the tag7 protein. Therefore, it is secreted into 
serum, urine, or feces and thus its levels are assayable from patient samples. Assuming 
specific expression levels are reflective of the presence of immune disorders, this protein 
would provide a convenient diagnostic for early detection. In addition, expression of this 
gene product may also be linked to the progression of immune diseases, and therefore 
may itself actually represent a therapeutic or therapeutic target for the treatment of 
cancer. 

Polynucleotides and polypeptides of the invention may play an important role in 
the pathogenesis of human cancers and cellular transformation, particularly those of the 
immune and hematopoietic systems. Polynucleotides and polypeptides of the invention 
may also be involved in the pathogenesis of developmental abnormalities based upon its 
potential effects on proliferation and differentiation of cells and tissue cell types. Due to 
the potential proliferating and differentiating activity of said polynucleotides and 
polypeptides, the invention is useful as a therapeutic agent in inducing tissue 
regeneration and for treating inflammatory conditions (e.g., inflammatory bowel syndrome, 
diverticulitis, etc.). 
Moreover, the invention is useful in modulating the immune response to aberrant 
polypeptides, as may exist in rapidly proliferating cells and tissue cell types, particularly 
in adenocarcinoma cells, and other cancers. The translation product of this gene shares 
sequence homology with Tag7, which is a mouse cytokine that, in soluble form, triggers 
apoptosis in mouse L929 cells in vitro. 

The translation product of this gene also shares sequence homology with 
antimicrobial BGP-A, a bovine antimicrobial peptide from bovine neutrophils. Preferred 
polypeptides of this invention comprise residues 184 to 196 shown in SEQ ID NO: 36. 
This polypeptide is believed to be the active mature form of the translation product of 
this gene. 

This gene is expressed primarily in bone marrow and to a lesser extent in human 
chondrosarcoma and neutrophils. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are not 
limited to, infections, cancer, and disorders of the immune system. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of infected tissues and the 
immune system, expression of this gene at significantly higher or lower levels is 
routinely detected in certain tissues or cell types (e.g. immune, hematopoietic, and 
cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, 
synovial fluid and spinal fluid) or another tissue or cell sample taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the expression 
level in healthy tissue or bodily fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic epitopes 
shown in SEQ ID NO:36 as residues: Ala-63 to Asn-68, Ala-71 to Gln-81, Tyr-135 to 
Thr-141, Leu-167 to Gln-174, and Pro-191 to Pro-196. Polynucleotides encoding said 
polypeptides are also provided. 
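
Residue ranges written in the form "Ala-63 to Asn-68" can be converted to numeric positions mechanically. The sketch below is illustrative only: the names `THREE_TO_ONE`, `parse_epitope_ranges` and `epitope_lengths` are hypothetical helpers, and the code merely parses the range notation used above without predicting or validating any epitope.

```python
import re

# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

_RANGE = re.compile(r"([A-Z][a-z]{2})-(\d+) to ([A-Z][a-z]{2})-(\d+)")


def parse_epitope_ranges(text):
    """Parse 'Xaa-i to Yaa-j' ranges into (first_aa, i, last_aa, j) tuples."""
    ranges = []
    for first, start, last, end in _RANGE.findall(text):
        ranges.append((THREE_TO_ONE[first], int(start),
                       THREE_TO_ONE[last], int(end)))
    return ranges


def epitope_lengths(ranges):
    """Number of residues spanned by each parsed range (inclusive)."""
    return [end - start + 1 for _, start, _, end in ranges]


listed = ("Ala-63 to Asn-68, Ala-71 to Gln-81, Tyr-135 to Thr-141, "
          "Leu-167 to Gln-174, Pro-191 to Pro-196")
parsed = parse_epitope_ranges(listed)
print(parsed)
print(epitope_lengths(parsed))
```

Each listed range spans at least six residues, which is consistent with the statement elsewhere in this description that peptides of as few as six residues may evoke an immune response.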


FEATURES OF PROTEIN ENCODED BY GENE NO: 9 


This invention relates to newly identified polynucleotides, polypeptides encoded 
by such polynucleotides, the use of such polynucleotides and polypeptides, as well as the 
production of such polynucleotides and polypeptides. The polypeptide of the present 
invention has been putatively identified as a human butyrophilin homolog derived from a 
human testes tumor cDNA library. The polypeptide of the present invention is sometimes 
hereafter referred to as "Butyrophilin and B7-like IgG superfamily receptor", and/or 
"BBIR II". The invention also relates to inhibiting the action of such polypeptides. 

Butyrophilin is a glycoprotein of the immunoglobulin superfamily that is secreted 
in association with the milk-fat-globule membrane from mammary epithelial cells. The 
butyrophilin gene appears to have evolved from a subset of genes in the immunoglobulin 
superfamily and genes encoding the B30.2 domain, which is conserved in a family of 
zinc-finger proteins. Furthermore, expression analysis of butyrophilin genes has shown 
that butyrophilin expression increases during lactation in conjunction with an increase in 
milk fat content. These results suggest that the stage-specific expression of milk fat 
globule membrane glycoproteins in mammary epithelial cells is regulated in a similar but 
not necessarily identical mechanism to that of a major milk protein, beta-casein. 

The polypeptide of the present invention has been putatively identified as a 
member of the milk fat globule membrane glycoprotein family, and more particularly the 
butyrophilin family, and has been termed Butyrophilin and B7-like IgG superfamily 
receptor ("BBIR II"). This identification has been made as a result of amino acid 
sequence homology to the bovine butyrophilin precursor (See Genbank Accession No. 
gi|162773). 

Preferred polynucleotides of the invention comprise the following nucleic acid 
sequence: 

ACATCCATGGCTCTAATGCTCAGTTTGGTTCTGAGTCTCCTCAAGCTGGGATC 

AGGGCAGTGGCAGGTGTTTGGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGG 

GAGGACGCAGCATTCTCCTGTTTCCTGTCTCCTAAGACCAATGCAGAGGCCA 

TGGAAGTGCGGTTCTTCAGGGGCCAGTTCTCTAGCGTGGTCCACCTCTACAG 

GGACGGGAAGGACCAGCCATTTATGCAGATGCCACAGTATCAAGGCAGGAC 

AAAACTGGTGAAGGATTCTATTGCGGAGGGGCGCATCTCTCTGAGGCTGGAA 

AACATTACTGTGTTGGATGCTGGCCTCTATGGGTGCAGGATTAGTTCCCAGTC 

TTACTACCAGAAGGCCATCTGGGAGCTACAGGTGTCAGCACTGGGCTCAGTT 

CCTCTCATTTCCATCACGGGATATGTTGATAGAGACATCCAGCTACTCTGTCA 

GTCCTCGGGCTGGTTCCCCCGGCCCACAGCGAAGTGGAAAGGTCCACAAGGA 

CAGGATTTGTCCACAGACTCCAGGACAAACAGAGACATGCATGGCCTGTTTG 

ATGTGGAGATCTCTCTGACCGTCCAAGAGAACGCCGGGAGCATATCCTGTTC 

CATGCGGCATGCTCATCTGAGCCGAGAGGTGGAATCCAGGGTACAGATAGG 

AGATACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGA 

ATACTCTGCTGTGGCCTATTTTTTGGCATTGTTGGACTGAAGATTTTCTTCTCC 

AAATTCCAGTGGAAAATCCAGGCGGAACTGGACTGGAGAAGAAAGCACGGA 

CAGGCAGAATTGAGAGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGAT 

CCAGAGACGGCTCACCCGAAGCTCTGCGTTTCTGATCTGAAAACTGTAACCC 

ATAGAAAAGCTCCCCAGGAGGTGCCTCACTCTGAGAAGAGATTTACAAGGA 

AGAGTGTGGTGGCTTCTCAGAGTTTCCAAGCAGGGAAACATTACTGGGAGGT 

GGACGGAGGACACAATAAAAGGTGGCGCGTGGGAGTGTGCCGGGATGATGT 

GGACAGGAGGAAGGAGTACGTGACTTTGTCTCCCGATCATGGGTACTGGGTC 

CTCAGACTGAATGGAGAACATTTGTATTTCACATTAAATCCCCGTTTTATCAG 
CGTCTTCCCCAGGACCCCACCTACAAAAATAGGGGTCTTCCTGGACTATGAG 

TGTGGGACCATCTCCTTCTTCAACATAAATGACCAGTCCCTTATTTATACCCT 

GACATGTCGGTTTGAAGGCTTATTGAGGCCCTACATTGAGTATCCGTCCTATA 

ATGAGCAAAATGGAACTCCCAGAGACAAGCAACAGTGAGTCCTCCTCACAG 

GCAACCACGCCCTTCCTCCCCAGGGGTGAAATGTAGGATGAATCACATCCCA 

CATTCTTCTTTAGGGATATTAAGGTCTCTCTCCCAGATCCAAAGTCCCGCAGC 

AGCCGGCCAAGGTGGCTTCCAGATGAAGGGGGACTGGCCTGTCCACATGGG 

AGTCAGGTGTCATGGCTGCCCTGAGCTGGGAGGGAAGAAGGCTGACATTAC 

ATTTAGTTTGCTCTCACTCCATCTGGCTAAGTGATCTTGAAATACCACCTCTC 

AGGTGAAGAACCGTCAGGAATTCCCATCTCACAGGCTGTGGTGTAGATTAAG 

TAGACAAGGAATGTGAATAATGCTTAGATCTTATTGATGACAGAGTGTATCC 

TAATGGTTTGTTCATTATATTACACTTTCAGTAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAA (SEQ ID NO:117), and/or 

ATGGCTCTAATGCTCAGTTTGGTTCTGAGTCTCCTCAAGCTGGGATCAGGGCA 

GTGGCAGGTGTTTGGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGGGAGGAC 

GCAGCATTCTCCTGTTTCCTGTCTCCTAAGACCAATGCAGAGGCCATGGAAG 

TGCGGTTCTTCAGGGGCCAGTTCTCTAGCGTGGTCCACCTCTACAGGGACGG 

GAAGGACCAGCCATTTATGCAGATGCCACAGTATCAAGGCAGGACAAAACT 

GGTGAAGGATTCTATTGCGGAGGGGCGCATCTCTCTGAGGCTGGAAAACATT 

ACTGTGTTGGATGCTGGCCTCTATGGGTGCAGGATTAGTTCCCAGTCTTACTA 

CCAGAAGGCCATCTGGGAGCTACAGGTGTCAGCACTGGGCTCAGTTCCTCTC 

ATTTCCATCACGGGATATGTTGATAGAGACATCCAGCTACTCTGTCAGTCCTC 

GGGCTGGTTCCCCCGGCCCACAGCGAAGTGGAAAGGTCCACAAGGACAGGA 

TTTGTCCACAGACTCCAGGACAAACAGAGACATGCATGGCCTGTTTGATGTG 

GAGATCTCTCTGACCGTCCAAGAGAACGCCGGGAGCATATCCTGTTCCATGC 

GGCATGCTCATCTGAGCCGAGAGGTGGAATCCAGGGTACAGATAGGAGATA 

CCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACTC 
TGCTGTGGCCTATTTTTTGGCATTGTTGGACTGAAGATTTTCTTCTCCAAATTC 

CAGTGGAAAATCCAGGCGGAACTGGACTGGAGAAGAAAGCACGGACAGGCA 

GAATTGAGAGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAGAG 

ACGGCTCACCCGAAGCTCTGCGTTTCTGATCTGAAAACTGTAACCCATAGAA 

AAGCTCCCCAGGAGGTGCCTCACTCTGAGAAGAGATTTACAAGGAAGAGTGT 

GGTGGCTTCTCAGAGTTTCCAAGCAGGGAAACATTACTGGGAGGTGGACGGA 

GGACACAATAAAAGGTGGCGCGTGGGAGTGTGCCGGGATGATGTGGACAGG 

AGGAAGGAGTACGTGACTTTGTCTCCCGATCATGGGTACTGGGTCCTCAGAC 

TGAATGGAGAACATTTGTATTTCACATTAAATCCCCGTTTTATCAGCGTCTTC 

CCCAGGACCCCACCTACAAAAATAGGGGTCTTCCTGGACTATGAGTGTGGGA 

CCATCTCCTTCTTCAACATAAATGACCAGTCCCTTATTTATACCCTGACATGT 

CGGTTTGAAGGCTTATTGAGGCCCTACATTGAGTATCCGTCCTATAATGAGC 

AAAATGGAACTCCCAGAGACAAGCAACAGTGA (SEQ ID NO:118). Polypeptides 
encoded by these polynucleotides are also provided. 

Preferred polypeptides of the invention comprise the following amino acid 
sequence: 

MALMLSLVLSLLKLGSGQWQVFGPDKPVQALVGEDAAFSCFLSPKTNAEAMEV 

RFFRGQFSSVVHLYRDGKDQPFMQMPQYQGRTKLVKDSIAEGRISLRLENITVL 

DAGLYGCRISSQSYYQKAIWELQVSALGSVPLISITGYVDRDIQLLCQSSGWFPRP 

TAKWKGPQGQDLSTDSRTNRDMHGLFDVEISLTVQENAGSISCSMRHAHLSREV 

ESRVQIGDTFFEPISWHLATKVLGILCCGLFFGIVGLKIFFSKFQWKIQAELDWRR 

KHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRFT 

RKSVVASQSFQAGKHYWEVDGGHNKRWRVGVCRDDVDRRKEYVTLSPDHGY 

WVLRLNGEHLYFTLNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTC 

RFEGLLRPYIEYPSYNEQNGTPRDKQQ (SEQ ID NO: 119). Polynucleotides encoding 

these polypeptides are also provided. 
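
The relationship between the open reading frame of SEQ ID NO:118 and the amino acid sequence of SEQ ID NO:119 can be checked with the standard genetic code. The following is a minimal sketch; the names `CODON_TABLE` and `translate` are hypothetical, and only the first 54 and the last 12 nucleotides of SEQ ID NO:118 as printed above are used as input.

```python
from itertools import product

# Standard genetic code, codons ordered with T, C, A, G at each position.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # tolerate whitespace and line breaks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 codons and last 4 codons of SEQ ID NO:118 as printed above.
orf_start = "ATGGCTCTAATGCTCAGTTTGGTTCTGAGTCTCCTCAAGCTGGGATCAGGGCAG"
orf_end = "AAGCAACAGTGA"
print(translate(orf_start))  # MALMLSLVLSLLKLGSGQ
print(translate(orf_end))    # KQQ
```

The two translated stretches agree with the first 18 and the last 3 residues of SEQ ID NO:119.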
A preferred polynucleotide splice variant of the invention comprises the 
following nucleic acid sequence: 

ACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACT 

CTGCTGTGGCCTATTTTTTGGCATTGTTGGACTGAAGATTTTCTTCTCCAAATT 

CCAGTGGAAAATCCAGGCGGAACTGGACTGGAGAAGAAAGCACGGACAGGC 

AGAATTGAGAGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAGA 

GACGGCTCACCCGAAGCTCTGCGTTTCTGATCTGAAAACTGTAACCCATAGA 

AAAGCTCCCCAGGAGGTGCCTCACTCTGAGAAGAGATTTACAAGGAAGAGT 

GTGGTGGCTTCTCAGAGTTTCCAAGCAGGGAAACATTACTGGGAGGTGGACG 

GAGGACACAATAAAAGGTGGCGCGTGGGAGTGTGCCGGGATGATGTGGACA 

GGAGGAAGGAGTACGTGACTTTGTCTCCCGATCATGGGTACTGGGTCCTCAG 

ACTGAATGGAGAACATTTGTATTTCACATTAAATCCCCGTTTTATCAGCGTCT 

TCCCCAGGACCCCACCTACAAAAATAGGGGTCTTCCTGGACTATGAGTGTGG 

GACCATCTCCTTCTTCAACATAAATGACCAGTCCCTTATTTATACCCTGACAT 

GTCGGTTTGAAGGCTTATTGAGGCCCTACATTGAGTATCCGTCCTATAATGAG 

CAAAATGGAACTCCCAGAGACAAGCAACAGTGAGTCCTCCTCACAGGCAAC 

CACGCCCTTCCTCCCCAGGGGTGAAATGTAGGATGAATCACATCCCACATTC 

TTCTTTAGGGATATTAAGGTCTCTCTCCCAGATCCAAAGTCCCGCAGCAGCCG 

GCCAAGGTGGCTTCCAGATGAAGGGGGACTGGCCTGTCCACATGGGAGTCA 

GGTGTCATGGCTGCCCTGAGCTGGGAGGGAAGAAGGCTGACATTACATTTAG 

TTTGCTCTCACTCCATCTGGCTAAGTGATCTTGAAATACCACCTCTCAGGTGA 

AGAACCGTCAGGAATTCCCATCTCACAGGCTGTGGTGTAGATTAAGTAGACA 

AGGAATGTGAATAATGCTTAGATCTTATTGATGACAGAGTGTATCCTAATGG 

TTTGTTCATTATATTACACTTTCAGTAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAA (SEQ ID NO: 120). Polypeptides encoded by these polynucleotides are also 

provided. 
Figures 22A-D show the nucleotide (SEQ ID NO:19) and deduced amino acid 
sequence (SEQ ID NO:37) of BBIR II. Predicted amino acids from about 1 to about 17 
constitute the predicted signal peptide (amino acid residues from about 1 to about 17 in 
SEQ ID NO:37) and are represented by the underlined amino acid regions. 

Figure 23 shows the regions of similarity between the amino acid sequences of 
the Butyrophilin and B7-like IgG superfamily receptor (BBIR II) protein (SEQ ID 
NO:37) and the bovine butyrophilin precursor (SEQ ID NO:121). 

Figure 24 shows an analysis of the BBIR II amino acid sequence. Alpha, beta, turn 
and coil regions; hydrophilicity and hydrophobicity; 
amphipathic regions; flexible regions; antigenic index and surface probability are shown. 

A polynucleotide encoding a polypeptide of the present invention is obtained 
from human small intestine, colon tumor, and human testes tumor cells and tissues. The 
polynucleotide of this invention was discovered in a human testes tumor cDNA library. 

Its translation product has homology to the B30.2-like domain which is 
characteristic of proteins containing zinc-binding B-box motifs, and particularly of 
butyrophilin family members. The polynucleotide contains an open reading frame 
encoding the BBIR II polypeptide of 318 amino acids. BBIR II exhibits a high degree of 
homology at the amino acid level to the bovine butyrophilin precursor (as shown in 
Figure 23). The present invention provides isolated nucleic acid molecules comprising a 
polynucleotide encoding the BBIR II polypeptide having the amino acid sequence shown 
in Figures 22A-D (SEQ ID NO:37). The nucleotide sequence shown in Figures 22A-D 
(SEQ ID NO:19) was obtained by sequencing a cloned cDNA (HTTDB46), which was 
deposited on November 17 at the American Type Culture Collection, and given 
Accession Number 203484. 

The present invention is further directed to fragments of the isolated nucleic acid 
molecules described herein. By a fragment of an isolated DNA molecule having the 
nucleotide sequence of the deposited cDNA or the nucleotide sequence shown in SEQ ID 
NO:19 is intended DNA fragments at least about 15 nt, and more preferably at least about 
20 nt, still more preferably at least about 30 nt, and even more preferably, at least about 
40 nt in length which are useful as diagnostic probes and primers as discussed herein. Of 
course, larger fragments 50-1500 nt in length are also useful according to the present 
invention, as are fragments corresponding to most, if not all, of the nucleotide sequence 
of the deposited cDNA or as shown in SEQ ID NO:19. By a fragment at least 20 nt in 
length, for example, is intended fragments which include 20 or more contiguous bases 
from the nucleotide sequence of the deposited cDNA or the nucleotide sequence as 
shown in SEQ ID NO:19. In this context "about" includes the particularly recited size, 
larger or smaller by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at both 
termini. Representative examples of BBIR II polynucleotide fragments of the invention 
include, for example, fragments that comprise, or alternatively, consist of, a sequence 
from about nucleotide 1 to about 50, from about 51 to about 100, from about 101 to 
about 150, from about 151 to about 200, from about 201 to about 250, from about 251 to 
about 300, from about 301 to about 350, from about 351 to about 400, from about 401 to 
about 450, from about 451 to about 500, from about 501 to about 550, from about 551 to 
about 600, from about 601 to about 650, from about 651 to about 700, from about 701 to 
about 750, from about 751 to about 800, from about 801 to about 850, from about 851 to 
about 900, from about 901 to about 950, from about 951 to about 1000, from about 1001 
to about 1050, from about 1051 to about 1100, from about 1101 to about 1150, from 
about 1151 to about 1200, from about 1201 to about 1250, from about 1251 to about 
1300, from about 1301 to about 1350, from about 1351 to about 1400, from about 1401 
to about 1450, from about 1451 to about 1500, from about 1501 to about 1550, from 
about 1551 to about 1600, from about 1601 to about 1650, from about 1651 to about 
1700, from about 1701 to about 1750, from about 1751 to about 1800, from about 1801 
to about 1850, from about 1851 to about 1900, from about 1901 to about 1950, from 
about 1951 to about 2000, from about 2001 to about 2050, from about 2051 to about 
2100, from about 2101 to about 2150, from about 2151 to about 2200, from about 2201 
to about 2250, from about 2251 to about 2300, from about 2301 to about 2350, from 
about 2351 to about 2400, from about 2401 to about 2450, from about 2451 to about 
2500, from about 2501 to about 2550, from about 2551 to about 2600, from about 2601 
to about 2650, from about 2651 to about 2700, from about 2701 to about 2750, from 
about 2751 to about 2800, from about 2801 to about 2850, from about 2851 to about 
2900, from about 2901 to about 2950, from about 2951 to about 3000, from about 3001 
to about 3050, from about 3051 to about 3059 of SEQ ID NO:19, or the complementary 
strand thereto, or the cDNA contained in the deposited gene. In this context "about" 
includes the particularly recited ranges, larger or smaller by several (5, 4, 3, 2, or 1) 
nucleotides, at either terminus or at both termini. 
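
The representative fragment ranges recited above follow a regular pattern of consecutive 50-nucleotide windows over the 3059 nucleotides of SEQ ID NO:19. A minimal sketch of that pattern, and of the stated meaning of "about", is given below; the function names `fragment_windows` and `matches_about` are hypothetical and are not part of the disclosure.

```python
def fragment_windows(total_length, size=50):
    """Consecutive 1-based, inclusive (start, end) windows covering a sequence."""
    return [(start, min(start + size - 1, total_length))
            for start in range(1, total_length + 1, size)]


def matches_about(start, end, window, slack=5):
    """True if (start, end) lies within `slack` nucleotides of the recited
    window at either terminus or at both termini."""
    recited_start, recited_end = window
    return (abs(start - recited_start) <= slack
            and abs(end - recited_end) <= slack)


windows = fragment_windows(3059)
print(len(windows), windows[0], windows[-1])

# A fragment running from nucleotide 48 to 103 is "about 51 to about 100".
print(matches_about(48, 103, (51, 100)))
# A fragment starting eleven nucleotides early is not.
print(matches_about(40, 100, (51, 100)))
```

The final window is shorter than the others because 3059 is not a multiple of 50.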

Preferred nucleic acid fragments of the present invention include nucleic acid 
molecules encoding a member selected from the group: a polypeptide comprising, or 
alternatively consisting of, the mature BBIR II protein (amino acid residues from about 
18 to about 318 in Figures 22A-D; amino acids from about 18 to about 318 in SEQ ID 
NO:37). Since the location of this form of the protein has been predicted by computer 
analysis, one of ordinary skill would appreciate that the amino acid residues constituting 
these domains may vary slightly (e.g., by about 1 to 15 amino acid residues) depending 
on the criteria used to define this location. In additional embodiments, the 
polynucleotides of the invention encode functional attributes of BBIR II. 
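
The division into a predicted signal peptide (residues about 1 to about 17) and a mature form (residues about 18 to about 318) can be sketched as follows. The function names and the 20-residue example are hypothetical illustrations; the example string is only the N-terminal stretch printed in SEQ ID NO:119 above, and the cleavage position is a computer prediction that, as stated, may shift by about 1 to 15 residues.

```python
def split_signal_peptide(seq, signal_end=17):
    """Split a precursor into (signal peptide, mature protein) after the
    1-based position `signal_end`."""
    if not 0 < signal_end < len(seq):
        raise ValueError("signal_end must fall inside the sequence")
    return seq[:signal_end], seq[signal_end:]


def mature_start_range(predicted_start=18, shift=15, length=318):
    """Range of plausible first residues of the mature form, allowing the
    predicted boundary to move by up to `shift` residues in either direction."""
    return max(1, predicted_start - shift), min(length, predicted_start + shift)


# Hypothetical illustration using only the first 20 residues shown above.
n_terminal_stretch = "MALMLSLVLSLLKLGSGQWQ"
signal, mature = split_signal_peptide(n_terminal_stretch)
print(signal)   # residues 1-17
print(mature)   # residues 18 onward of this truncated example
print(mature_start_range())
```

The second helper simply makes the stated tolerance explicit; it does not predict a cleavage site.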

Preferred embodiments of the invention in this regard include fragments that 
comprise alpha-helix and alpha-helix forming regions ("alpha-regions"), beta-sheet and 
beta-sheet forming regions ("beta-regions"), turn and turn-forming regions 
("turn-regions"), coil and coil-forming regions ("coil-regions"), hydrophilic regions, 
hydrophobic regions, alpha amphipathic regions, beta amphipathic regions, flexible 
regions, surface-forming regions and high antigenic index regions of BBIR II. The data 
representing the structural or functional attributes of BBIR II set forth in Figure 24 
and/or Table VIII, as described above, was generated using the various modules and 
algorithms of the DNA*STAR set on default parameters. In a preferred embodiment, the 
data presented in columns VIII, IX, XIII, and XIV of Table VIII can be used to 
determine regions of BBIR II which exhibit a high degree of potential for antigenicity. 
Regions of high antigenicity are determined from the data presented in columns VIII, IX, 
XIII, and/or XIV by choosing values which represent regions of the polypeptide which 
are likely to be exposed on the surface of the polypeptide in an environment in which 
antigen recognition may occur in the process of initiation of an immune response. 
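
As a rough illustration of how one such per-residue attribute can be computed, the sketch below averages the published Kyte-Doolittle hydropathy values over a sliding window. It does not reproduce the DNA*STAR modules or their default parameters; the window length of 7 and the names `KYTE_DOOLITTLE` and `hydropathy_profile` are assumptions made for this example, and the input is only the 20-residue N-terminal stretch printed in SEQ ID NO:119 above.

```python
# Kyte-Doolittle hydropathy scale (positive = hydrophobic).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(seq, window=7):
    """Mean hydropathy of every window of `window` consecutive residues."""
    if not 1 <= window <= len(seq):
        raise ValueError("window must be between 1 and the sequence length")
    profile = []
    for i in range(len(seq) - window + 1):
        segment = seq[i:i + window]
        profile.append(sum(KYTE_DOOLITTLE[aa] for aa in segment) / window)
    return profile


# Hypothetical input: the 20 N-terminal residues shown in SEQ ID NO:119.
profile = hydropathy_profile("MALMLSLVLSLLKLGSGQWQ")
for position, score in enumerate(profile, start=1):
    print(position, round(score, 2))
```

Windows with negative mean values are relatively hydrophilic and are therefore more likely to be surface exposed, which is the property the passage above relates to antigenicity.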

Certain preferred regions in these regards are set out in Figure 24, but may, as 
shown in Table VIII, be represented or identified by using tabular representations of the 
data presented in Figure 24. The DNA*STAR computer algorithm used to generate 
Figure 24 (set on the original default parameters) was used to present the data in Figure 
24 in a tabular format (See Table VIII). The tabular format of the data in Figure 24 is 
used to easily determine specific boundaries of a preferred region. The above-mentioned 
preferred regions set out in Figure 24 and in Table VIII include, but are not limited to, 
regions of the aforementioned types identified by analysis of the amino acid sequence set 
out in Figures 22A-D. As set out in Figure 24 and in Table VIII, such preferred regions 
include Garnier-Robson alpha-regions, beta-regions, turn-regions, and coil-regions, 
Chou-Fasman alpha-regions, beta-regions, and turn-regions, Kyte-Doolittle hydrophilic 
regions and Hopp-Woods hydrophobic regions, Eisenberg alpha- and beta-amphipathic 
regions, Karplus-Schulz flexible regions, Jameson-Wolf regions of high antigenic index 
and Emini surface-forming regions. 

Even if deletion of one or more amino acids from the 
N-terminus of a protein results in modification or loss of one or more biological 
functions of the protein, other functional activities (e.g., biological activities, ability to 
multimerize, etc.) may still be retained. For example, the ability of shortened BBIR II 
muteins to induce and/or bind to antibodies which recognize the complete or mature 
forms of the polypeptides generally will be retained when less than the majority of the 
residues of the complete or mature polypeptide are removed from the N-terminus. 
Whether a particular polypeptide lacking N-terminal residues of a complete polypeptide 
retains such immunologic activities can readily be determined by routine methods 
described herein and otherwise known in the art. It is not unlikely that a BBIR II mutein 
with a large number of deleted N-terminal amino acid residues may retain some 
biological or immunogenic activities. In fact, peptides composed of as few as six BBIR II 
amino acid residues may often evoke an immune response. 

Accordingly, the present invention further provides polypeptides having one or 
more residues deleted from the amino terminus of the BBIR II amino acid sequence 
shown in Figures 22A-D, up to the cystein residue at position number 313 and 
polynucleotides encoding such polypeptides. In particular, the present invention provides 
polypeptides comprising the amino acid sequence of residues n 1-3 18 of Figures 22A-D, 
where n 1 is an integer from 2 to 313 corresponding to the position of the amino acid 
residue in Figures 22A-D (which is identical to the sequence shown as SEQ ID NO:37). 
In another embodiment, N-terminal deletions of the BBIR II polypeptide can be 
described by the general formula n2-318, where n2 is a number from 2 to 313, 
corresponding to the position of the amino acid identified in Figures 22A-D. N-terminal 
deletions of the BBIR II polypeptide of the invention shown as SEQ ID NO:37 include 
polypeptides comprising the amino acid sequence of residues: 
A-2 to T-318; L-3 to T-318; M-4 to T-318; L-5 to T-318; S-6 to T-318; L-7 to T-318; 
V-8 to T-318; L-9 to T-318; S-10 to T-318; L-11 to T-318; L-12 to T-318; K-13 to T-318; 
L-14 to T-318; G-15 to T-318; S-16 to T-318; G-17 to T-318; Q-18 to T-318; W-19 to T-318; 
Q-20 to T-318; V-21 to T-318; F-22 to T-318; G-23 to T-318; P-24 to T-318; D-25 to T-318; 
K-26 to T-318; P-27 to T-318; V-28 to T-318; Q-29 to T-318; A-30 to T-318; L-31 to T-318; 
V-32 to T-318; G-33 to T-318; E-34 to T-318; D-35 to T-318; A-36 to T-318; A-37 to T-318; 
F-38 to T-318; S-39 to T-318; C-40 to T-318; F-41 to T-318; L-42 to T-318; S-43 to T-318; 
P-44 to T-318; K-45 to T-318; T-46 to T-318; N-47 to T-318; A-48 to T-318; E-49 to T-318; 
A-50 to T-318; M-51 to T-318; E-52 to T-318; V-53 to T-318; R-54 to T-318; F-55 to T-318; 
F-56 to T-318; R-57 to T-318; G-58 to T-318; Q-59 to T-318; F-60 to T-318; S-61 to T-318; 
S-62 to T-318; V-63 to T-318; V-64 to T-318; H-65 to T-318; L-66 to T-318; Y-67 to T-318; 
R-68 to T-318; D-69 to T-318; G-70 to T-318; K-71 to T-318; D-72 to T-318; Q-73 to T-318; 
P-74 to T-318; F-75 to T-318; M-76 to T-318; Q-77 to T-318; M-78 to T-318; P-79 to T-318; 
Q-80 to T-318; Y-81 to T-318; Q-82 to T-318; G-83 to T-318; R-84 to T-318; T-85 to T-318; 
K-86 to T-318; L-87 to T-318; V-88 to T-318; K-89 to T-318; D-90 to T-318; S-91 to T-318; 
I-92 to T-318; A-93 to T-318; E-94 to T-318; G-95 to T-318; R-96 to T-318; I-97 to T-318; 
S-98 to T-318; L-99 to T-318; R-100 to T-318; L-101 to T-318; E-102 to T-318; N-103 to T-318; 
I-104 to T-318; T-105 to T-318; V-106 to T-318; L-107 to T-318; D-108 to T-318; A-109 to T-318; 
G-110 to T-318; L-111 to T-318; Y-112 to T-318; G-113 to T-318; C-114 to T-318; R-115 to T-318; 
I-116 to T-318; S-117 to T-318; S-118 to T-318; Q-119 to T-318; S-120 to T-318; Y-121 to T-318; 
Y-122 to T-318; Q-123 to T-318; K-124 to T-318; A-125 to T-318; I-126 to T-318; W-127 to T-318; 
E-128 to T-318; L-129 to T-318; Q-130 to T-318; V-131 to T-318; S-132 to T-318; A-133 to T-318; 
L-134 to T-318; G-135 to T-318; S-136 to T-318; V-137 to T-318; P-138 to T-318; L-139 to T-318; 
I-140 to T-318; S-141 to T-318; I-142 to T-318; A-143 to T-318; G-144 to T-318; Y-145 to T-318; 
V-146 to T-318; D-147 to T-318; R-148 to T-318; D-149 to T-318; I-150 to T-318; Q-151 to T-318; 
L-152 to T-318; L-153 to T-318; C-154 to T-318; Q-155 to T-318; S-156 to T-318; S-157 to T-318; 
G-158 to T-318; W-159 to T-318; F-160 to T-318; P-161 to T-318; R-162 to T-318; P-163 to T-318; 
T-164 to T-318; A-165 to T-318; K-166 to T-318; W-167 to T-318; K-168 to T-318; G-169 to T-318; 
P-170 to T-318; Q-171 to T-318; G-172 to T-318; Q-173 to T-318; D-174 to T-318; L-175 to T-318; 
S-176 to T-318; T-177 to T-318; D-178 to T-318; S-179 to T-318; R-180 to T-318; T-181 to T-318; 
N-182 to T-318; R-183 to T-318; D-184 to T-318; M-185 to T-318; H-186 to T-318; G-187 to T-318; 
L-188 to T-318; F-189 to T-318; D-190 to T-318; V-191 to T-318; E-192 to T-318; I-193 to T-318; 
S-194 to T-318; L-195 to T-318; T-196 to T-318; V-197 to T-318; Q-198 to T-318; E-199 to T-318; 
N-200 to T-318; A-201 to T-318; G-202 to T-318; S-203 to T-318; I-204 to T-318; S-205 to T-318; 
C-206 to T-318; S-207 to T-318; M-208 to T-318; R-209 to T-318; H-210 to T-318; A-211 to T-318; 
H-212 to T-318; L-213 to T-318; S-214 to T-318; R-215 to T-318; E-216 to T-318; V-217 to T-318; 
E-218 to T-318; S-219 to T-318; R-220 to T-318; V-221 to T-318; Q-222 to T-318; I-223 to T-318; 
G-224 to T-318; D-225 to T-318; W-226 to T-318; R-227 to T-318; R-228 to T-318; K-229 to T-318; 
H-230 to T-318; G-231 to T-318; Q-232 to T-318; A-233 to T-318; G-234 to T-318; K-235 to T-318; 
R-236 to T-318; K-237 to T-318; Y-238 to T-318; S-239 to T-318; S-240 to T-318; S-241 to T-318; 
H-242 to T-318; I-243 to T-318; Y-244 to T-318; D-245 to T-318; S-246 to T-318; F-247 to T-318; 
P-248 to T-318; S-249 to T-318; L-250 to T-318; S-251 to T-318; F-252 to T-318; M-253 to T-318; 
D-254 to T-318; F-255 to T-318; Y-256 to T-318; I-257 to T-318; L-258 to T-318; R-259 to T-318; 
P-260 to T-318; V-261 to T-318; G-262 to T-318; P-263 to T-318; C-264 to T-318; R-265 to T-318; 
A-266 to T-318; K-267 to T-318; L-268 to T-318; V-269 to T-318; M-270 to T-318; G-271 to T-318; 
T-272 to T-318; L-273 to T-318; K-274 to T-318; L-275 to T-318; Q-276 to T-318; I-277 to T-318; 
L-278 to T-318; G-279 to T-318; E-280 to T-318; V-281 to T-318; H-282 to T-318; F-283 to T-318; 
V-284 to T-318; E-285 to T-318; K-286 to T-318; P-287 to T-318; H-288 to T-318; S-289 to T-318; 
L-290 to T-318; L-291 to T-318; Q-292 to T-318; I-293 to T-318; S-294 to T-318; G-295 to T-318; 
G-296 to T-318; S-297 to T-318; T-298 to T-318; T-299 to T-318; L-300 to T-318; K-301 to T-318; 
K-302 to T-318; G-303 to T-318; P-304 to T-318; N-305 to T-318; P-306 to T-318; W-307 to T-318; 
S-308 to T-318; F-309 to T-318; P-310 to T-318; S-311 to T-318; P-312 to T-318; C-313 to T-318; 
of SEQ ID NO:37. Polypeptides encoded by these 
polynucleotides are also encompassed by the invention. 

Also as mentioned above, even if deletion of one or more amino acids from the 
C-terminus of a protein results in modification or loss of one or more biological 
functions of the protein, other functional activities (e.g., biological activities such as the 
ability to elicit mitogenic activity, induce differentiation of normal or malignant cells, or 
multimerize) may still be retained. For example, the ability of the shortened BBIR II 
mutein to induce and/or bind to antibodies which recognize the complete or mature forms 
of the polypeptide generally will be retained when less than the majority of the residues 
of the complete or mature polypeptide are removed from the C-terminus. Whether a 
particular polypeptide lacking C-terminal residues of a complete polypeptide retains such 
immunologic activities can readily be determined by routine methods described herein 
and otherwise known in the art. It is not unlikely that a BBIR II mutein with a large 
number of deleted C-terminal amino acid residues may retain some biological or 
immunogenic activities. In fact, peptides composed of as few as six BBIR II amino acid 
residues may often evoke an immune response. 

Accordingly, the present invention further provides polypeptides having one or 
more residues deleted from the carboxy terminus of the amino acid sequence of the 
BBIR II polypeptide shown in Figures 22A-D, up to the serine residue at position 
number 6, and polynucleotides encoding such polypeptides. In particular, the present 
invention provides polypeptides comprising the amino acid sequence of residues 1-m1 of 
Figure 1, where m1 is an integer from 6 to 318 corresponding to the position of the 
amino acid residue in Figures 22A-D. Moreover, the invention provides polynucleotides 
encoding polypeptides comprising, or alternatively consisting of, the amino acid 
sequence of C-terminal deletions of the BBIR II polypeptide of the invention shown as 
SEQ ID NO:37. Such C-terminal deletions include polypeptides comprising the amino 
acid sequence of residues: 
M-1 to P-317; M-1 to F-316; M-1 to L-315; M-1 to A-314; M-1 to C-313; M-1 to P-312; 
M-1 to S-311; M-1 to P-310; M-1 to F-309; M-1 to S-308; M-1 to W-307; M-1 to P-306; 
M-1 to N-305; M-1 to P-304; M-1 to G-303; M-1 to K-302; M-1 to K-301; M-1 to L-300; 
M-1 to T-299; M-1 to T-298; M-1 to S-297; M-1 to G-296; M-1 to G-295; M-1 to S-294; 
M-1 to I-293; M-1 to Q-292; M-1 to L-291; M-1 to L-290; M-1 to S-289; M-1 to H-288; 
M-1 to P-287; M-1 to K-286; M-1 to E-285; M-1 to V-284; M-1 to F-283; M-1 to H-282; 
M-1 to V-281; M-1 to E-280; M-1 to G-279; M-1 to L-278; M-1 to I-277; M-1 to Q-276; 
M-1 to L-275; M-1 to K-274; M-1 to L-273; M-1 to T-272; M-1 to G-271; M-1 to M-270; 
M-1 to V-269; M-1 to L-268; M-1 to K-267; M-1 to A-266; M-1 to R-265; M-1 to C-264; 
M-1 to P-263; M-1 to G-262; M-1 to V-261; M-1 to P-260; M-1 to R-259; M-1 to L-258; 
M-1 to I-257; M-1 to Y-256; M-1 to F-255; M-1 to D-254; M-1 to M-253; M-1 to F-252; 
M-1 to S-251; M-1 to L-250; M-1 to S-249; M-1 to P-248; M-1 to F-247; M-1 to S-246; 
M-1 to D-245; M-1 to Y-244; M-1 to I-243; M-1 to H-242; M-1 to S-241; M-1 to S-240; 
M-1 to S-239; M-1 to Y-238; M-1 to K-237; M-1 to R-236; M-1 to K-235; M-1 to G-234; 
M-1 to A-233; M-1 to Q-232; M-1 to G-231; M-1 to H-230; M-1 to K-229; M-1 to R-228; 
M-1 to R-227; M-1 to W-226; M-1 to D-225; M-1 to G-224; M-1 to I-223; M-1 to Q-222; 
M-1 to V-221; M-1 to R-220; M-1 to S-219; M-1 to E-218; M-1 to V-217; M-1 to E-216; 
M-1 to R-215; M-1 to S-214; M-1 to L-213; M-1 to H-212; M-1 to A-211; M-1 to H-210; 
M-1 to R-209; M-1 to M-208; M-1 to S-207; M-1 to C-206; M-1 to S-205; M-1 to I-204; 
M-1 to S-203; M-1 to G-202; M-1 to A-201; M-1 to N-200; M-1 to E-199; M-1 to Q-198; 
M-1 to V-197; M-1 to T-196; M-1 to L-195; M-1 to S-194; M-1 to I-193; M-1 to E-192; 
M-1 to V-191; M-1 to D-190; M-1 to F-189; M-1 to L-188; M-1 to G-187; M-1 to H-186; 
M-1 to M-185; M-1 to D-184; M-1 to R-183; M-1 to N-182; M-1 to T-181; M-1 to R-180; 
M-1 to S-179; M-1 to D-178; M-1 to T-177; M-1 to S-176; M-1 to L-175; M-1 to D-174; 
M-1 to Q-173; M-1 to G-172; M-1 to Q-171; M-1 to P-170; M-1 to G-169; M-1 to K-168; 
M-1 to W-167; M-1 to K-166; M-1 to A-165; M-1 to T-164; M-1 to P-163; M-1 to R-162; 
M-1 to P-161; M-1 to F-160; M-1 to W-159; M-1 to G-158; M-1 to S-157; M-1 to S-156; 
M-1 to Q-155; M-1 to C-154; M-1 to L-153; M-1 to L-152; M-1 to Q-151; M-1 to I-150; 
M-1 to D-149; M-1 to R-148; M-1 to D-147; M-1 to V-146; M-1 to Y-145; M-1 to G-144; 
M-1 to A-143; M-1 to I-142; M-1 to S-141; M-1 to I-140; M-1 to L-139; M-1 to P-138; 
M-1 to V-137; M-1 to S-136; M-1 to G-135; M-1 to L-134; M-1 to A-133; M-1 to S-132; 
M-1 to V-131; M-1 to Q-130; M-1 to L-129; M-1 to E-128; M-1 to W-127; M-1 to I-126; 
M-1 to A-125; M-1 to K-124; M-1 to Q-123; M-1 to Y-122; M-1 to Y-121; M-1 to S-120; 
M-1 to Q-119; M-1 to S-118; M-1 to S-117; M-1 to I-116; M-1 to R-115; M-1 to C-114; 
M-1 to G-113; M-1 to Y-112; M-1 to L-111; M-1 to G-110; M-1 to A-109; M-1 to D-108; 
M-1 to L-107; M-1 to V-106; M-1 to T-105; M-1 to I-104; M-1 to N-103; M-1 to E-102; 
M-1 to L-101; M-1 to R-100; M-1 to L-99; M-1 to S-98; M-1 to I-97; M-1 to R-96; 
M-1 to G-95; M-1 to E-94; M-1 to A-93; M-1 to I-92; M-1 to S-91; M-1 to D-90; 
M-1 to K-89; M-1 to V-88; M-1 to L-87; M-1 to K-86; M-1 to T-85; M-1 to R-84; 
M-1 to G-83; M-1 to Q-82; M-1 to Y-81; M-1 to Q-80; M-1 to P-79; M-1 to M-78; 
M-1 to Q-77; M-1 to M-76; M-1 to F-75; M-1 to P-74; M-1 to Q-73; M-1 to D-72; 
M-1 to K-71; M-1 to G-70; M-1 to D-69; M-1 to R-68; M-1 to Y-67; M-1 to L-66; 
M-1 to H-65; M-1 to V-64; M-1 to V-63; M-1 to S-62; M-1 to S-61; M-1 to F-60; 
M-1 to Q-59; M-1 to G-58; M-1 to R-57; M-1 to F-56; M-1 to F-55; M-1 to R-54; 
M-1 to V-53; M-1 to E-52; M-1 to M-51; M-1 to A-50; M-1 to E-49; M-1 to A-48; 
M-1 to N-47; M-1 to T-46; M-1 to K-45; M-1 to P-44; M-1 to S-43; M-1 to L-42; 
M-1 to F-41; M-1 to C-40; M-1 to S-39; M-1 to F-38; M-1 to A-37; M-1 to A-36; 
M-1 to D-35; M-1 to E-34; M-1 to G-33; M-1 to V-32; M-1 to L-31; M-1 to A-30; 
M-1 to Q-29; M-1 to V-28; M-1 to P-27; M-1 to K-26; M-1 to D-25; M-1 to P-24; 
M-1 to G-23; M-1 to F-22; M-1 to V-21; M-1 to Q-20; M-1 to W-19; M-1 to Q-18; 
M-1 to G-17; M-1 to S-16; M-1 to G-15; M-1 to L-14; M-1 to K-13; M-1 to L-12; 
M-1 to L-11; M-1 to S-10; M-1 to L-9; M-1 to V-8; M-1 to L-7; M-1 to S-6; 
of SEQ ID NO:37. Polypeptides encoded by these polynucleotides are 
also encompassed by the invention. 
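
The N-terminal and C-terminal deletion series recited above follow a simple positional rule, which the following Python sketch illustrates. It is illustrative only: the function names and the six-residue example peptide are assumptions introduced here and are not part of the disclosure.

```python
def n_terminal_deletions(seq, last_start):
    """Label N-terminal deletions 'X-n to Z-len' for n = 2..last_start (1-based)."""
    end = len(seq)
    return [f"{seq[n - 1]}-{n} to {seq[-1]}-{end}" for n in range(2, last_start + 1)]


def c_terminal_deletions(seq, first_end):
    """Label C-terminal deletions 'X-1 to Z-m' for m = len-1 down to first_end."""
    return [
        f"{seq[0]}-1 to {seq[m - 1]}-{m}"
        for m in range(len(seq) - 1, first_end - 1, -1)
    ]


# Hypothetical short peptide, used only to show the labelling scheme.
toy = "MALMLS"
print(n_terminal_deletions(toy, 4))  # starts at residue 2, ends at residue 4
print(c_terminal_deletions(toy, 3))  # ends at residue 5 down to residue 3
```

Applied to a full-length sequence, the same two functions would reproduce labels of the form used in the lists above (for example, a start position n and the fixed C-terminal residue, or the fixed M-1 start and an end position m).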

In addition, the invention provides nucleic acid molecules having nucleotide 
sequences related to extensive portions of SEQ ID NO: 19 which have been determined 
from the following related cDNA genes: HTTDB46R (SEQ ID NO: 122), and HSIEA44R 
(SEQ ID NO: 123). 

Based on the sequence similarity to the bovine butyrophilin precursor, the translation 
product of this gene is expected to share at least some biological activities with B30.2- 
like domain containing proteins, and specifically butyrophilin proteins. Such activities 
are known in the art, some of which are described elsewhere herein. Specifically, 
polynucleotides and polypeptides of the invention are also useful for modulating the 
differentiation of normal and malignant cells, modulating the proliferation and/or 
differentiation of cancer and neoplastic cells, and regulation of cell growth and 
differentiation. Polynucleotides and polypeptides of the invention may represent a 
diagnostic marker for breast diseases and/or disorders, in addition to disorders of 
secretory organs and tissues (which include testicular and gastrointestinal disorders, 
particularly those cells which serve secretory functions for seminal fluid or 
gastrointestinal hormones, and disorders of the mucosal membranes of such cells and 
tissues, etc.). 

The full-length protein should be a secreted protein, based upon homology to the 
butyrophilin family of proteins. Therefore, it is secreted into milk, serum, urine, seminal 
fluid, or feces, and thus its levels are assayable from patient samples. Assuming specific 
expression levels are reflective of the presence of breast disorders (i.e., breast cancer, 
breast dysfunction, etc.), this protein would provide a convenient diagnostic for early 
detection of such disorders. 

In addition, expression of this gene product may also be linked to the progression 
of breast diseases, and therefore may itself actually represent a therapeutic or therapeutic 
target for the treatment of breast cancer. Polynucleotides and polypeptides of the 
invention may play an important role in the pathogenesis of human cancers and cellular 
transformation, particularly those of secretory cells and tissues. Polynucleotides and 
polypeptides of the invention may also be involved in the pathogenesis of developmental 
abnormalities based upon their potential effects on proliferation and differentiation of cells 
and tissue cell types. 

Due to the potential proliferating and differentiating activity of said 
polynucleotides and polypeptides, the invention is useful as a therapeutic agent in 
inducing tissue regeneration and for treating inflammatory conditions. Moreover, the 
invention is useful in modulating the immune response to aberrant polypeptides, as may 
exist in rapidly proliferating cells and tissue cell types, particularly in cancers. The 
invention, including agonists and/or antagonists thereof, is useful in modulating the 
nutritional value of milk, its caloric content, and its fat content, and may conceivably be 
useful in mediating the adaptation of breast secretory function as a delivery vehicle for 
therapeutics (i.e., transgenic breast secretory tissue for transferring therapeutically active 
proteins to infants). 

Alternatively, the expression within cellular sources marked by proliferating cells 
indicates this protein may play a role in the regulation of cellular division, and may show 
utility in the diagnosis, treatment, and/or prevention of developmental diseases and 
disorders, including cancer, and other proliferative conditions. Representative uses are 
described in the "Hyperproliferative Disorders" and "Regeneration" sections below and 
elsewhere herein. Briefly, developmental tissues rely on decisions involving cell 
differentiation and/or apoptosis in pattern formation. 

Dysregulation of apoptosis can result in inappropriate suppression of cell death, 
as occurs in the development of some cancers, or in failure to control the extent of cell 
death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). 

Alternatively, this gene product is involved in the pattern of cellular proliferation 
that accompanies early embryogenesis. Thus, aberrant expression of this gene product in 
tissues - particularly adult tissues - may correlate with patterns of abnormal cellular 
proliferation, such as found in various cancers. Because of potential roles in proliferation 
and differentiation, this gene product may have applications in the adult for tissue 
regeneration and the treatment of cancers. It may also act as a morphogen to control cell 
and tissue type specification. Therefore, the polynucleotides and polypeptides of the 
present invention are useful in treating, detecting, and/or preventing said disorders and 
conditions, in addition to other types of degenerative conditions. Thus, this protein may 
modulate apoptosis or tissue differentiation and is useful in the detection, treatment, 
and/or prevention of degenerative or proliferative conditions and diseases. The protein is 
useful in modulating the immune response to aberrant polypeptides, as may exist in 
proliferating and cancerous cells and tissues. The protein can also be used to gain new 
insight into the regulation of cellular growth and proliferation. Furthermore, the protein 
may also be used to determine biological activity, to raise antibodies, as tissue markers, 
to isolate cognate ligands or receptors, and to identify agents that modulate their interactions, 
in addition to its use as a nutritional supplement. The protein, as well as antibodies directed 
against the protein, may show utility as a tumor marker and/or immunotherapy target for 
the above listed tissues. 

This gene is expressed primarily in small intestine, colon tumor, and to a lesser 
extent in human testes tumor cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are not 
limited to, gastrointestinal diseases and/or disorders, in addition to lactation disorders, 
and tumors of the testes. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune and reproductive systems, expression of this gene at 
significantly higher or lower levels is routinely detected in certain tissues or cell types 
(e.g., immune, testicular, gastrointestinal, and cancerous and wounded tissues) or bodily 
fluids (e.g., lymph, serum, plasma, urine, synovial fluid and spinal fluid) or another tissue 
or cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic epitopes 
shown in SEQ ID NO:37 as residues: Tyr-67 to Pro-74, Ser-117 to Gln-123, Pro-161 to 
Met-185, Gly-224 to His-242, and Thr-299 to Trp-307. Polynucleotides encoding said 
polypeptides are also provided. 


FEATURES OF PROTEIN ENCODED BY GENE NO: 10 

The translation product of this gene contains a serine protease motif and 
accordingly is believed to possess serine protease activity. Assays for determining such 
activity are well known in the art. Preferred polypeptides of this invention possess such 
activity. 

Included in this invention as preferred domains are serine protease histidine 
active site domains, which were identified using the ProSite analysis tool (Swiss Institute 
of Bioinformatics). The catalytic activity of the serine proteases from the trypsin family 
is provided by a charge relay system involving an aspartic acid residue hydrogen-bonded 
to a histidine, which itself is hydrogen-bonded to a serine. The sequences in the vicinity 
of the active site serine and histidine residues are well conserved in this family of 
proteases [1]. Consensus pattern: [LIVM]-[ST]-A-[STAG]-H-C, where H is the active site 
residue. 

Preferred polypeptides of the invention comprise the following amino acid 
sequence: GTLVAEKHVLTAAHCIHDGKTYVKGTQ (SEQ ID NO: 124). 
Polynucleotides encoding these polypeptides are also provided. 
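
As an illustration only, the consensus pattern recited above can be checked against the fragment recited as SEQ ID NO: 124 with a short Python sketch. The translation of the pattern into a regular expression and the variable names are assumptions made here for illustration; this is not the ProSite tool itself.

```python
import re

# The consensus [LIVM]-[ST]-A-[STAG]-H-C rewritten as a regular expression
# (an assumed, simplified translation of the ProSite-style notation).
HIS_ACTIVE_SITE = re.compile(r"[LIVM][ST]A[STAG]HC")

# Fragment recited in the text as SEQ ID NO: 124.
fragment = "GTLVAEKHVLTAAHCIHDGKTYVKGTQ"

match = HIS_ACTIVE_SITE.search(fragment)
motif = match.group(0)

# 1-based position of the histidine in the fragment; H is the 5th motif residue.
his_position = match.start() + 5

print(motif, his_position)
```

If the fragment corresponds to residues from about 162 to about 188 of SEQ ID NO:38, as described elsewhere herein for the histidine active site domain, the histidine found by this sketch would fall at residue 175 of that sequence.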

Further preferred are polypeptides comprising the serine protease histidine active 
site domain of the sequence referenced in Table XIII for this gene, and at least 5, 10, 15, 
20, 25, 30, 50, or 75 additional contiguous amino acid residues of this referenced 
sequence. The additional contiguous amino acid residues may be N-terminal or C-terminal to 
the serine protease histidine active site domain. 

Alternatively, the additional contiguous amino acid residues may be both N-terminal 
and C-terminal to the serine protease histidine active site domain, wherein the total N- 
and C-terminal contiguous amino acid residues equal the specified number. The above 
preferred polypeptide domain is characteristic of a signature specific to serine protease 
proteins. Based on the sequence similarity, the translation product of this gene is 
expected to share at least some biological activities with serine proteases. Such activities 
are known in the art, some of which are described elsewhere herein. 
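
The way a specified number of additional contiguous residues can be divided between the N-terminal and C-terminal sides of a domain may be sketched as follows. This is a minimal illustration under stated assumptions: the function names, the example sequence, and the domain coordinates are all hypothetical and do not come from Table XIII.

```python
def flank_splits(total):
    """All (n_flank, c_flank) pairs whose sum equals the specified number."""
    return [(n, total - n) for n in range(total + 1)]


def domain_with_flanks(seq, start, end, n_flank, c_flank):
    """Return residues start..end (1-based, inclusive) plus the flanking residues.

    Flanks are clipped at the ends of the sequence.
    """
    lo = max(1, start - n_flank)
    hi = min(len(seq), end + c_flank)
    return seq[lo - 1:hi]


# Hypothetical 15-residue sequence with an assumed domain at positions 6-10.
toy = "GGGGGHHHHHKKKKK"
print(flank_splits(5))
print(domain_with_flanks(toy, 6, 10, 2, 3))
```

With 5 additional residues, for example, the split may be 5 N-terminal and 0 C-terminal, 2 and 3, or any other pair summing to 5.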

In another embodiment, polypeptides comprising the amino acid sequence of the 
open reading frame upstream of the predicted signal peptide are contemplated by the 
present invention. Specifically, polypeptides of the invention comprise the following 
amino acid sequence: 

GTRGQAWEPRALSRRPHLSERRSEPRPGRAARRGTVLGMAGIPGLLFLLFF 
LLCAVGQVSPYSAPWKPTWPAYRLPVVLPQSTLNLAKPDFGAEAKLEVSSSCGP 
QCHKGTPLPTYEEAKQYLSYETLYANGSRTETQVGIYILSSSGDGAQHRDSGSSG 
KSRRKRQIYGYDSRFSIFGKDFLLNYPFSTSVKLSTGCTGTLVAEKHVLTAAHCI 
HDGKTYVKGTQKLRVGFLKPKFKDGGRGANDSTSAMPEQMKFQWIRVKRTHV 
PKGWIKGNANDIGMDYDYALLELKKPHKRKFMKIGVSPPAKQLPGGRIHFSGYD 
NDRPGNLVYRFCDVKDETYDLLYQQCDSQPGASGSGVYVRMWKRQHQKWER 
KIIGMISGHQWVDMDGSPQEFTRGCSEITPLQYIPDISIGV (SEQ ID NO: 125). 
Polynucleotides encoding these polypeptides are also provided. 

A preferred polypeptide variant of the invention comprises the following amino 
acid sequence: 

MAGIPGLLFLLFFLLCAVGQVSPYSAPWKPTWPAYRLPVVLPQSTLNLAKPD 
FGAEAKLEVSSSCGPQCHKGTPLPTYEEAKQYLSYETLYANGSRTETQVGIYILS 
SSGDGAQHRDSGSSGKSRRKRQIYGYDSRFSIFGKDFLLNYPFSTSVKLSTGCTG 
TLVAEKHVLTAAHCIHDGKTYVKGTQKLRVGFLKPKFKDGGRGANDSTSAMPEQMKFQWIRV 
KRTHVPKGWIKGNANDIGMDYDYALLELKKPHKRKFMKIGVSPPAKQLPGGRI 
HFSGYDNDRPGNLVYRFCDVKDETYDLLYQQCDAQPGASGSGVYVRMWKRQ 
QQKWERKIIGIFSGHQWVDMNGSPQDFNVAVRITPLKYAQICYWIKGNYLDCRE 
G (SEQ ID NO: 126). Polynucleotides encoding these polypeptides are also provided. 

Figures 25A-B show the nucleotide (SEQ ID NO:20) and deduced amino acid 
sequence (SEQ ID NO:38) of the present invention. Predicted amino acids from about 1 
to about 19 constitute the predicted signal peptide (amino acid residues from about 1 to 
about 19 in SEQ ID NO:38) and are represented by the underlined amino acid regions; 
amino acids from about 162 to about 188 constitute the predicted serine protease 
histidine active site domain (amino acid residues from about 162 to about 188 in SEQ 
ID NO:38) and are represented by the double underlined amino acid regions; and amino 
acid residue 175 (amino acid residue 175 in SEQ ID NO:38) constitutes the predicted 
histidine active site residue and is represented by the bold amino acid. 
Figure 26 shows the regions of similarity between the amino acid sequences of 
the present invention SEQ ID NO:38, and the Human Pancreatic Elastase 2 protein 
(gi|219620) (SEQ ID NO: 127). 

Figure 27 shows an analysis of the amino acid sequence of SEQ ID NO:38. 
Alpha, beta, turn and coil regions; hydrophilicity and hydrophobicity; amphipathic 
regions; flexible regions; antigenic index and surface probability are shown. 

Northern analysis indicates that this gene is expressed most highly in HUVEC, 
HUVEC + LPS, smooth muscle, and fibroblasts; is present in heart, brain, placenta, lung, liver, 
muscle, kidney, pancreas, spleen, thymus, prostate, testes, ovary, small intestine, and colon; 
and is weakly expressed in PBLs. 

The present invention provides isolated nucleic acid molecules comprising a 
polynucleotide encoding the polypeptide having the amino acid sequence shown in 
Figures 25A-B (SEQ ID NO:38), which was determined by sequencing a cloned cDNA. 
The nucleotide sequence shown in Figures 25A-B (SEQ ID NO:20) was obtained by 
sequencing a cloned cDNA (HUSAQ05), which was deposited on Nov. 17, 1998 at the 
American Type Culture Collection, and given Accession Number 203484. The deposited 
gene is inserted in the pSport plasmid (Life Technologies, Rockville, MD) using the 
SalI/NotI restriction endonuclease cleavage sites. 
The present invention is further directed to fragments of the isolated nucleic acid 
molecules described herein. By a fragment of an isolated DNA molecule having the 
nucleotide sequence of the deposited cDNA or the nucleotide sequence shown in SEQ ID 
NO: 20 is intended DNA fragments at least about 15 nt, and more preferably at least about 
20 nt, still more preferably at least about 30 nt, and even more preferably, at least about 
40 nt in length which are useful as diagnostic probes and primers as discussed herein. Of 
course, larger fragments 50-1500 nt in length are also useful according to the present 
invention, as are fragments corresponding to most, if not all, of the nucleotide sequence 
of the deposited cDNA or as shown in SEQ ID NO:20. By a fragment at least 20 nt in 
length, for example, is intended fragments which include 20 or more contiguous bases 
from the nucleotide sequence of the deposited cDNA or the nucleotide sequence as 
shown in SEQ ID NO:20. In this context "about" includes the particularly recited size, 
larger or smaller by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at both 
termini. Representative examples of polynucleotide fragments of the invention include, 
for example, fragments that comprise, or alternatively, consist of, a sequence from about 
nucleotide 1 to about 50, from about 51 to about 100, from about 101 to about 150, from 
about 151 to about 200, from about 201 to about 250, from about 251 to about 300, from 
about 301 to about 350, from about 351 to about 400, from about 401 to about 450, from 
about 451 to about 500, from about 501 to about 550, from about 551 to about 
600, from about 601 to about 650, from about 651 to about 700, from about 701 to about 
750, from about 751 to about 800, from about 801 to about 850, from about 851 to about 
900, from about 901 to about 950, from about 951 to about 1000, from about 1001 to 
about 1050, from about 1051 to about 1100, from about 1101 to about 1150, from about 
1151 to about 1200, from about 1201 to about 1250, from about 1251 to about 1300, 
from about 1301 to about 1350, from about 1351 to about 1400, from about 1401 to 
about 1450, from about 1451 to about 1500, from about 1501 to about 1550, from about 
1551 to about 1600, from about 1601 to about 1650, and from about 1651 to about 1699 of 
SEQ ID NO: 20, or the complementary strand thereto, or the cDNA contained in the 
deposited gene. In this context "about" includes the particularly recited ranges, larger or 
smaller by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at both termini. In 
additional embodiments, the polynucleotides of the invention encode functional 
attributes of the corresponding protein. 
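
The representative fragment ranges recited above are consecutive 50-nucleotide windows over the 1699 nucleotides of SEQ ID NO:20, with a shorter final window. A minimal Python sketch of that enumeration is given below; the function name and its parameters are assumptions for illustration, and the sketch does not model the "about" tolerance of several nucleotides at either terminus.

```python
def fragment_windows(length, size=50):
    """Return consecutive 1-based, inclusive (start, end) windows of `size` nt."""
    return [
        (start, min(start + size - 1, length))
        for start in range(1, length + 1, size)
    ]


# 1699 is the last nucleotide position recited for SEQ ID NO:20.
windows = fragment_windows(1699)
print(windows[0], windows[-1], len(windows))
```

The first window is nucleotides 1 to 50 and the last is 1651 to 1699, matching the first and last ranges in the recited list.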

Preferred embodiments of the invention in this regard include fragments that 
comprise alpha-helix and alpha-helix forming regions ("alpha-regions"), beta-sheet and 
beta-sheet forming regions ("beta-regions"), turn and turn-forming regions ("turn-regions"), 
coil and coil-forming regions ("coil-regions"), hydrophilic regions, 
hydrophobic regions, alpha amphipathic regions, beta amphipathic regions, flexible 
regions, surface-forming regions and high antigenic index regions. The data representing 
the structural or functional attributes of the protein set forth in Figure 27 and/or Table IX, 
as described above, was generated using the various modules and algorithms of the 
DNA*STAR set on default parameters. In a preferred embodiment, the data presented in 
columns VIII, IX, XIII, and XIV of Table IX can be used to determine regions of the 
protein which exhibit a high degree of potential for antigenicity. Regions of high 
antigenicity are determined from the data presented in columns VIII, IX, XIII, and/or 
XIV by choosing values which represent regions of the polypeptide which are likely to 
be exposed on the surface of the polypeptide in an environment in which antigen 
recognition may occur in the process of initiation of an immune response. 

Certain preferred regions in these regards are set out in Figure 27, but may, as 
shown in Table IX, be represented or identified by using tabular representations of the 
data presented in Figure 27. The DNA*STAR computer algorithm used to generate 
Figure 27 (set on the original default parameters) was used to present the data in Figure 
27 in a tabular format (See Table IX). The tabular format of the data in Figure 27 is used 
to easily determine specific boundaries of a preferred region. The above-mentioned 
preferred regions set out in Figure 27 and in Table IX include, but are not limited to, 
regions of the aforementioned types identified by analysis of the amino acid sequence set 
out in Figure 1. As set out in Figure 27 and in Table IX, such preferred regions include 
Garnier-Robson alpha-regions, beta-regions, turn-regions, and coil-regions, 
Chou-Fasman alpha-regions, beta-regions, and turn-regions, Kyte-Doolittle hydrophilic regions 
and Hopp-Woods hydrophobic regions, Eisenberg alpha- and beta-amphipathic regions, 
Karplus-Schulz flexible regions, Jameson-Wolf regions of high antigenic index and 
Emini surface-forming regions. 

Even if deletion of one or more amino acids from the N-terminus 
of a protein results in modification or loss of one or more biological functions 
of the protein, other functional activities (e.g., biological activities, ability to 
multimerize, etc.) may still be retained. For example, the ability of shortened muteins to 
induce and/or bind to antibodies which recognize the complete or mature forms of the 
polypeptides generally will be retained when less than the majority of the residues of the 
complete or mature polypeptide are removed from the N-terminus. Whether a particular 
polypeptide lacking N-terminal residues of a complete polypeptide retains such 
immunologic activities can readily be determined by routine methods described herein 
and otherwise known in the art. It is not unlikely that a mutein with a large number of 
deleted N-terminal amino acid residues may retain some biological or immunogenic 
activities. In fact, peptides composed of as few as six amino acid residues may often 
evoke an immune response. 

Accordingly, the present invention further provides polypeptides having one or 
more residues deleted from the amino terminus of the amino acid sequence shown in 
Figures 25A-B, up to the aspartic acid residue at position number 370, and 
polynucleotides encoding such polypeptides. In particular, the present invention provides 
polypeptides comprising the amino acid sequence of residues n1-375 of Figures 25A-B, 
where n1 is an integer from 2 to 370 corresponding to the position of the amino acid 
residue in Figures 25A-B (which is identical to the sequence shown as SEQ ID NO:38). 

N-terminal deletions of the polypeptide of the invention shown as SEQ ID NO:38 
include polypeptides comprising the amino acid sequence of residues: 
A-2 to V-375; G-3 to V-375; I-4 to V-375; P-5 to V-375; G-6 to V-375; L-7 to V-375; L-8 to V-375; 
F-9 to V-375; L-10 to V-375; L-11 to V-375; F-12 to V-375; F-13 to V-375; L-14 to V-375; L-15 to V-375; 
C-16 to V-375; A-17 to V-375; V-18 to V-375; G-19 to V-375; Q-20 to V-375; V-21 to V-375; S-22 to V-375; 
P-23 to V-375; Y-24 to V-375; S-25 to V-375; A-26 to V-375; P-27 to V-375; W-28 to V-375; K-29 to V-375; 
P-30 to V-375; T-31 to V-375; W-32 to V-375; P-33 to V-375; A-34 to V-375; Y-35 to V-375; R-36 to V-375; 
L-37 to V-375; P-38 to V-375; V-39 to V-375; V-40 to V-375; L-41 to V-375; P-42 to V-375; Q-43 to V-375; 
S-44 to V-375; T-45 to V-375; L-46 to V-375; N-47 to V-375; L-48 to V-375; A-49 to V-375; K-50 to V-375; 
P-51 to V-375; D-52 to V-375; F-53 to V-375; G-54 to V-375; A-55 to V-375; E-56 to V-375; A-57 to V-375; 
K-58 to V-375; L-59 to V-375; E-60 to V-375; V-61 to V-375; S-62 to V-375; S-63 to V-375; S-64 to V-375; 
C-65 to V-375; G-66 to V-375; P-67 to V-375; Q-68 to V-375; C-69 to V-375; H-70 to V-375; K-71 to V-375; 
G-72 to V-375; T-73 to V-375; P-74 to V-375; L-75 to V-375; P-76 to V-375; T-77 to V-375; Y-78 to V-375; 
E-79 to V-375; E-80 to V-375; A-81 to V-375; K-82 to V-375; Q-83 to V-375; Y-84 to V-375; L-85 to V-375; 
S-86 to V-375; Y-87 to V-375; E-88 to V-375; T-89 to V-375; L-90 to V-375; Y-91 to V-375; A-92 to V-375; 
N-93 to V-375; G-94 to V-375; S-95 to V-375; R-96 to V-375; T-97 to V-375; E-98 to V-375; T-99 to V-375; 
Q-100 to V-375; V-101 to V-375; G-102 to V-375; I-103 to V-375; Y-104 to V-375; I-105 to V-375; L-106 to V-375; 
S-107 to V-375; S-108 to V-375; S-109 to V-375; G-110 to V-375; D-111 to V-375; G-112 to V-375; A-113 to V-375; 
Q-114 to V-375; H-115 to V-375; R-116 to V-375; D-117 to V-375; S-118 to V-375; G-119 to V-375; S-120 to V-375; 
S-121 to V-375; G-122 to V-375; K-123 to V-375; S-124 to V-375; R-125 to V-375; R-126 to V-375; K-127 to V-375; 
R-128 to V-375; Q-129 to V-375; I-130 to V-375; Y-131 to V-375; G-132 to V-375; Y-133 to V-375; D-134 to V-375; 
S-135 to V-375; R-136 to V-375; F-137 to V-375; S-138 to V-375; I-139 to V-375; F-140 to V-375; G-141 to V-375; 
K-142 to V-375; D-143 to V-375; F-144 to V-375; L-145 to V-375; L-146 to V-375; N-147 to V-375; Y-148 to V-375; 
P-149 to V-375; F-150 to V-375; S-151 to V-375; T-152 to V-375; S-153 to V-375; V-154 to V-375; K-155 to V-375; 
L-156 to V-375; S-157 to V-375; T-158 to V-375; G-159 to V-375; C-160 to V-375; T-161 to V-375; G-162 to V-375; 
T-163 to V-375; L-164 to V-375; V-165 to V-375; A-166 to V-375; E-167 to V-375; K-168 to V-375; H-169 to V-375; 
V-170 to V-375; L-171 to V-375; T-172 to V-375; A-173 to V-375; A-174 to V-375; H-175 to V-375; C-176 to V-375; 
I-177 to V-375; H-178 to V-375; D-179 to V-375; G-180 to V-375; K-181 to V-375; T-182 to V-375; Y-183 to V-375; 
V-184 to V-375; K-185 to V-375; G-186 to V-375; T-187 to V-375; Q-188 to V-375; K-189 to V-375; L-190 to V-375; 
R-191 to V-375; V-192 to V-375; G-193 to V-375; F-194 to V-375; L-195 to V-375; K-196 to V-375; P-197 to V-375; 
K-198 to V-375; F-199 to V-375; K-200 to V-375; D-201 to V-375; G-202 to V-375; G-203 to V-375; R-204 to V-375; 
G-205 to V-375; A-206 to V-375; N-207 to V-375; D-208 to V-375; S-209 to V-375; T-210 to V-375; S-211 to V-375; 
A-212 to V-375; M-213 to V-375; P-214 to V-375; E-215 to V-375; Q-216 to V-375; M-217 to V-375; K-218 to V-375; 
F-219 to V-375; Q-220 to V-375; W-221 to V-375; I-222 to V-375; R-223 to V-375; V-224 to V-375; K-225 to V-375; 
R-226 to V-375; T-227 to V-375; H-228 to V-375; V-229 to V-375; P-230 to V-375; K-231 to V-375; G-232 to V-375; 
W-233 to V-375; I-234 to V-375; K-235 to V-375; G-236 to V-375; N-237 to V-375; A-238 to V-375; N-239 to V-375; 
D-240 to V-375; I-241 to V-375; G-242 to V-375; M-243 to V-375; D-244 to V-375; Y-245 to V-375; D-246 to V-375; 
Y-247 to V-375; A-248 to V-375; L-249 to V-375; L-250 to V-375; E-251 to V-375; L-252 to V-375; K-253 to V-375; 
K-254 to V-375; P-255 to V-375; H-256 to V-375; K-257 to V-375; R-258 to V-375; K-259 to V-375; F-260 to V-375; 
M-261 to V-375; K-262 to V-375; I-263 to V-375; G-264 to V-375; V-265 to V-375; S-266 to V-375; P-267 to V-375; 
P-268 to V-375; A-269 to V-375; K-270 to V-375; Q-271 to V-375; L-272 to V-375; P-273 to V-375; G-274 to V-375; 
G-275 to V-375; R-276 to V-375; I-277 to V-375; H-278 to V-375; F-279 to V-375; S-280 to V-375; G-281 to V-375; 
Y-282 to V-375; D-283 to V-375; N-284 to V-375; D-285 to V-375; R-286 to V-375; P-287 to V-375; G-288 to V-375; 
N-289 to V-375; L-290 to V-375; V-291 to V-375; Y-292 to V-375; R-293 to V-375; F-294 to V-375; C-295 to V-375; 
D-296 to V-375; V-297 to V-375; K-298 to V-375; D-299 to V-375; E-300 to V-375; T-301 to V-375; Y-302 to V-375; 
D-303 to V-375; L-304 to V-375; L-305 to V-375; Y-306 to V-375; Q-307 to V-375; Q-308 to V-375; C-309 to V-375; 
D-310 to V-375; S-311 to V-375; Q-312 to V-375; P-313 to V-375; G-314 to V-375; A-315 to V-375; S-316 to V-375; 
G-317 to V-375; S-318 to V-375; G-319 to V-375; V-320 to V-375; Y-321 to V-375; V-322 to V-375; R-323 to V-375; 
M-324 to V-375; W-325 to V-375; K-326 to V-375; R-327 to V-375; Q-328 to V-375; H-329 to V-375; Q-330 to V-375; 
K-331 to V-375; W-332 to V-375; E-333 to V-375; R-334 to V-375; K-335 to V-375; I-336 to V-375; I-337 to V-375; 
G-338 to V-375; M-339 to V-375; I-340 to V-375; S-341 to V-375; G-342 to V-375; H-343 to V-375; Q-344 to V-375; 
W-345 to V-375; V-346 to V-375; D-347 to V-375; M-348 to V-375; D-349 to V-375; G-350 to V-375; S-351 to V-375; 
P-352 to V-375; Q-353 to V-375; E-354 to V-375; F-355 to V-375; T-356 to V-375; R-357 to V-375; G-358 to V-375; 
C-359 to V-375; S-360 to V-375; E-361 to V-375; I-362 to V-375; T-363 to V-375; P-364 to V-375; L-365 to V-375; 
Q-366 to V-375; Y-367 to V-375; I-368 to V-375; P-369 to V-375; D-370 to V-375; of SEQ ID NO:38. Polypeptides 
encoded by these polynucleotides are also encompassed by the invention. 

Also as mentioned above, even if deletion of one or more amino acids from the 
C-terminus of a protein results in modification or loss of one or more biological 
functions of the protein, other functional activities (e.g., biological activities such as 
ability to modulate the extracellular matrix, etc.) may still be retained. For example, the 
ability to induce and/or bind to antibodies which recognize the complete or mature forms 
of the polypeptide generally will be retained when less than the majority of the residues 
of the complete or mature polypeptide are removed from the C-terminus. Whether a 
particular polypeptide lacking C-terminal residues of a complete polypeptide retains such 
immunologic activities can readily be determined by routine methods described herein 
and otherwise known in the art. It is not unlikely that a mutein with a large number of 
deleted C-terminal amino acid residues may retain some biological or immunogenic 
activities. In fact, peptides composed of as few as six amino acid residues may often 
evoke an immune response. 

Accordingly, the present invention further provides polypeptides having one or 
more residues deleted from the carboxy terminus of the amino acid sequence of the 
polypeptide shown in Figures 25A-B, up to the glycine residue at position number 6, and 
polynucleotides encoding such polypeptides. In particular, the present invention provides 
polypeptides comprising the amino acid sequence of residues 1-m1 of Figures 25A-B, 
where m1 is an integer from 6 to 375 corresponding to the position of the amino acid 
residue in Figures 25A-B. Moreover, the invention provides polynucleotides encoding 
polypeptides comprising, or alternatively consisting of, the amino acid sequence of 
C-terminal deletions of the polypeptide of the invention shown as SEQ ID NO:38, including 
polypeptides comprising the amino acid sequence of residues: 
M-1 to G-374; M-1 to I-373; M-1 to S-372; M-1 to I-371; M-1 to D-370; M-1 to P-369; M-1 to I-368; 
M-1 to Y-367; M-1 to Q-366; M-1 to L-365; M-1 to P-364; M-1 to T-363; M-1 to I-362; M-1 to E-361; 
M-1 to S-360; M-1 to C-359; M-1 to G-358; M-1 to R-357; M-1 to T-356; M-1 to F-355; M-1 to E-354; 
M-1 to Q-353; M-1 to P-352; M-1 to S-351; M-1 to G-350; M-1 to D-349; M-1 to M-348; M-1 to D-347; 
M-1 to V-346; M-1 to W-345; M-1 to Q-344; M-1 to H-343; M-1 to G-342; M-1 to S-341; M-1 to I-340; 
M-1 to M-339; M-1 to G-338; M-1 to I-337; M-1 to I-336; M-1 to K-335; M-1 to R-334; M-1 to E-333; 
M-1 to W-332; M-1 to K-331; M-1 to Q-330; M-1 to H-329; M-1 to Q-328; M-1 to R-327; M-1 to K-326; 
M-1 to W-325; M-1 to M-324; M-1 to R-323; M-1 to V-322; M-1 to Y-321; M-1 to V-320; M-1 to G-319; 
M-1 to S-318; M-1 to G-317; M-1 to S-316; M-1 to A-315; M-1 to G-314; M-1 to P-313; M-1 to Q-312; 
M-1 to S-311; M-1 to D-310; M-1 to C-309; M-1 to Q-308; M-1 to Q-307; M-1 to Y-306; M-1 to L-305; 
M-1 to L-304; M-1 to D-303; M-1 to Y-302; M-1 to T-301; M-1 to E-300; M-1 to D-299; M-1 to K-298; 
M-1 to V-297; M-1 to D-296; M-1 to C-295; M-1 to F-294; M-1 to R-293; M-1 to Y-292; M-1 to V-291; 
M-1 to L-290; M-1 to N-289; M-1 to G-288; M-1 to P-287; M-1 to R-286; M-1 to D-285; M-1 to N-284; 
M-1 to D-283; M-1 to Y-282; M-1 to G-281; M-1 to S-280; M-1 to F-279; M-1 to H-278; M-1 to I-277; 
M-1 to R-276; M-1 to G-275; M-1 to G-274; M-1 to P-273; M-1 to L-272; M-1 to Q-271; M-1 to K-270; 
M-1 to A-269; M-1 to P-268; M-1 to P-267; M-1 to S-266; M-1 to V-265; M-1 to G-264; M-1 to I-263; 
M-1 to K-262; M-1 to M-261; M-1 to F-260; M-1 to K-259; M-1 to R-258; M-1 to K-257; M-1 to H-256; 
M-1 to P-255; M-1 to K-254; M-1 to K-253; M-1 to L-252; M-1 to E-251; M-1 to L-250; M-1 to L-249; 
M-1 to A-248; M-1 to Y-247; M-1 to D-246; M-1 to Y-245; M-1 to D-244; M-1 to M-243; M-1 to G-242; 
M-1 to I-241; M-1 to D-240; M-1 to N-239; M-1 to A-238; M-1 to N-237; M-1 to G-236; M-1 to K-235; 
M-1 to I-234; M-1 to W-233; M-1 to G-232; M-1 to K-231; M-1 to P-230; M-1 to V-229; M-1 to H-228; 
M-1 to T-227; M-1 to R-226; M-1 to K-225; M-1 to V-224; M-1 to R-223; M-1 to I-222; M-1 to W-221; 
M-1 to Q-220; M-1 to F-219; M-1 to K-218; M-1 to M-217; M-1 to Q-216; M-1 to E-215; M-1 to P-214; 
M-1 to M-213; M-1 to A-212; M-1 to S-211; M-1 to T-210; M-1 to S-209; M-1 to D-208; M-1 to N-207; 
M-1 to A-206; M-1 to G-205; M-1 to R-204; M-1 to G-203; M-1 to G-202; M-1 to D-201; M-1 to K-200; 
M-1 to F-199; M-1 to K-198; M-1 to P-197; M-1 to K-196; M-1 to L-195; M-1 to F-194; M-1 to G-193; 
M-1 to V-192; M-1 to R-191; M-1 to L-190; M-1 to K-189; M-1 to Q-188; M-1 to T-187; M-1 to G-186; 
M-1 to K-185; M-1 to V-184; M-1 to Y-183; M-1 to T-182; M-1 to K-181; M-1 to G-180; M-1 to D-179; 
M-1 to H-178; M-1 to I-177; M-1 to C-176; M-1 to H-175; M-1 to A-174; M-1 to A-173; M-1 to T-172; 
M-1 to L-171; M-1 to V-170; M-1 to H-169; M-1 to K-168; M-1 to E-167; M-1 to A-166; M-1 to V-165; 
M-1 to L-164; M-1 to T-163; M-1 to G-162; M-1 to T-161; M-1 to C-160; M-1 to G-159; M-1 to T-158; 
M-1 to S-157; M-1 to L-156; M-1 to K-155; M-1 to V-154; M-1 to S-153; M-1 to T-152; M-1 to S-151; 
M-1 to F-150; M-1 to P-149; M-1 to Y-148; M-1 to N-147; M-1 to L-146; M-1 to L-145; M-1 to F-144; 
M-1 to D-143; M-1 to K-142; M-1 to G-141; M-1 to F-140; M-1 to I-139; M-1 to S-138; M-1 to F-137; 
M-1 to R-136; M-1 to S-135; M-1 to D-134; M-1 to Y-133; M-1 to G-132; M-1 to Y-131; M-1 to I-130; 
M-1 to Q-129; M-1 to R-128; M-1 to K-127; M-1 to R-126; M-1 to R-125; M-1 to S-124; M-1 to K-123; 
M-1 to G-122; M-1 to S-121; M-1 to S-120; M-1 to G-119; M-1 to S-118; M-1 to D-117; M-1 to R-116; 
M-1 to H-115; M-1 to Q-114; M-1 to A-113; M-1 to G-112; M-1 to D-111; M-1 to G-110; M-1 to S-109; 
M-1 to S-108; M-1 to S-107; M-1 to L-106; M-1 to I-105; M-1 to Y-104; M-1 to I-103; M-1 to G-102; 
M-1 to V-101; M-1 to Q-100; M-1 to T-99; M-1 to E-98; M-1 to T-97; M-1 to R-96; M-1 to S-95; 
M-1 to G-94; M-1 to N-93; M-1 to A-92; M-1 to Y-91; M-1 to L-90; M-1 to T-89; M-1 to E-88; 
M-1 to Y-87; M-1 to S-86; M-1 to L-85; M-1 to Y-84; M-1 to Q-83; M-1 to K-82; M-1 to A-81; 
M-1 to E-80; M-1 to E-79; M-1 to Y-78; M-1 to T-77; M-1 to P-76; M-1 to L-75; M-1 to P-74; 
M-1 to T-73; M-1 to G-72; M-1 to K-71; M-1 to H-70; M-1 to C-69; M-1 to Q-68; M-1 to P-67; 
M-1 to G-66; M-1 to C-65; M-1 to S-64; M-1 to S-63; M-1 to S-62; M-1 to V-61; M-1 to E-60; 
M-1 to L-59; M-1 to K-58; M-1 to A-57; M-1 to E-56; M-1 to A-55; M-1 to G-54; M-1 to F-53; 
M-1 to D-52; M-1 to P-51; M-1 to K-50; M-1 to A-49; M-1 to L-48; M-1 to N-47; M-1 to L-46; 
M-1 to T-45; M-1 to S-44; M-1 to Q-43; M-1 to P-42; M-1 to L-41; M-1 to V-40; M-1 to V-39; 
M-1 to P-38; M-1 to L-37; M-1 to R-36; M-1 to Y-35; M-1 to A-34; M-1 to P-33; M-1 to W-32; 
M-1 to T-31; M-1 to P-30; M-1 to K-29; M-1 to W-28; M-1 to P-27; M-1 to A-26; M-1 to S-25; 
M-1 to Y-24; M-1 to P-23; M-1 to S-22; M-1 to V-21; M-1 to Q-20; M-1 to G-19; M-1 to V-18; 
M-1 to A-17; M-1 to C-16; M-1 to L-15; M-1 to L-14; M-1 to F-13; M-1 to F-12; M-1 to L-11; 
M-1 to L-10; M-1 to F-9; M-1 to L-8; M-1 to L-7; 
M-1 to G-6; of SEQ ID NO:38. Polypeptides encoded by these polynucleotides are also 
encompassed by the invention. 
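
The N-terminal and C-terminal deletion series recited above follow a simple rule, which the following Python sketch illustrates. It is offered only as an illustration: the function names and the 10-residue toy sequence are assumptions made for the example and are not part of the disclosure, and the toy sequence is not SEQ ID NO:38.

```python
def n_terminal_deletions(seq, max_start):
    """Yield (label, fragment) for fragments running from residue n to the
    C-terminus, for n = 2 .. max_start (1-based residue numbering)."""
    last = len(seq)
    for n in range(2, max_start + 1):
        yield f"{seq[n - 1]}-{n} to {seq[-1]}-{last}", seq[n - 1:]


def c_terminal_deletions(seq, min_end):
    """Yield (label, fragment) for fragments running from residue 1 to
    residue m, for m = len(seq) - 1 down to min_end."""
    for m in range(len(seq) - 1, min_end - 1, -1):
        yield f"{seq[0]}-1 to {seq[m - 1]}-{m}", seq[:m]


# Hypothetical 10-residue toy sequence used only to show the labelling scheme.
toy = "MAGIPGLLFL"

n_labels = [label for label, _ in n_terminal_deletions(toy, max_start=5)]
c_labels = [label for label, _ in c_terminal_deletions(toy, min_end=6)]
print(n_labels)
print(c_labels)
```

Applied to a full-length sequence of 375 residues with `max_start=370` and `min_end=6`, the same two generators would produce series of the form recited above.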

In addition, the invention provides nucleic acid molecules having nucleotide 
sequences related to extensive portions of SEQ ID NO:20 which have been determined 
from the following related cDNA genes: HFKCF40F (SEQ ID NO:128), HSRDF26R 
(SEQ ID NO:129), HTEBE07R (SEQ ID NO:130), HFTBP82R (SEQ ID NO:131), 
HAQBJ11R (SEQ ID NO:132), HAFBB11R (SEQ ID NO:133), HOEF085R (SEQ ID 
NO:134), and HUVGY95R (SEQ ID NO:135). 

The gene encoding the disclosed cDNA is believed to reside on chromosome 12. 
Accordingly, polynucleotides related to this invention are useful as a marker in linkage 
analysis for chromosome 12. 

This gene is expressed primarily in endothelial cells, fibroblasts, smooth muscle, 
and osteoblasts, and to a lesser extent in brain, heart, placental tissues, lung, and many 
other tissues. Moreover, the transcript is present in HUVEC, HUVEC +LPS, smooth 
muscle, fibroblasts; present in heart, brain, placenta, lung, liver, muscle, kidney, 
pancreas, spleen, thymus, prostate, testes, ovary, small intestine, colon and weakly in 
PBLs. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are not 
limited to, disorders of vascularized tissues. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of the 
above tissues or cells, particularly of the vascular tissues, expression of this gene at 
significantly higher or lower levels is routinely detected in certain tissues or cell types 
(e.g., vascular, skeletal, developmental, neural, cardiovascular, pulmonary, renal, 
immune, hematopoietic, reproductive, gastrointestinal, and cancerous and wounded 
tissues) or bodily fluids (e.g., lymph, seminal fluid, amniotic fluid, serum, plasma, urine, 
synovial fluid and spinal fluid) or another tissue or cell sample taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the expression 
level in healthy tissue or bodily fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic epitopes 
shown in SEQ ID NO:38 as residues: Pro-67 to Thr-73, Pro-76 to Gln-83, Asn-93 to 
Thr-99, His-115 to Arg-128, His-178 to Lys-189, Pro-197 to Ala-212, Val-224 to Trp- 
233, Lys-253 to Lys-259, Ser-280 to Asn-289, Asp-296 to Tyr-302, Gln-308 to Ala-315, 
Arg-327 to Lys-335, and Asp-349 to Gly-358. Polynucleotides encoding said polypeptides 
are also provided. 

The tissue distribution in the vascularized endothelial cells indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for the diagnosis 
and treatment of diseases of vascularized tissues, such as atherosclerosis, ataxia 
malabsorption, and hyperlipidemia. These and other factors often result in other 
cardiovascular disease. Furthermore, the translation product of this gene is useful for the 
treatment of wounds, and may facilitate the wound healing process. Moreover, the 
protein is useful in the detection, treatment, and/or prevention of a variety of vascular 
disorders and conditions, which include, but are not limited to, microvascular disease, 
vascular leak syndrome, aneurysm, stroke, embolism, thrombosis, coronary artery 
disease, arteriosclerosis, and/or atherosclerosis. Based upon the tissue distribution of this 
protein, antagonists directed against this protein are useful in blocking the activity of this 
protein. Accordingly, preferred are antibodies which specifically bind a portion of the 
translation product of this gene. 

Also provided is a kit for detecting tumors in which expression of this protein 
occurs. Such a kit comprises in one embodiment an antibody specific for the translation 
product of this gene bound to a solid support. Also provided is a method of detecting 
these tumors in an individual which comprises a step of contacting an antibody specific 
for the translation product of this gene to a bodily fluid from the individual, preferably 
serum, and ascertaining whether antibody binds to an antigen found in the bodily fluid. 
Preferably the antibody is bound to a solid support and the bodily fluid is serum. The 
above embodiments, as well as other treatments and diagnostic tests (kits and methods), 
are more particularly described elsewhere herein. Furthermore, the protein may also be 
used to determine biological activity, to raise antibodies, as tissue markers, to isolate 
cognate ligands or receptors, to identify agents that modulate their interactions, in 
addition to its use as a nutritional supplement. The protein, as well as antibodies directed 
against the protein, may show utility as a tumor marker and/or immunotherapy target for 
the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:20 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of a- 
b, where a is any integer between 1 to 1685 of SEQ ID NO:20, b is an integer of 15 to 
1699, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:20, and where b is greater than or equal to a + 14. 
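
The general formula a-b can be read as a simple membership test on a pair of nucleotide positions. The sketch below is only an illustration under stated assumptions: the function names are hypothetical, and the bounds recited above are assumed to be inclusive.

```python
SEQ_LEN = 1699   # length of SEQ ID NO:20 implied by the upper bound on b
MIN_LEN = 15     # shortest span allowed by the condition b >= a + 14


def is_in_formula(a, b, seq_len=SEQ_LEN, min_len=MIN_LEN):
    """Return True if positions (a, b) satisfy the general formula a-b,
    assuming inclusive bounds: 1 <= a <= 1685, 15 <= b <= 1699, b >= a + 14."""
    return (1 <= a <= seq_len - min_len + 1
            and min_len <= b <= seq_len
            and b >= a + min_len - 1)


def count_ranges(seq_len=SEQ_LEN, min_len=MIN_LEN):
    """Count every (a, b) pair admitted by the formula."""
    return sum(seq_len - (a + min_len - 1) + 1
               for a in range(1, seq_len - min_len + 2))


print(is_in_formula(1, 15), is_in_formula(1685, 1699), is_in_formula(1686, 1699))
print(count_ranges())
```

Under these assumptions the count illustrates why listing every related sequence individually would be cumbersome.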

FEATURES OF PROTEIN ENCODED BY GENE NO: 11 

The translation product of this gene shares sequence homology with Cytotoxic- 
Regulatory T-Cell Associated Molecule (CRTAM) protein, which is thought to be 
important in the regulation of cellular physiology, development, differentiation or 
function of various cell types, including haematopoietic cells and various T-cell 
progenitors. See for example, PCT publication WO 96/34102 incorporated herein by 
reference in its entirety. Moreover, the protein product of this gene also shares homology 
with the thymocyte activation and developmental protein and the class-I MHC-restricted 
T cell associated molecule (See Genbank Accession Nos. gi|2665790, gb|AAB88491.1, 
gb|AAC80267.1, and gi|3930163; all information and references contained within these 
accessions are hereby incorporated herein by reference). Based on the sequence 
similarity, the translation product of this gene is expected to share at least some 
biological activities with T-cell modulatory proteins. Such activities are known in the art, 
some of which are described elsewhere herein. 

Preferred polypeptides of the invention comprise the following amino acid 
sequence: 

MASVVLPSGSQCAAAAAAAAPPGLRLRLLLLLFSAAALIPTGDGQNLFTKDVTVI 

EGEVATISCQVNKSDDSVIQLLNPNRQTIYFRDFRPLKDSRFQLLNFSSSELKVSL 

TNVSISDEGRYFCQLYTDPPQESYTTITVLVPPRNLMIDIQKDTAVEGEEIEVNCT 

AMASKPATTIRWFKGNTELKGKSEVEEWSDMYTVTSQLMLKVHKEDDGVPVIC 

QVEHPAVTGNLQTQRYLEVQYKPQVHIQMTYPLQGLTREGDALELTCEAIGKPQ 

PVMVTWVRVDDEMPQHAVLSGPNLFINNLNKTDNGTYRCEASNIVGKAHSDY 

MLYVYDPPTTIPPPTTTTTTTTTTTT^ 

VVFAMLCLLIILGRYFARHKGTYFTHEAKGADDAADADTAIINAEGGQNNSEEK 
KEYFI (SEQ ID NO:136). Polynucleotides encoding these polypeptides are also 
provided. 

The polypeptide of this latter embodiment has been determined to have a 
transmembrane domain at about amino acid position 379 - 395 of the amino acid 
sequence referenced in Table XIII for this gene. Moreover, a cytoplasmic tail 
encompassing amino acids 396 to 442 of this protein has also been determined. Based 
upon these characteristics, it is believed that the protein product of this gene shares 
structural features with type Ia membrane proteins. 

Preferred polynucleotides comprise the following sequence: 
ATGGCGAGTGTAGTGC 

TGCCGAGCGGATCCCAGTGTGCGGCGGCAGCGGCGGCGGCGGCGCCTCCCG 

GGCTCCGGCTCCGGCTTCTGCTGTTGCTCTTCTCCGCCGCGGCACTGATCCCC 

ACAGGTGATGGGCAGAATCTGTTTACGAAAGACGTGACAGTGATCGAGGGA 

GAGGTTGCGACCATCAGTTGCCAAGTCAATAAGAGTGACGACTCTGTGATTC 

AGCTACTGAATCCCAACAGGCAGACCATTTATTTCAGGGACTTCAGGCCTTT 

GAAGGACAGCAGGTTTCAGTTGCTGAATTTTTCTAGCAGTGAACTCAAAGTA 

TCATTGACAAACGTCTCAATTTCTGATGAAGGAAGATACTTTTGCCAGCTCTA 

TACCGATCCCCCACAGGAAAGTTACACCACCATCACAGTCCTGGTCCCACCA 

CGTAATCTGATGATCGATATCCAGAAAGACACTGCGGTGGAAGGTGAGGAG 

ATTGAAGTCAACTGCACTGCTATGGCCAGCAAGCCAGCCACGACTATCAGGT 

GGTTCAAAGGGAACACAGAGCTAAAAGGCAAATCGGAGGTGGAAGAGTGGT 

CAGACATGTACACTGTGACCAGTCAGCTGATGCTGAAGGTGCACAAGGAGG 

ACGATGGGGTCCCAGTGATCTGCCAGGTGGAGCACCCTGCGGTCACTGGAAA 

CCTGCAGACCCAGCGGTATCTAGAAGTACAGTATAAGCCTCAAGTGCACATT 

CAGATGACTTATCCTCTACAAGGCTTAACCCGGGAAGGGGACGCGCTTGAGT 

TAACATGTGAAGCCATCGGGAAGCCCCAGCCTGTGATGGTAACTTGGGTGAG 

AGTCGATGATGAAATGCCTCAACACGCCGTACTGTCTGGGCCCAACCTGTTC 
ATCAATAACCTAAACAAAACAGATAATGGTACATACCGCTGTGAAGCTTCAA 
ACATAGTGGGGAAAGCTCACTCGGATTATATGCTGTATGTATACGATCCCCC 
CACAACTATCCCTCCTCCCACAACAACCACCACCACCACCACCACCACCACC 
ACCACCATCCTTACCATCATCACAGATTCCCGAGCAGGTGAAGAAGGCTCGA 
TCAGGGCAGTGGATCATGCCGTGATCGGTGGCGTCGTGGCGGTGGTGGTGTT 
CGCCATGCTGTGCTTGCTCATCATTCTGGGGCGCTATTTTGCCAGACATAAAG 
GTACATACTTCACTCATGAAGCCAAAGGAGCCGATGACGCAGCAGACGCAG 
ACACAGCTATAATCAATGCAGAAGGAGGACAGAACAACTCCGAAGAAAAGA 
AAGAGTACTTCATCTAG (SEQ ID NO:137). Also preferred are the polypeptides 
encoded by these polynucleotides. 
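
The relationship between a coding polynucleotide and the polypeptide it encodes can be illustrated with a short translation routine. The following Python sketch assumes the standard genetic code and an open reading frame beginning at the first nucleotide; the names `CODON_TABLE` and `translate` are chosen for the example and are not part of the disclosure. Only the first and last few codons of the sequence recited above are used as input.

```python
BASES = "TCAG"
# Standard genetic code, listed in TCAG order for each codon position.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA coding sequence in frame 1, stopping at the first
    stop codon. Whitespace is ignored; ambiguous bases raise KeyError."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGGCGAGTGTAGTGCTGCCG"))   # first seven codons recited above
print(translate("AAAGAGTACTTCATCTAG"))      # last codons, ending in a stop
```

The two fragments translate to the beginning and the end of the amino acid sequence shown as SEQ ID NO:136.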

Figures 28A-B show the nucleotide (SEQ ID NO:21) and deduced amino acid 
sequence (SEQ ID NO:39) of the present invention. Predicted amino acids from about 1 
to about 44 constitute the predicted signal peptide (amino acid residues from about 1 to 
about 44 in SEQ ID NO:39) and are represented by the underlined amino acid regions. 

Figure 29 shows the regions of similarity between the amino acid sequences of 
the present invention SEQ ID NO:39, the human poliovirus receptor protein (gi|1524088) 
(SEQ ID NO:138), the human class-I MHC-restricted T cell associated molecule 
(WO9634102) (SEQ ID NO:144), and the Gallus gallus thymocyte activation and 
developmental protein (gb|AAB88491.1) (SEQ ID NO:145). 

Figure 30 shows an analysis of the amino acid sequence of SEQ ID NO:39. 
Alpha, beta, turn and coil regions; hydrophilicity and hydrophobicity; amphipathic 
regions; flexible regions; antigenic index and surface probability are shown. 

The present invention provides isolated nucleic acid molecules comprising a 
polynucleotide encoding the polypeptide having the amino acid sequence shown in 
Figures 28A-B (SEQ ID NO:39), which was determined by sequencing a cloned cDNA. 
The nucleotide sequence shown in Figures 28A-B (SEQ ID NO:21) was obtained by 
sequencing a cloned cDNA (HOUDJ81), which was deposited on Nov. 17, 1998 at the 
American Type Culture Collection, and given Accession Number 203484. The deposited 
gene is inserted in the pSport plasmid (Life Technologies, Rockville, MD) using the 
SalI/NotI restriction endonuclease cleavage sites. 

The present invention is further directed to fragments of the isolated nucleic acid 
molecules described herein. By a fragment of an isolated DNA molecule having the 
nucleotide sequence of the deposited cDNA or the nucleotide sequence shown in SEQ ID 
NO:21 is intended DNA fragments at least about 15 nt, and more preferably at least about 
20 nt, still more preferably at least about 30 nt, and even more preferably, at least about 
40 nt in length which are useful as diagnostic probes and primers as discussed herein. Of 
course, larger fragments 50-1500 nt in length are also useful according to the present 
invention, as are fragments corresponding to most, if not all, of the nucleotide sequence 
of the deposited cDNA or as shown in SEQ ID NO:21. By a fragment at least 20 nt in 
length, for example, is intended fragments which include 20 or more contiguous bases 
from the nucleotide sequence of the deposited cDNA or the nucleotide sequence as 
shown in SEQ ID NO:21. In this context "about" includes the particularly recited size, 
larger or smaller by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at both 
termini. Representative examples of polynucleotide fragments of the invention include, 
for example, fragments that comprise, or alternatively, consist of, a sequence from about 
nucleotide 1 to about 50, from about 51 to about 100, from about 101 to about 150, from 
about 151 to about 200, from about 201 to about 250, from about 251 to about 300, from 
about 301 to about 350, from about 351 to about 400, from about 401 to about 450, from 
about 451 to about 500, and from about 501 to about 550, and from about 551 to about 
600, from about 601 to about 650, from about 651 to about 700, from about 701 to about 
750, from about 751 to about 800, from about 801 to about 850, from about 851 to about 
900, from about 901 to about 950, from about 951 to about 1000, from about 1001 to 
about 1050, from about 1051 to about 1100, from about 1101 to about 1150, from about 
1151 to about 1200, from about 1201 to about 1250, from about 1251 to about 1300, 
from about 1301 to about 1350, from about 1351 to about 1400, from about 1401 to 
about 1450, from about 1451 to about 1500, from about 1501 to about 1520 of SEQ ID 
NO:21, or the complementary strand thereto, or the cDNA contained in the deposited 
gene. In this context "about" includes the particularly recited ranges, larger or smaller by 
several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at both termini. In additional 
embodiments, the polynucleotides of the invention encode functional attributes of the 
corresponding protein. 
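
The representative fragment ranges listed above are consecutive 50-nucleotide windows, and "about" allows each endpoint to shift by up to 5 nucleotides. A minimal Python sketch of that scheme follows; the function names and the decision to express the tolerance as a single parameter are assumptions made for illustration.

```python
def fragment_windows(total_len, width=50):
    """Return consecutive (start, end) windows, 1-based and inclusive,
    covering positions 1..total_len; the last window may be shorter."""
    return [(start, min(start + width - 1, total_len))
            for start in range(1, total_len + 1, width)]


def within_about(position, recited, tolerance=5):
    """True if a position lies within the 'about' tolerance of a recited one."""
    return abs(position - recited) <= tolerance


windows = fragment_windows(1520)
print(windows[0], windows[1], windows[-1], len(windows))
print(within_about(53, 51), within_about(57, 51))
```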

Preferred embodiments of the invention in this regard include fragments that 
comprise alpha-helix and alpha-helix forming regions ("alpha-regions"), beta-sheet and 
beta-sheet forming regions ("beta-regions"), turn and turn-forming regions ("turn- 
regions"), coil and coil-forming regions ("coil-regions"), hydrophilic regions, 
hydrophobic regions, alpha amphipathic regions, beta amphipathic regions, flexible 
regions, surface-forming regions and high antigenic index regions. The data representing 
the structural or functional attributes of the protein set forth in Figure 30 and/or Table X, 
as described above, was generated using the various modules and algorithms of the 
DNA*STAR set on default parameters. In a preferred embodiment, the data presented in 
columns VIII, IX, XIII, and XIV of Table X can be used to determine regions of the 
protein which exhibit a high degree of potential for antigenicity. Regions of high 
antigenicity are determined from the data presented in columns VIII, IX, XIII, and/or 
XIV by choosing values which represent regions of the polypeptide which are likely to 
be exposed on the surface of the polypeptide in an environment in which antigen 
recognition may occur in the process of initiation of an immune response. 
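
By way of illustration only, one of the per-residue analyses of this kind, a hydropathy profile, can be approximated with a sliding-window average over the published Kyte-Doolittle hydropathy scale. The sketch below is not the DNA*STAR implementation and does not reproduce its default parameters; the window size of 7 and the function name are assumptions made for the example.

```python
# Kyte-Doolittle hydropathy values (positive = hydrophobic).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(seq, window=7):
    """Return the mean hydropathy of every full-length window in seq."""
    return [
        sum(KYTE_DOOLITTLE[residue] for residue in seq[i:i + window]) / window
        for i in range(len(seq) - window + 1)
    ]


# Two hypothetical 7-residue stretches: one hydrophobic, one acidic.
print(hydropathy_profile("LLLLLLL"))
print(hydropathy_profile("DDDDDDD"))
```

Windows with strongly negative averages are hydrophilic and therefore more likely to be surface exposed, which is the property the preceding paragraph relies on when selecting candidate antigenic regions.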

Certain preferred regions in these regards are set out in Figure 30, but may, as 
shown in Table X, be represented or identified by using tabular representations of the 
data presented in Figure 30. The DNA*STAR computer algorithm used to generate 
Figure 30 (set on the original default parameters) was used to present the data in Figure 
30 in a tabular format (See Table X). The tabular format of the data in Figure 30 is used 
to easily determine specific boundaries of a preferred region. The above-mentioned 
preferred regions set out in Figure 30 and in Table X include, but are not limited to, 
regions of the aforementioned types identified by analysis of the amino acid sequence set 
out in Figures 28A-B. As set out in Figure 30 and in Table X, such preferred regions 
include Garnier-Robson alpha-regions, beta-regions, turn-regions, and coil-regions, 
Chou-Fasman alpha-regions, beta-regions, and turn-regions, Kyte-Doolittle hydrophilic 
regions and Hopp-Woods hydrophobic regions, Eisenberg alpha- and beta-amphipathic 
regions, Karplus-Schulz flexible regions, Jameson-Wolf regions of high antigenic index 
and Emini surface-forming regions. 

Even if deletion of one or more amino acids from the 
N-terminus of a protein results in modification or loss of one or more biological 
functions of the protein, other functional activities (e.g., biological activities, ability to 
multimerize, etc.) may still be retained. For example, the ability of shortened muteins to 
induce and/or bind to antibodies which recognize the complete or mature forms of the 
polypeptides generally will be retained when less than the majority of the residues of the 
complete or mature polypeptide are removed from the N-terminus. Whether a particular 
polypeptide lacking N-terminal residues of a complete polypeptide retains such 
immunologic activities can readily be determined by routine methods described herein 
and otherwise known in the art. It is not unlikely that a mutein with a large number of 
deleted N-terminal amino acid residues may retain some biological or immunogenic 
activities. In fact, peptides composed of as few as six amino acid residues may often 
evoke an immune response. 

Accordingly, the present invention further provides polypeptides having one or 
more residues deleted from the amino terminus of the amino acid sequence shown in 
Figures 28A-B, up to the threonine residue at position number 359 and polynucleotides 
encoding such polypeptides. In particular, the present invention provides polypeptides 
comprising the amino acid sequence of residues n1-364 of Figures 28A-B, where n1 is an
integer from 2 to 359 corresponding to the position of the amino acid residue in Figures
28A-B (which is identical to the sequence shown as SEQ ID NO:39). N-terminal
deletions of the polypeptide of the invention shown as SEQ ID NO:39 include
polypeptides comprising the amino acid sequence of residues:
A-2 to R-364; S-3 to R-364; V-4 to R-364; V-5 to R-364; L-6 to R-364;
P-7 to R-364; S-8 to R-364; G-9 to R-364; S-10 to R-364; Q-11 to R-364;
C-12 to R-364; A-13 to R-364; A-14 to R-364; A-15 to R-364; A-16 to R-364;
A-17 to R-364; A-18 to R-364; A-19 to R-364; A-20 to R-364; P-21 to R-364;
P-22 to R-364; G-23 to R-364; L-24 to R-364; R-25 to R-364; L-26 to R-364;
R-27 to R-364; L-28 to R-364; L-29 to R-364; L-30 to R-364; L-31 to R-364;
L-32 to R-364; F-33 to R-364; S-34 to R-364; A-35 to R-364; A-36 to R-364;
A-37 to R-364; L-38 to R-364; I-39 to R-364; P-40 to R-364; T-41 to R-364;
G-42 to R-364; D-43 to R-364; G-44 to R-364; Q-45 to R-364; N-46 to R-364;
L-47 to R-364; F-48 to R-364; T-49 to R-364; K-50 to R-364; D-51 to R-364;
V-52 to R-364; T-53 to R-364; V-54 to R-364; I-55 to R-364; E-56 to R-364;
G-57 to R-364; E-58 to R-364; V-59 to R-364; A-60 to R-364; T-61 to R-364;
I-62 to R-364; S-63 to R-364; C-64 to R-364; Q-65 to R-364; V-66 to R-364;
N-67 to R-364; K-68 to R-364; S-69 to R-364; D-70 to R-364; D-71 to R-364;
S-72 to R-364; V-73 to R-364; I-74 to R-364; Q-75 to R-364; L-76 to R-364;
L-77 to R-364; N-78 to R-364; P-79 to R-364; N-80 to R-364; R-81 to R-364;
Q-82 to R-364; T-83 to R-364; I-84 to R-364; Y-85 to R-364; F-86 to R-364;
R-87 to R-364; D-88 to R-364; F-89 to R-364; R-90 to R-364; P-91 to R-364;
L-92 to R-364; K-93 to R-364; D-94 to R-364; S-95 to R-364; R-96 to R-364;
F-97 to R-364; Q-98 to R-364; L-99 to R-364; L-100 to R-364; N-101 to R-364;
F-102 to R-364; S-103 to R-364; S-104 to R-364; S-105 to R-364; E-106 to R-364;
L-107 to R-364; K-108 to R-364; V-109 to R-364; S-110 to R-364; L-111 to R-364;
T-112 to R-364; N-113 to R-364; V-114 to R-364; S-115 to R-364; I-116 to R-364;
S-117 to R-364; D-118 to R-364; E-119 to R-364; G-120 to R-364; R-121 to R-364;
Y-122 to R-364; F-123 to R-364; C-124 to R-364; Q-125 to R-364; L-126 to R-364;
Y-127 to R-364; T-128 to R-364; D-129 to R-364; P-130 to R-364; P-131 to R-364;
Q-132 to R-364; E-133 to R-364; S-134 to R-364; Y-135 to R-364; T-136 to R-364;
T-137 to R-364; I-138 to R-364; T-139 to R-364; V-140 to R-364; L-141 to R-364;
V-142 to R-364; P-143 to R-364; P-144 to R-364; R-145 to R-364; N-146 to R-364;
L-147 to R-364; M-148 to R-364; I-149 to R-364; D-150 to R-364; I-151 to R-364;
Q-152 to R-364; K-153 to R-364; D-154 to R-364; T-155 to R-364; A-156 to R-364;
V-157 to R-364; E-158 to R-364; G-159 to R-364; E-160 to R-364; E-161 to R-364;
I-162 to R-364; E-163 to R-364; V-164 to R-364; N-165 to R-364; C-166 to R-364;
T-167 to R-364; A-168 to R-364; M-169 to R-364; A-170 to R-364; S-171 to R-364;
K-172 to R-364; P-173 to R-364; A-174 to R-364; T-175 to R-364; T-176 to R-364;
I-177 to R-364; R-178 to R-364; W-179 to R-364; F-180 to R-364; K-181 to R-364;
G-182 to R-364; N-183 to R-364; T-184 to R-364; E-185 to R-364; L-186 to R-364;
K-187 to R-364; G-188 to R-364; K-189 to R-364; S-190 to R-364; E-191 to R-364;
V-192 to R-364; E-193 to R-364; E-194 to R-364; W-195 to R-364; S-196 to R-364;
D-197 to R-364; M-198 to R-364; Y-199 to R-364; T-200 to R-364; V-201 to R-364;
T-202 to R-364; S-203 to R-364; Q-204 to R-364; L-205 to R-364; M-206 to R-364;
L-207 to R-364; K-208 to R-364; V-209 to R-364; H-210 to R-364; K-211 to R-364;
E-212 to R-364; D-213 to R-364; D-214 to R-364; G-215 to R-364; V-216 to R-364;
P-217 to R-364; V-218 to R-364; I-219 to R-364; C-220 to R-364; Q-221 to R-364;
V-222 to R-364; E-223 to R-364; H-224 to R-364; P-225 to R-364; A-226 to R-364;
V-227 to R-364; T-228 to R-364; G-229 to R-364; N-230 to R-364; L-231 to R-364;
Q-232 to R-364; T-233 to R-364; Q-234 to R-364; R-235 to R-364; Y-236 to R-364;
L-237 to R-364; E-238 to R-364; V-239 to R-364; Q-240 to R-364; Y-241 to R-364;
K-242 to R-364; P-243 to R-364; Q-244 to R-364; V-245 to R-364; H-246 to R-364;
I-247 to R-364; Q-248 to R-364; M-249 to R-364; T-250 to R-364; Y-251 to R-364;
P-252 to R-364; L-253 to R-364; Q-254 to R-364; G-255 to R-364; L-256 to R-364;
T-257 to R-364; R-258 to R-364; E-259 to R-364; G-260 to R-364; D-261 to R-364;
A-262 to R-364; L-263 to R-364; E-264 to R-364; L-265 to R-364; T-266 to R-364;
C-267 to R-364; E-268 to R-364; A-269 to R-364; I-270 to R-364; G-271 to R-364;
K-272 to R-364; P-273 to R-364; Q-274 to R-364; P-275 to R-364; V-276 to R-364;
M-277 to R-364; V-278 to R-364; T-279 to R-364; W-280 to R-364; V-281 to R-364;
R-282 to R-364; V-283 to R-364; D-284 to R-364; D-285 to R-364; E-286 to R-364;
M-287 to R-364; P-288 to R-364; Q-289 to R-364; H-290 to R-364; A-291 to R-364;
V-292 to R-364; L-293 to R-364; S-294 to R-364; G-295 to R-364; P-296 to R-364;
N-297 to R-364; L-298 to R-364; F-299 to R-364; I-300 to R-364; N-301 to R-364;
N-302 to R-364; L-303 to R-364; N-304 to R-364; K-305 to R-364; T-306 to R-364;
D-307 to R-364; N-308 to R-364; G-309 to R-364; T-310 to R-364; Y-311 to R-364;
R-312 to R-364; C-313 to R-364; E-314 to R-364; A-315 to R-364; S-316 to R-364;
N-317 to R-364; I-318 to R-364; V-319 to R-364; G-320 to R-364; K-321 to R-364;
A-322 to R-364; H-323 to R-364; S-324 to R-364; D-325 to R-364; Y-326 to R-364;
M-327 to R-364; L-328 to R-364; Y-329 to R-364; V-330 to R-364; Y-331 to R-364;
D-332 to R-364; P-333 to R-364; P-334 to R-364; T-335 to R-364; T-336 to R-364;
I-337 to R-364; P-338 to R-364; P-339 to R-364; P-340 to R-364; T-341 to R-364;
T-342 to R-364; T-343 to R-364; T-344 to R-364; T-345 to R-364; T-346 to R-364;
T-347 to R-364; T-348 to R-364; T-349 to R-364; T-350 to R-364; T-351 to R-364;
T-352 to R-364; T-353 to R-364; I-354 to R-364; L-355 to R-364; T-356 to R-364;
I-357 to R-364; I-358 to R-364; T-359 to R-364; of SEQ ID NO:39. Polypeptides
encoded by these polynucleotides are also encompassed by the invention.

Also as mentioned above, even if deletion of one or more amino acids from the 
C-terminus of a protein results in modification or loss of one or more biological 
functions of the protein, other functional activities (e.g., biological activities such as 
ability to modulate the extracellular matrix, etc.) may still be retained. For example, the
ability to induce and/or bind to antibodies which recognize the complete or mature forms 
of the polypeptide generally will be retained when less than the majority of the residues 
of the complete or mature polypeptide are removed from the C-terminus. Whether a 
particular polypeptide lacking C-terminal residues of a complete polypeptide retains such 
immunologic activities can readily be determined by routine methods described herein
and otherwise known in the art. It is not unlikely that a mutein with a large number of
deleted C-terminal amino acid residues may retain some biological or immunogenic 
activities. In fact, peptides composed of as few as six amino acid residues may often 
evoke an immune response. 

Accordingly, the present invention further provides polypeptides having one or 
more residues deleted from the carboxy terminus of the amino acid sequence of the 
polypeptide shown in Figures 28A-B, up to the leucine residue at position number 6, and 
polynucleotides encoding such polypeptides. In particular, the present invention provides 
polypeptides comprising the amino acid sequence of residues 1-m1 of Figures 28A-B,
where m1 is an integer from 6 to 364 corresponding to the position of the amino acid
residue in Figures 28A-B. Moreover, the invention provides polynucleotides encoding
polypeptides comprising, or alternatively consisting of, the amino acid sequence of
C-terminal deletions of the polypeptide of the invention shown as SEQ ID NO:39. Such
C-terminal deletions include polypeptides comprising the amino acid sequence of residues:
M-1 to A-363; M-1 to R-362; M-1 to S-361; M-1 to D-360; M-1 to T-359;
M-1 to I-358; M-1 to I-357; M-1 to T-356; M-1 to L-355; M-1 to I-354;
M-1 to T-353; M-1 to T-352; M-1 to T-351; M-1 to T-350; M-1 to T-349;
M-1 to T-348; M-1 to T-347; M-1 to T-346; M-1 to T-345; M-1 to T-344;
M-1 to T-343; M-1 to T-342; M-1 to T-341; M-1 to P-340; M-1 to P-339;
M-1 to P-338; M-1 to I-337; M-1 to T-336; M-1 to T-335; M-1 to P-334;
M-1 to P-333; M-1 to D-332; M-1 to Y-331; M-1 to V-330; M-1 to Y-329;
M-1 to L-328; M-1 to M-327; M-1 to Y-326; M-1 to D-325; M-1 to S-324;
M-1 to H-323; M-1 to A-322; M-1 to K-321; M-1 to G-320; M-1 to V-319;
M-1 to I-318; M-1 to N-317; M-1 to S-316; M-1 to A-315; M-1 to E-314;
M-1 to C-313; M-1 to R-312; M-1 to Y-311; M-1 to T-310; M-1 to G-309;
M-1 to N-308; M-1 to D-307; M-1 to T-306; M-1 to K-305; M-1 to N-304;
M-1 to L-303; M-1 to N-302; M-1 to N-301; M-1 to I-300; M-1 to F-299;
M-1 to L-298; M-1 to N-297; M-1 to P-296; M-1 to G-295; M-1 to S-294;
M-1 to L-293; M-1 to V-292; M-1 to A-291; M-1 to H-290; M-1 to Q-289;
M-1 to P-288; M-1 to M-287; M-1 to E-286; M-1 to D-285; M-1 to D-284;
M-1 to V-283; M-1 to R-282; M-1 to V-281; M-1 to W-280; M-1 to T-279;
M-1 to V-278; M-1 to M-277; M-1 to V-276; M-1 to P-275; M-1 to Q-274;
M-1 to P-273; M-1 to K-272; M-1 to G-271; M-1 to I-270; M-1 to A-269;
M-1 to E-268; M-1 to C-267; M-1 to T-266; M-1 to L-265; M-1 to E-264;
M-1 to L-263; M-1 to A-262; M-1 to D-261; M-1 to G-260; M-1 to E-259;
M-1 to R-258; M-1 to T-257; M-1 to L-256; M-1 to G-255; M-1 to Q-254;
M-1 to L-253; M-1 to P-252; M-1 to Y-251; M-1 to T-250; M-1 to M-249;
M-1 to Q-248; M-1 to I-247; M-1 to H-246; M-1 to V-245; M-1 to Q-244;
M-1 to P-243; M-1 to K-242; M-1 to Y-241; M-1 to Q-240; M-1 to V-239;
M-1 to E-238; M-1 to L-237; M-1 to Y-236; M-1 to R-235; M-1 to Q-234;
M-1 to T-233; M-1 to Q-232; M-1 to L-231; M-1 to N-230; M-1 to G-229;
M-1 to T-228; M-1 to V-227; M-1 to A-226; M-1 to P-225; M-1 to H-224;
M-1 to E-223; M-1 to V-222; M-1 to Q-221; M-1 to C-220; M-1 to I-219;
M-1 to V-218; M-1 to P-217; M-1 to V-216; M-1 to G-215; M-1 to D-214;
M-1 to D-213; M-1 to E-212; M-1 to K-211; M-1 to H-210; M-1 to V-209;
M-1 to K-208; M-1 to L-207; M-1 to M-206; M-1 to L-205; M-1 to Q-204;
M-1 to S-203; M-1 to T-202; M-1 to V-201; M-1 to T-200; M-1 to Y-199;
M-1 to M-198; M-1 to D-197; M-1 to S-196; M-1 to W-195; M-1 to E-194;
M-1 to E-193; M-1 to V-192; M-1 to E-191; M-1 to S-190; M-1 to K-189;
M-1 to G-188; M-1 to K-187; M-1 to L-186; M-1 to E-185; M-1 to T-184;
M-1 to N-183; M-1 to G-182; M-1 to K-181; M-1 to F-180; M-1 to W-179;
M-1 to R-178; M-1 to I-177; M-1 to T-176; M-1 to T-175; M-1 to A-174;
M-1 to P-173; M-1 to K-172; M-1 to S-171; M-1 to A-170; M-1 to M-169;
M-1 to A-168; M-1 to T-167; M-1 to C-166; M-1 to N-165; M-1 to V-164;
M-1 to E-163; M-1 to I-162; M-1 to E-161; M-1 to E-160; M-1 to G-159;
M-1 to E-158; M-1 to V-157; M-1 to A-156; M-1 to T-155; M-1 to D-154;
M-1 to K-153; M-1 to Q-152; M-1 to I-151; M-1 to D-150; M-1 to I-149;
M-1 to M-148; M-1 to L-147; M-1 to N-146; M-1 to R-145; M-1 to P-144;
M-1 to P-143; M-1 to V-142; M-1 to L-141; M-1 to V-140; M-1 to T-139;
M-1 to I-138; M-1 to T-137; M-1 to T-136; M-1 to Y-135; M-1 to S-134;
M-1 to E-133; M-1 to Q-132; M-1 to P-131; M-1 to P-130; M-1 to D-129;
M-1 to T-128; M-1 to Y-127; M-1 to L-126; M-1 to Q-125; M-1 to C-124;
M-1 to F-123; M-1 to Y-122; M-1 to R-121; M-1 to G-120; M-1 to E-119;
M-1 to D-118; M-1 to S-117; M-1 to I-116; M-1 to S-115; M-1 to V-114;
M-1 to N-113; M-1 to T-112; M-1 to L-111; M-1 to S-110; M-1 to V-109;
M-1 to K-108; M-1 to L-107; M-1 to E-106; M-1 to S-105; M-1 to S-104;
M-1 to S-103; M-1 to F-102; M-1 to N-101; M-1 to L-100; M-1 to L-99;
M-1 to Q-98; M-1 to F-97; M-1 to R-96; M-1 to S-95; M-1 to D-94;
M-1 to K-93; M-1 to L-92; M-1 to P-91; M-1 to R-90; M-1 to F-89;
M-1 to D-88; M-1 to R-87; M-1 to F-86; M-1 to Y-85; M-1 to I-84;
M-1 to T-83; M-1 to Q-82; M-1 to R-81; M-1 to N-80; M-1 to P-79;
M-1 to N-78; M-1 to L-77; M-1 to L-76; M-1 to Q-75; M-1 to I-74;
M-1 to V-73; M-1 to S-72; M-1 to D-71; M-1 to D-70; M-1 to S-69;
M-1 to K-68; M-1 to N-67; M-1 to V-66; M-1 to Q-65; M-1 to C-64;
M-1 to S-63; M-1 to I-62; M-1 to T-61; M-1 to A-60; M-1 to V-59;
M-1 to E-58; M-1 to G-57; M-1 to E-56; M-1 to I-55; M-1 to V-54;
M-1 to T-53; M-1 to V-52; M-1 to D-51; M-1 to K-50; M-1 to T-49;
M-1 to F-48; M-1 to L-47; M-1 to N-46; M-1 to Q-45; M-1 to G-44;
M-1 to D-43; M-1 to G-42; M-1 to T-41; M-1 to P-40; M-1 to I-39;
M-1 to L-38; M-1 to A-37; M-1 to A-36; M-1 to A-35; M-1 to S-34;
M-1 to F-33; M-1 to L-32; M-1 to L-31; M-1 to L-30; M-1 to L-29;
M-1 to L-28; M-1 to R-27; M-1 to L-26; M-1 to R-25; M-1 to L-24;
M-1 to G-23; M-1 to P-22; M-1 to P-21; M-1 to A-20; M-1 to A-19;
M-1 to A-18; M-1 to A-17; M-1 to A-16; M-1 to A-15; M-1 to A-14;
M-1 to A-13; M-1 to C-12; M-1 to Q-11; M-1 to S-10; M-1 to G-9;
M-1 to S-8; M-1 to P-7; M-1 to L-6; of SEQ ID NO:39. Polypeptides
encoded by these polynucleotides are also encompassed by the invention.

In addition, the invention provides nucleic acid molecules having nucleotide
sequences related to extensive portions of SEQ ID NO:21 which have been determined
from the following related cDNA genes: HSQFJ92R (SEQ ID NO:139), HFLAB18F
(SEQ ID NO:140), HAQBH82R (SEQ ID NO:141), HLHTM10R (SEQ ID NO:142),
and HLHAL65R (SEQ ID NO:143).

This gene is expressed primarily in immune system related tissues such as
ulcerative colitis, rejected kidney tissues, and to a lesser extent in thymus and bone
marrow. Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are not
limited to, immune and hematopoietic diseases and/or disorders, particularly ulcerative
colitis and rejected organs. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the immune system, expression of this gene at significantly higher or
lower levels is routinely detected in certain tissues or cell types (e.g., transplanted kidney,
immune, hematopoietic, renal, and cancerous and wounded tissues) or bodily fluids (e.g.,
lymph, serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell
sample taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

Preferred polypeptides of the present invention comprise immunogenic epitopes 
shown in SEQ ID NO:39 as residues: Gly-42 to Phe-48, Val-66 to Asp-71, Asn-78 to
Thr-83, Asp-88 to Arg-96, Tyr-127 to Tyr-135, Lys-181 to Trp-195, His-210 to Gly-215,
Leu-303 to Thr-310, and Thr-341 to Thr-350. Polynucleotides encoding said polypeptides are
also provided. 

The tissue distribution primarily in immune cells and tissues, combined with the
homology to the CRTAM, thymocyte activation and developmental protein, the class-I
MHC-restricted T cell associated molecule protein, and the poliovirus receptor, indicates
that the protein products of this gene are useful for the regulation of cellular physiology,
development, differentiation or function of various cell types, including haematopoietic
cells and particularly T-cell progenitors. Representative uses are described in the
"Immune Activity" and "infectious disease" sections below, in Examples 11, 13, 14, 16,
18, 19, 20, and 27, and elsewhere herein. The proteins can be used to develop products
for the diagnosis and treatment of conditions associated with abnormal physiology or
development, including abnormal proliferation, e.g., cancers, or degenerative conditions.
The physiology or development of a cell can be modulated by contacting the cell with an
agonist or antagonist (i.e., an anti-CRTAM-like peptide antibody). Further, uses of the
CRTAM-like polypeptides of the present invention include treatment of ulcerative colitis,
organ rejection and other immune system related disorders. Agonists or antagonists may
treat or prevent such disorders as ulcerative colitis and rejected organs, such as kidney.
Based upon the tissue distribution of this protein, antagonists directed against this
protein are useful in blocking the activity of this protein. Accordingly, preferred are
antibodies which specifically bind a portion of the translation product of this gene.

Also provided is a kit for detecting tumors in which expression of this protein
occurs. Such a kit comprises in one embodiment an antibody specific for the translation
product of this gene bound to a solid support. Also provided is a method of detecting
these tumors in an individual which comprises a step of contacting an antibody specific
for the translation product of this gene to a bodily fluid from the individual, preferably
serum, and ascertaining whether antibody binds to an antigen found in the bodily fluid.
Preferably the antibody is bound to a solid support and the bodily fluid is serum. The
above embodiments, as well as other treatments and diagnostic tests (kits and methods),
are more particularly described elsewhere herein. Furthermore, the protein may also be
used to determine biological activity, to raise antibodies, as tissue markers, to isolate
cognate ligands or receptors, to identify agents that modulate their interactions, in
addition to its use as a nutritional supplement. The protein, as well as antibodies directed
against the protein, may show utility as a tumor marker and/or immunotherapy target for
the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly available
and accessible through sequence databases. Some of these sequences are related to SEQ
ID NO:21 and may have been publicly available prior to conception of the present
invention. Preferably, such related polynucleotides are specifically excluded from the
scope of the present invention. To list every related sequence is cumbersome.
Accordingly, preferably excluded from the present invention are one or more
polynucleotides comprising a nucleotide sequence described by the general formula of
a-b, where a is any integer from 1 to 1506 of SEQ ID NO:21, b is an integer of 15 to
1520, where both a and b correspond to the positions of nucleotide residues shown in
SEQ ID NO:21, and where b is greater than or equal to a + 14.
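
The general formula a-b recited above can be illustrated with a short Python sketch. It is offered only as an illustration under stated assumptions: the names `is_excluded_range` and `count_ranges` are hypothetical, and the bounds on a and b are read as inclusive.

```python
# Illustrative sketch only. Function names are hypothetical and the
# inclusive reading of the recited bounds is an assumption.
SEQ_LEN = 1520   # highest nucleotide position recited for b in SEQ ID NO:21
MIN_SPAN = 14    # b must be greater than or equal to a + 14


def is_excluded_range(a: int, b: int) -> bool:
    """Return True if positions (a, b) satisfy the recited formula a-b."""
    return (
        1 <= a <= SEQ_LEN - MIN_SPAN   # a is an integer from 1 to 1506
        and 15 <= b <= SEQ_LEN         # b is an integer from 15 to 1520
        and b >= a + MIN_SPAN          # fragment spans at least 15 positions
    )


def count_ranges() -> int:
    """Count every (a, b) pair that satisfies the formula."""
    return sum(
        1
        for a in range(1, SEQ_LEN - MIN_SPAN + 1)
        for b in range(a + MIN_SPAN, SEQ_LEN + 1)
    )
```

Under these assumptions, the shortest range the formula describes is 15 nucleotides long (for example, a = 1 and b = 15), and the longest is the full span from position 1 to position 1520.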

FEATURES OF PROTEIN ENCODED BY GENE NO: 12 

Figure 31 shows the nucleotide (SEQ ID NO:22) and deduced amino acid 
sequence (SEQ ID NO:40) of the present invention. Predicted amino acids from about 1 
to about 23 constitute the predicted signal peptide (amino acid residues from about 1 to 
about 23 in SEQ ID NO:40) and are represented by the underlined amino acid regions. 

Figure 32 shows the regions of similarity between the amino acid sequences of 
the present invention SEQ ID NO:40 and the human FAP protein (gi|1890647) (SEQ ID 
NO: 146). 

Figure 33 shows an analysis of the amino acid sequence of SEQ ID NO:40. 

Alpha, beta, turn and coil regions; hydrophilicity and hydrophobicity;
amphipathic regions; flexible regions; antigenic index and surface probability are shown. 

The present invention provides isolated nucleic acid molecules comprising a 
polynucleotide encoding the polypeptide having the amino acid sequence shown in 
Figure 31 (SEQ ID NO:40), which was determined by sequencing a cloned cDNA. The
nucleotide sequence shown in Figure 31 (SEQ ID NO:22) was obtained by sequencing a
cloned cDNA (HPWCM76), which was deposited on Nov. 17, 1998 at the American
Type Culture Collection, and given Accession Number 203484. The deposited gene is
inserted in the pSport plasmid (Life Technologies, Rockville, MD) using the SalI/NotI
restriction endonuclease cleavage sites.

The present invention is further directed to fragments of the isolated nucleic acid
molecules described herein. By a fragment of an isolated DNA molecule having the
nucleotide sequence of the deposited cDNA or the nucleotide sequence shown in SEQ ID
NO:22 is intended DNA fragments at least about 15 nt, and more preferably at least about
20 nt, still more preferably at least about 30 nt, and even more preferably, at least about
40 nt in length which are useful as diagnostic probes and primers as discussed herein. Of
course, larger fragments 50-1500 nt in length are also useful according to the present
invention, as are fragments corresponding to most, if not all, of the nucleotide sequence
of the deposited cDNA or as shown in SEQ ID NO:22. By a fragment at least 20 nt in
length, for example, is intended fragments which include 20 or more contiguous bases
from the nucleotide sequence of the deposited cDNA or the nucleotide sequence as
shown in SEQ ID NO:22. In this context "about" includes the particularly recited size,
larger or smaller by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at both
termini. Representative examples of polynucleotide fragments of the invention include,
for example, fragments that comprise, or alternatively, consist of, a sequence from about
nucleotide 1 to about 50, from about 51 to about 100, from about 101 to about 150, from
about 151 to about 200, from about 201 to about 250, from about 251 to about 300, from
about 301 to about 350, from about 351 to about 400, from about 401 to about 450, from
about 451 to about 500, from about 501 to about 550, from about 551 to about 600,
from about 601 to about 650, from about 651 to about 700, from about 701 to about
750, from about 751 to about 800, and from about 801 to about 807 of SEQ ID NO:22, or
the complementary strand thereto, or the cDNA contained in the deposited gene. In this
context "about" includes the particularly recited ranges, larger or smaller by several
(5, 4, 3, 2, or 1) nucleotides, at either terminus or at both termini. In
additional embodiments, the polynucleotides of the invention encode functional
attributes of the corresponding protein.
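
The representative fragment ranges recited above (1 to 50, 51 to 100, and so on through 801 to 807) follow a regular pattern that can be reproduced programmatically. The following Python sketch is illustrative only: the function name `fragment_windows` is hypothetical, and the total length of 807 nucleotides is taken from the last recited range rather than from the sequence listing itself.

```python
from typing import List, Tuple


def fragment_windows(seq_len: int, step: int = 50) -> List[Tuple[int, int]]:
    """Return 1-based, inclusive (start, end) windows covering 1..seq_len.

    The final window is truncated at seq_len, mirroring the recited
    range "from about 801 to about 807".
    """
    return [
        (start, min(start + step - 1, seq_len))
        for start in range(1, seq_len + 1, step)
    ]


# Assumed length of SEQ ID NO:22, inferred from the last recited range.
windows = fragment_windows(807)
```

Each window would additionally be subject to the "about" tolerance described above, that is, each boundary may shift by up to several (5, 4, 3, 2, or 1) nucleotides at either terminus or at both termini.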

Preferred embodiments of the invention in this regard include fragments that 
comprise alpha-helix and alpha-helix forming regions ("alpha-regions"), beta-sheet and 
beta-sheet forming regions ("beta-regions"), turn and turn-forming regions ("turn- 
regions"), coil and coil-forming regions ("coil-regions"), hydrophilic regions, 
hydrophobic regions, alpha amphipathic regions, beta amphipathic regions, flexible 
regions, surface-forming regions and high antigenic index regions. The data representing
the structural or functional attributes of the protein set forth in Figure 33 and/or Table XI, 
as described above, was generated using the various modules and algorithms of the 
DNA*STAR set on default parameters. In a preferred embodiment, the data presented in 
columns VIII, IX, XIII, and XIV of Table XI can be used to determine regions of the 
protein which exhibit a high degree of potential for antigenicity. Regions of high 
antigenicity are determined from the data presented in columns VIII, IX, XIII, and/or 
XIV by choosing values which represent regions of the polypeptide which are likely to 
be exposed on the surface of the polypeptide in an environment in which antigen 
recognition may occur in the process of initiation of an immune response. 

Certain preferred regions in these regards are set out in Figure 33, but may, as
shown in Table XI, be represented or identified by using tabular representations of the
data presented in Figure 33. The DNA*STAR computer algorithm used to generate
Figure 33 (set on the original default parameters) was used to present the data in Figure
33 in a tabular format (See Table XI). The tabular format of the data in Figure 33 is used
to easily determine specific boundaries of a preferred region. The above-mentioned
preferred regions set out in Figure 33 and in Table XI include, but are not limited to,
regions of the aforementioned types identified by analysis of the amino acid sequence set
out in Figure 31. As set out in Figure 33 and in Table XI, such preferred regions include
Garnier-Robson alpha-regions, beta-regions, turn-regions, and coil-regions,
Chou-Fasman alpha-regions, beta-regions, and turn-regions, Kyte-Doolittle hydrophilic
regions and Hopp-Woods hydrophobic regions, Eisenberg alpha- and beta-amphipathic
regions, Karplus-Schulz flexible regions, Jameson-Wolf regions of high antigenic index
and Emini surface-forming regions.

Even if deletion of one or more amino acids from the N-terminus of a protein
results in modification or loss of one or more biological functions of the protein,
other functional activities (e.g., biological activities, ability to
multimerize, etc.) may still be retained. For example, the ability of shortened muteins to
induce and/or bind to antibodies which recognize the complete or mature forms of the
polypeptides generally will be retained when less than the majority of the residues of the
complete or mature polypeptide are removed from the N-terminus. Whether a particular
polypeptide lacking N-terminal residues of a complete polypeptide retains such
immunologic activities can readily be determined by routine methods described herein
and otherwise known in the art. It is not unlikely that a mutein with a large number of
deleted N-terminal amino acid residues may retain some biological or immunogenic
activities. In fact, peptides composed of as few as six amino acid residues may often
evoke an immune response.

Accordingly, the present invention further provides polypeptides having one or
more residues deleted from the amino terminus of the amino acid sequence shown in
Figure 31, up to the arginine residue at position number 61 and polynucleotides encoding
such polypeptides. In particular, the present invention provides polypeptides comprising
the amino acid sequence of residues n1-66 of Figure 31, where n1 is an integer from 2 to
61 corresponding to the position of the amino acid residue in Figure 31 (which is
identical to the sequence shown as SEQ ID NO:40). N-terminal deletions of the
polypeptide of the invention shown as SEQ ID NO:40 include polypeptides comprising
the amino acid sequence of residues:
S-2 to N-66; S-3 to N-66; S-4 to N-66; S-5 to N-66; L-6 to N-66; K-7 to N-66;
H-8 to N-66; L-9 to N-66; L-10 to N-66; C-11 to N-66; M-12 to N-66; A-13 to N-66;
L-14 to N-66; S-15 to N-66; W-16 to N-66; F-17 to N-66; S-18 to N-66; S-19 to N-66;
F-20 to N-66; I-21 to N-66; S-22 to N-66; G-23 to N-66; E-24 to N-66; T-25 to N-66;
S-26 to N-66; F-27 to N-66; S-28 to N-66; L-29 to N-66; L-30 to N-66; N-31 to N-66;
S-32 to N-66; F-33 to N-66; F-34 to N-66; L-35 to N-66; P-36 to N-66; Y-37 to N-66;
P-38 to N-66; S-39 to N-66; S-40 to N-66; R-41 to N-66; C-42 to N-66; C-43 to N-66;
C-44 to N-66; F-45 to N-66; S-46 to N-66; V-47 to N-66; Q-48 to N-66; C-49 to N-66;
S-50 to N-66; I-51 to N-66; L-52 to N-66; D-53 to N-66; P-54 to N-66; F-55 to N-66;
S-56 to N-66; C-57 to N-66; N-58 to N-66; S-59 to N-66; M-60 to N-66; R-61 to N-66;
of SEQ ID NO:40. Polypeptides encoded by these polynucleotides are
also encompassed by the invention.
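
The N-terminal deletion series recited above can be generated mechanically from the amino acid sequence. The Python sketch below is illustrative only. The 66-residue string `SEQ40` is an assumption reconstructed from the residue labels in the lists of this section (with the character "1" read as isoleucine, I), not copied from the sequence listing, and the function name `n_terminal_deletions` is hypothetical.

```python
# Assumed sequence of SEQ ID NO:40, reconstructed from the residue labels
# recited in this section; verify against the sequence listing before use.
SEQ40 = (
    "MSSSSLKHLL"  # residues 1-10
    "CMALSWFSSF"  # residues 11-20
    "ISGETSFSLL"  # residues 21-30
    "NSFFLPYPSS"  # residues 31-40
    "RCCCFSVQCS"  # residues 41-50
    "ILDPFSCNSM"  # residues 51-60
    "RFPWEN"      # residues 61-66
)


def n_terminal_deletions(seq, first=2, last=None):
    """Yield (label, fragment) for residues n1..len(seq), n1 = first..last."""
    end = len(seq)
    last = end if last is None else last
    for n1 in range(first, last + 1):
        # Positions are 1-based in the text, so residue n1 is seq[n1 - 1].
        label = f"{seq[n1 - 1]}-{n1} to {seq[-1]}-{end}"
        yield label, seq[n1 - 1:]


# n1 runs from 2 to 61, as recited for SEQ ID NO:40.
n_dels = list(n_terminal_deletions(SEQ40, 2, 61))
```

With these assumptions, the shortest fragment in the series (n1 = 61) is six residues long, consistent with the statement above that peptides of as few as six amino acid residues may often evoke an immune response.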

Also as mentioned above, even if deletion of one or more amino acids from the 
C-terminus of a protein results in modification or loss of one or more biological 
functions of the protein, other functional activities (e.g., biological activities such as 
ability to modulate the extracellular matrix, etc.) may still be retained. For example, the
ability to induce and/or bind to antibodies which recognize the complete or mature forms 
of the polypeptide generally will be retained when less than the majority of the residues 
of the complete or mature polypeptide are removed from the C-terminus. Whether a 
particular polypeptide lacking C-terminal residues of a complete polypeptide retains such 
immunologic activities can readily be determined by routine methods described herein 
and otherwise known in the art. It is not unlikely that a mutein with a large number of 
deleted C-terminal amino acid residues may retain some biological or immunogenic 
activities. In fact, peptides composed of as few as six amino acid residues may often 
evoke an immune response. 

Accordingly, the present invention further provides polypeptides having one or
more residues deleted from the carboxy terminus of the amino acid sequence of the
polypeptide shown in Figure 31, up to the leucine residue at position number 6, and
polynucleotides encoding such polypeptides. In particular, the present invention provides
polypeptides comprising the amino acid sequence of residues 1-m1 of Figure 31, where
m1 is an integer from 6 to 66 corresponding to the position of the amino acid residue in
Figure 31. Moreover, the invention provides polynucleotides encoding polypeptides
comprising, or alternatively consisting of, the amino acid sequence of C-terminal
deletions of the polypeptide of the invention shown as SEQ ID NO:40. Such C-terminal
deletions include polypeptides comprising the amino acid sequence of residues:
M-1 to E-65; M-1 to W-64; M-1 to P-63; M-1 to F-62; M-1 to R-61; M-1 to M-60;
M-1 to S-59; M-1 to N-58; M-1 to C-57; M-1 to S-56; M-1 to F-55; M-1 to P-54;
M-1 to D-53; M-1 to L-52; M-1 to I-51; M-1 to S-50; M-1 to C-49; M-1 to Q-48;
M-1 to V-47; M-1 to S-46; M-1 to F-45; M-1 to C-44; M-1 to C-43; M-1 to C-42;
M-1 to R-41; M-1 to S-40; M-1 to S-39; M-1 to P-38; M-1 to Y-37; M-1 to P-36;
M-1 to L-35; M-1 to F-34; M-1 to F-33; M-1 to S-32; M-1 to N-31; M-1 to L-30;
M-1 to L-29; M-1 to S-28; M-1 to F-27; M-1 to S-26; M-1 to T-25; M-1 to E-24;
M-1 to G-23; M-1 to S-22; M-1 to I-21; M-1 to F-20; M-1 to S-19; M-1 to S-18;
M-1 to F-17; M-1 to W-16; M-1 to S-15; M-1 to L-14; M-1 to A-13; M-1 to M-12;
M-1 to C-11; M-1 to L-10; M-1 to L-9; M-1 to H-8; M-1 to K-7; M-1 to L-6; of
SEQ ID NO:40. Polypeptides encoded by these polynucleotides are also encompassed by
the invention.
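
The C-terminal deletion series can be sketched in the same way, here truncating from the carboxy terminus while keeping residue 1 fixed. As before, this is an illustration under stated assumptions: the sequence string is reconstructed from the residue labels in this section rather than taken from the sequence listing, and the name `c_terminal_deletions` is hypothetical.

```python
# Assumed sequence of SEQ ID NO:40, reconstructed from the residue labels
# recited in this section; verify against the sequence listing before use.
SEQ40_FULL = (
    "MSSSSLKHLLCMALSWFSSFISGETSFSLL"
    "NSFFLPYPSSRCCCFSVQCSILDPFSCNSM"
    "RFPWEN"
)


def c_terminal_deletions(seq, shortest=6):
    """Yield (label, fragment) for residues 1..m1, m1 = len(seq)-1 .. shortest."""
    for m1 in range(len(seq) - 1, shortest - 1, -1):
        # Positions are 1-based in the text, so residue m1 is seq[m1 - 1].
        label = f"{seq[0]}-1 to {seq[m1 - 1]}-{m1}"
        yield label, seq[:m1]


# The recited list runs from residue 65 down to residue 6.
c_dels = list(c_terminal_deletions(SEQ40_FULL))
```

The enumeration starts at m1 = 65 because m1 = 66 corresponds to the complete polypeptide rather than to a deletion.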

In addition, the invention provides nucleic acid molecules having nucleotide 
sequences related to extensive portions of SEQ ID NO:22 which have been determined 
from the following related cDNA genes: HPWCM76R (SEQ ID NO: 147). 

This gene is expressed primarily in prostate BPH (benign prostatic hyperplasia) 
tissue. Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are not 
limited to, inflammation of the prostate, or related tissues. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the prostate, expression of this gene at 
significantly higher or lower levels is routinely detected in certain tissues or cell types 
(e.g. prostate, cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
urine, synovial fluid and spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution in prostate BPH tissue indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for the treatment of inflammatory 
conditions which result in an enlargement of the prostate, or related tissues. 
Polynucleotides and polypeptides corresponding to this gene are useful for the treatment 
and diagnosis of conditions concerning proper testicular function (e.g., endocrine 
function, sperm maturation), as well as cancer. Therefore, this gene product is useful in 
the treatment of male infertility and/or impotence. This gene product is also useful in 
assays designed to identify binding agents, as such agents (antagonists) are useful as 
male contraceptive agents. Similarly, the protein is believed to be useful in the treatment 
and/or diagnosis of testicular cancer. The testes are also a site of active gene expression 
of transcripts that are expressed, particularly at low levels, in other tissues of the body. 
Therefore, this gene product is expressed in other specific tissues or organs where it may 
play related functional roles in other processes, such as hematopoiesis, inflammation, 
bone formation, and kidney function, to name a few possible target indications. Based 
upon the tissue distribution of this protein, antagonists directed against this protein are 
useful in blocking the activity of this protein. Accordingly, preferred are antibodies 
which specifically bind a portion of the translation product of this gene. 
Also provided is a kit for detecting tumors in which expression of this protein 
occurs. Such a kit comprises in one embodiment an antibody specific for the translation 
product of this gene bound to a solid support. Also provided is a method of detecting 
these tumors in an individual which comprises a step of contacting an antibody specific 
for the translation product of this gene to a bodily fluid from the individual, preferably 
serum, and ascertaining whether antibody binds to an antigen found in the bodily fluid. 
Preferably the antibody is bound to a solid support and the bodily fluid is serum. The 
above embodiments, as well as other treatments and diagnostic tests (kits and methods), 
are more particularly described elsewhere herein. The homology to the FAP protein 
indicates that the protein product of this gene is useful in treating, detecting, and/or 
preventing iron metabolism disorders, particularly those resulting in high oxidative 
states, tissue damage, atherosclerosis, free radical damage, vascular disorders, iron binding 
protein dysfunction, nitric oxide synthase dysfunction or aberration, vasodilation 
disorders, and tissue edema. Based on the sequence similarity, the translation product of 
this gene is expected to share at least some biological activities with iron metabolism 
modulatory proteins. Such activities are known in the art, some of which are described 
elsewhere herein. Furthermore, the protein may also be used to determine biological 
activity, to raise antibodies, as tissue markers, to isolate cognate ligands or receptors, to 
identify agents that modulate their interactions, in addition to its use as a nutritional 
supplement. The protein, as well as antibodies directed against the protein, may show utility 
as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:22 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of a-b, 
where a is any integer between 1 and 793 of SEQ ID NO:22, b is an integer of 15 to 
807, where both a and b correspond to the positions of nucleotide residues shown in SEQ 
ID NO:22, and where b is greater than or equal to a + 14. 
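
The a-b formula above amounts to a simple range test on two nucleotide positions. The following Python sketch shows one way to express it. The function and constant names are hypothetical, and the bounds are assumed to be inclusive, which the text does not state explicitly.

```python
SEQ_LENGTH = 807   # upper bound on b given in the text for SEQ ID NO:22
MAX_START = 793    # upper bound on a given in the text
MIN_OFFSET = 14    # b must be at least a + 14, i.e. at least 15 nucleotides


def is_described_by_formula(a: int, b: int) -> bool:
    """True if positions (a, b) satisfy the a-b formula, bounds taken as inclusive."""
    return (
        1 <= a <= MAX_START
        and 15 <= b <= SEQ_LENGTH
        and b >= a + MIN_OFFSET
    )


def count_described_fragments() -> int:
    """Count every (a, b) pair that the formula covers."""
    return sum(
        1
        for a in range(1, MAX_START + 1)
        for b in range(15, SEQ_LENGTH + 1)
        if is_described_by_formula(a, b)
    )


if __name__ == "__main__":
    print(is_described_by_formula(1, 15))     # shortest fragment at the 5' end
    print(is_described_by_formula(793, 807))  # shortest fragment at the 3' end
    print(is_described_by_formula(794, 807))  # a is out of range
    print(count_described_fragments())
```

Under the inclusive reading, each start position a admits 794 - a end positions, so the formula covers 793 × 794 / 2 = 314,821 fragments in total.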
Res Position 

I 

II 

III 

IV 

V 

VI 

VII VIII 

IX 

X 

XI 

XII 

XIII 


Met 

1 



B 




-0.10 

0.44 




-0.40 0.48 


Val 

2 



B 




O.OS 

0.01 




0.15 0.63 


Ala 

3 



B 




0.47 

0.01 




0.40 0.76 


Gln 

4 



B 




0.51 

-0.01 




1.40 1.33 


Asp 

5 






T 

C 0.23 

-0.20 



F 

2.20 1.77 


Pro 

6 





T 

T 

0.02 

-0.27 


* 

F 

2.50 0.94 


Gln 

7 





T 

T 

0.8S 

-0.09 

* 

* 

F 

2.25 0.45 


Gly 

8 





T 

T 

0.66 

-0.09 


* 

F 

2.00 0.47 


Cys 

9 


A 

B 




-0.01 

0.60 


* 


-0.10 0.25 


Leu 

10 


A 

B 




-0.82 

0.74 




-0.35 0.08 


Gln 

11 


A 

B 




-0.91 

1.03 




-0.60 0.06 


Leu 

12 


A 

B 




-0.91 

0.99 


* 


-0.60 0.16 


Cys 

13 


A 

B 




-1.42 

0.41 




-0.60 0.34 


Leu 

14 


A 

B 




-1.34 

0.37 




-0.30 0.14 


Ser 

15 


A 

B 




-0.53 

0.47 

* 

* 


-0.60 0.18 


Glu 

16 


A 

B 




-0.88 

0.19 

* 



-0.30 0.53 


Val 

17 



B 



T 

-0.88 

0.04 


* 


0.10 0.63 


Ala 

18 

A 





T 

-0.10 

0.04 




0.10 0.39 


Asn 

19 





T 

T 

0.71 

-0.34 

* 



1.31 0.44 


Gly 

20 





T 

T 

0.80 

0.06 


* 

F 

1.07 0.96 


Leu 

21 







C -0.06 

-0.16 


* 

F 

1.63 1.47 


Arg 

22 







C 0.50 

-0.01 


* 

F 

1.69 0.68 


Asn 

23 






T 

C 0.49 

-0.03 

* 


F 

2.10 0.92 


Pro 

24 



B 



T 

-0.37 

0.16 

* 


F 

1.24 1.10 


Val 

25 



B 



T 

-0.06 

0.11 

* 

* 


0.73 0.42 


Ser 

26 



B 



T 

0.17 

0.61 

* 

* 


0.22 0.35 


Met 

27 



B 

B 



-0.29 

0.71 

* 



-0.39 0.23 


Val 

28 



B 

B 



-0.29 

0.71 




-0.60 0.31 


His 

29 



B 

B 



-0.42 

0.07 




0.00 0.38 


Ala 

30 

A 


B 




0.12 

0.11 




0.50 0.38 


Gly 

31 





T 

T 

0.39 

-0.01 

* 


F 

2.15 0.74 


Asp 

32 





T 

T 

1.10 

-0.16 

* 

* 

F 

2.45 0.74 


Gly 

33 






T 

C 1.26 

-0.66 

* 

* 

F 

3.00 1.44 


Thr 

34 






T 

C 0.59 

-0.37 

* 


F 

2.40 1.26 


His 

35 



B 

B 



0.32 

-0.01 

* 



1.20 0.65 


Arg 

36 



B 

B 



0.08 

0.63 




0.00 0.49 


Phe 

37 



B 

B 



0.08 

0.70 




-0.30 0.34 


Phe 

38 



B 

B 



0.42 

0.21 


* 


-0.30 0.44 
-0.10 


* 


0.50 0.24 

Cys 

247 



B 



T 


-0.53 

0.11 


* 


0.40 0.09 

Gly 

248 



B 



T 


-0.19 

0.54 




0.40 0.13 

Asp 

249 





T 

T 


-0.30 

0.26 



F 

1.55 0.26 

Val 

250 






T 

C 

0.70 

-0.13 



F 

2.25 0.77 

Gly 

251 






T 

C 

0.70 

-0.70 


* 

F 

3.00 1.53 

Gln 

252 






T 

c 

1.37 

-0.34 

* 


F 

2.25 0.80 

Asn 

253 






T 

c 

1.71 

-0.34 

* 


F 

2.10 1.86 

Arg 

254 



B 



T 


0.86 

-0.99 

* 


F 

1.90 3.25 

Phe 

255 

A 

A 






1.71 

-0.77 



F 

1.20 1.39 

Glu 

256 

A 

A 






1.24 

-1.17 


* 

F 

0.90 1.45 

Glu 

257 

A 

A 






0.36 

-0.89 




0.60 0.61 

Val 

258 

A 

A 


B 




-0.46 

-0.20 

* 

* 


0.30 0.49 

Asp 

259 

A 

A 


B 




-0.52 

-0.30 


* 


0.30 0.23 

Leu 

260 

A 

A 


B 




-0.17 

-0.30 

* 

* 


0.30 0.27 

Ile 

261 

A 

A 


B 




-0.51 

0.13 

* 

* 


-0.30 0.36 

Leu 

262 

A 





T 


-0.51 

-0.09 

* 

* 


0.70 0.21 

Lys 

263 





T 

T 


0.10 

0.31 


* 

F 

0.65 0.42 

Gly 

264 





T 

T 


-0.24 

0.39 

* 


F 

0.65 0.93 

Gly 

265 





T 

T 


0.28 

0.13 

* 

* 

F 

0.80 1.12 

Asn 

266 






T 

c 

1.28 

0.36 

* 

* 

F 

0.45 0.59 

Tyr 

267 






T 

c 

1.50 

0.36 

* 

* 


0.45 1.17 

Gly 

268 






T 

c 

1.50 

0.43 

* 

* 


0.15 1.19 

Trp 

269 



B 



T 


1.84 

0.00 

* 

* 


0.85 1.48 

Arg 

270 


A 

B 





1.84 

-0.40 

* 

* 


0.45 1.63 

Ala 

271 


A 

B 





1.14 

-0.73 

* 

* 

F 

0.90 1.63 

Lys 

272 

A 

A 






o.so 

-0.37 

* 

* 

F 

0.60 1.35 

Glu 

273 

A 

A 






0.48 

-0.79 

* 


F 

0.75 0.69 

Gly 

274 

A 

A 






0.52 

-0.21 


* 

F 

0.45 0.37 

Phe 

275 

A 

A 






0.41 

0.04 

* 

* 


-0.30 0.29 

Ala 

276 

A 

A 






1.04 

0.04 

* 



-0.30 0.28 

Cys 

277 

A 





T 


1.04 

0.04 

* 



0.10 0.56 

Tyr 

278 

A 





T 


0.23 

-0.39 

* 



0.85 1.30 

Asp 

279 

A 





T 


-0.09 

-0.49 

* 


F 

1.00 1.06 

Lys 

280 

A 





T 


0.58 

-0.41 

* 


F 

1.00 1.06 

Lys 

281 

A 

A 






1.17 

-0.49 



F 

0.45 0.92 

Leu 

282 

A 

A 






1.24 

-0.84 




0.60 0.89 

Cys 

283 

A 

A 






1.19 

-0.34 


* 


0.30 0.45 

His 

284 


A 

B 





0.38 

0.04 




-0.30 0.30 


Asn 

285 


B 



T 

0.33 

0.73 


* 


-0.20 0.30 


Ala 

286 

A 




T 

0.29 

0.04 


* 


0.10 0.94 


Ser 

287 

A 




T 

0.24 

-0.53 




1.15 1.15 


Leu 

288 

A 




T 

0.10 

-0.39 



F 

0.85 0.53 


Asp 

289 


B 

B 



-0.08 

-0.10 



F 

0.45 0.43 


Asp 

290 


B 

B 



-0.97 

-0.17 



F 

0.45 0.50 


Val 

291 


B 

B 



-0.62 

0.13 




-0.30 0.42 


Leu 

292 


B 

B 



-0.91 

0.20 




-0.30 0.40 


Pro 

293 


B 

B 



-0.34 

0.70 




-0.60 0.24 


Ile 

294 


B 

B 



-0.69 

1.46 




-0.60 0.51 


Tyr 

295 


B 



T 

-0.72 

1.24 




-0.20 0.61 


Ala 

296 


B 



T 

-0.46 

1.06 




-0.20 0.54 


Tyr 

297 


B 



T 

-0.50 

1.13 




-0.20 0.77 


Gly 

298 


B 



T 

-0.63 

1.09 

* 



-0.20 0.37 


His 

299 


B 

B 



0.30 

0.76 

* 



-0.60 0.36 


Ala 

300 


B 

B 



0.24 

0.26 




-0.30 0.46 


Val 

301 


B 

B 



-0.02 

-0.11 

* 



0.30 0.62 


Gly 

302 


B 



T 

-0.09 

0.10 

* 


F 

0.25 0.34 


Lys 

303 


B 



T 

-0.09 

0.09 

* 


F 

0.25 0.48 


Ser 

304 


B 



T 

-0.40 

0.01 



F 

0.25 0.64 


Val 

305 


B 



T 

-0.06 

-0.20 



F 

0.85 0.64 


Thr 

306 


B 



T 

-0.06 

0.13 



F 

0.25 0.50 


Gly 

307 


B 



T 

0.04 

0.77 



F 

-0.05 0.28 


Gly 

308 


B 



T 

0.11 

1.14 



F 

-0.05 0.S9 


Tyr 

309 


B 



T 

0.07 

0.50 




-0.20 0.80 


Val 

310 


B 




0.26 

0.44 

* 



-0.40 0.80 


Tyr 

311 


B 



T 

0.57 

0.59 

* 



-0.20 0.43 


Arg 

312 


B 



T 

0.61 

0.16 

* 



0.10 0.48 


Gly 

313 


B 



T 

0.74 

-0.21 

* 


F 

1.13 0.87 


Cys 

314 


B 



T 

0.99 

-0.43 

* 

* 

F 

1.41 0.85 


Glu 

315 


B 




1.03 

-0.79 

* 

* 

F 

1.79 0.70 


Ser 

316 





T 

C 1.28 

-0.10 


* 

F 

2.17 0.58 


Pro 

317 




T 

T 

0.82 

-0.13 



F 

2.80 1.75 


Asn 

318 




T 

T 

0.36 

-0.27 



F 

2.52 1.00 


Leu 

319 




T 

T 

0.78 

0.41 



F 

1.19 0.62 


Asn 

320 



B 

T 


-0.11 

0.79 




0.36 0.63 


Gly 

321 


B 

B 



-0.51 

1.04 




-0.32 0.27 


Leu 

322 


B 

B 



-0.64 

1.43 




-0.60 0.29 


Tyr 

323 


B 

B 



-0.64 

1.17 


* 


-0.60 0.18 


Ile 

324 


B 

B 



-0.53 

0.77 




-0.60 0.30 


Phe 

325 


B 

B 



-1.13 

1.13 




-0.60 0.31 


Gly 

326 



B 

B 


Asp 

327 



B 



Phe 

328 



B 



Met 

329 

A 





Ser 

330 






Gly 

331 

A 





Arg 

332 

A 





Leu 

333 

A 

A 




Met 

334 

A 

A 




Ala 

335 

A 

A 




Leu 

336 

A 

A 




Gln 

337 

A 

A 




Glu 

338 

A 

A 




Asp 

339 

A 

A 




Arg 

340 

A 

A 




Lys 

341 

A 

A 




Asn 

342 

A 





Lys 

343 

A 





Lys 

344 

A 





Trp 

345 

A 





Lys 

346 

A 

A 




Lys 

347 

A 

A 




Gln 

348 


A 

B 



Asp 

349 


A 

B 



Leu 

350 


A 

B 



Cys 

351 


A 

B 



Leu 

352 


A 

B 



Gly 

353 


A 


T 


Ser 

354 




T 


Thr 

355 




T 


Thr 

356 



B 



Ser 

357 



B 



Cys 

358 



B 



Ala 

359 



B 



Phe 

360 



B 



Pro 

361 



B 



Gly 

362 




T 


Leu 

363 



B 



Ile 

364 



B 



Ser 

365 

A 





Thr 

366 

A 






-1.09 

1.06 

* 



-0.60 0.20 


-0.62 

0.69 


* 


-0.40 0.38 


-0.27 

0.43 


* 


-0.40 0.43 

T 

-0.19 

-0.36 


* 


0.70 0.85 

T 

C -0.09 

-0.10 


* 

F 

1.05 0.42 

T 

-0.33 

0.51 



F 

-0.05 0.48 

T 

-1.14 

0.23 




0.10 0.49 


-0.44 

0.30 




-0.30 0.30 


0.16 

0.31 


* 


-0.30 0.53 


0.46 

-0.11 




0.30 0.47 


0.91 

-0.11 


* 


0.30 0.95 


0.84 

-0.80 




0.75 1.87 


1.66 

-1.41 



F 

0.90 3.71 


2.30 

-1.51 

* 


F 

0.90 7.23 


2.93 

-2.20 



F 

0.90 8.35 


3.46 

-2.60 



F 

0.90 9.64 

T 

3.50 

-1.69 

* 


F 

1.30 6.07 

T 

3.54 

-1.69 

* 


F 

1.30 6.20 

T 

3.54 

-1.69 

* 


F 

1.30 6.20 

T 

3.43 

-1.29 

* 


F 

1.30 6.68 


2.58 

-1.69 



F 

0.90 5.58 


1.91 

-1.00 



F 

0.90 2.30 


1.06 

-0.43 



F 

0.60 1.17 


0.67 

-0.66 



F 

0.75 0.48 


0.66 

-0.23 



F 

0.45 0.24 


0.30 

0.16 



F 

-0.15 0.19 


-0.06 

0.24 



F 

-0.15 0.16 


-0.36 

0.73 



F 

-0.05 0.28 

T 

-1.02 

0.43 



F 

0.35 0.70 

T 

-0.80 

0.43 



F 

0.35 0.45 

T 

-0.83 

0.24 



F 

0.25 0.46 

T 

-0.23 

0.60 



F 

-0.05 0.30 


-0.23 

0.64 




-0.40 0.32 


-0.74 

0.59 




-0.40 0.22 

T 

-1.32 

0.79 




-0.20 0.14 

T 

-1.31 

1.09 




-0.20 0.18 

T 

-1.32 

0.90 




0.20 0.24 

T 

-0.69 

0.89 




-0.20 0.39 


-0.40 

0.60 

* 



-0.40 0.35 

T 

0.34 

0.56 

* 

* 


-0.20 0.47 

T 

-0.14 

0.13 

* 


F 

0.40 1.13 


SUBSTITUTE SHEET (RULE 26) 



His 

367 

A 





T 


-0.69 

0.23 

* 


F 

0.40 1.40 


Ser 

368 



B 



T 


-0.77 

0.23 


* 

F 

0.25 0.73 


Lys 

369 



B 

B 




-0.18 

0.53 

* 

* 


-0.60 0.36 


Phe 

370 



B 

B 




-0.58 

0.43 

* 

* 


-0.60 0.35 


Ile 

371 



B 

B 




-0.86 

0.71 

* 

* 


-0.60 0.23 


Ile 

372 



B 

B 




-0.82 

0.83 

* 

* 


-0.60 0.11 


Ser 

373 


A 

B 





-0.52 

0.83 

* 



-0.60 0.23 


Phe 

374 

A 

A 






-0.57 

0.04 

* 

* 


-0.30 0.54 


Ala 

375 

A 

A 






-0.46 

-0.64 




0.75 1.35 


Glu 

376 

A 

A 






0.09 

-0.83 

* 



0.75 1.01 


Asp 

377 

A 

A 






0.98 

-0.79 

* 


F 

0.90 1.16 


Glu 

378 

A 

A 






0.47 

-1.57 



F 

0.90 1.99 


Ala 

379 

A 

A 






0.92 

-1.39 



F 

0.75 0.95 


Gly 

380 

A 

A 






0.81 

-0.63 



F 

0.75 0.89 


Glu 

381 

A 

A 


B 




0.00 

0.16 




-0.30 0.44 


Leu 

382 

A 

A 


B 




-0.59 

0.84 




-0.60 0.36 


Tyr 

383 

A 

A 


B 




-0.90 

0.84 




-0.60 0.37 


Phe 

384 

A 

A 


B 




-0.61 

0.90 




-0.60 0.31 


Leu 

385 


A 

B 

B 




-0.51 

1.29 




-0.60 0.50 


Ala 

386 


A 

B 

B 




-0.72 

1.36 




-0.60 0.50 


Thr 

387 


A 

B 

B 




-0.21 

1.03 




-0.60 0.89 


Ser 

388 


A 





C 

-0.56 

0.63 



F 

-0.10 1.45 


Tyr 

389 






T 

C 

-0.10 

0.44 



F 

0.30 1.45 


Pro 

390 





T 

T 


0.12 

0.70 



F 

0.50 1.58 


Ser 

391 





T 

T 


0.50 

0.71 




0.35 1.19 


Ala 

392 



B 



T 


0.92 

0.76 




0.08 1.17 


Tyr 

393 



B 





0.88 

0.00 




0.91 1.49 


Ala 

394 



B 



T 


0.82 

0.00 


* 


1.24 1.10 


Pro 

395 



B 



T 


0.14 

0.00 



F 

1.52 1.46 


Arg 

396 





T 

T 


0.20 

0.19 



F 

1.30 0.65 


Gly 

397 



B 



T 


0.83 

0.19 



F 

0.92 1.01 


Ser 

398 



B 

B 




0.38 

-0.31 



F 

0.99 1.31 


Ile 

399 



B 

B 




0.11 

0.04 

* 



-0.04 0.58 


Tyr 

400 



B 

B 




0.32 

0.69 


* 


-0.47 0.43 


Lys 

401 



B 

B 




0.00 

0.26 

* 

* 


-0.30 0.54 


Phe 

402 



B 

B 




0.04 

0.30 

* 



0.19 1.19 


Val 

403 



B 

B 




0.46 

0.00 

* 


F 

1.28 1.02 


Asp 

404 



B 



T 


1.46 

-0.76 

* 


F 

2.17 1.00 


Pro 

405 



B 



T 


1.11 

-0.76 

* 


F 

2.66 2.26 


Ser 

406 





T 

T 


0.86 

-1.04 

* 


F 

3.40 3.07 


Arg 

407 





T 

T 


1.34 

-1.26 



F 

3.06 2.85 


Arg 

408 





T 

Ala 

409 






Pro 

410 






Pro 

411 





T 

Gly 

412 





T 

Lys 

413 





T 

Cys 

414 





T 

Lys 

415 



B 



Tyr 

416 



B 



Lys 

417 



B 



Pro 

418 



B 

B 


Val 

419 



B 

B 


Pro 

420 



B 

B 


Val 

421 



B 

B 


Arg 

422 



B 

B 


Thr 

423 



B 



Lys 

424 





T 

Ser 

425 





T 

Lys 

426 





T 

Arg 

427 



B 



Ile 

428 



B 



Pro 

429 



B 



Phe 

430 



B 



Arg 

431 


A 

B 



Pro 

432 


A 

B 



Leu 

433 

A 

A 




Ala 

434 

A 

A 




Lys 

435 

A 

A 




Thr 

436 

A 

A 




Val 

437 

A 

A 




Leu 

438 

A 

A 




Asp 

439 

A 

A 




Leu 

440 

A 

A 




Leu 

441 

A 

A 




Lys 

442 

A 

A 




Glu 

443 

A 

A 




Gin 

444 

A 

A 




Ser 

445 

A 

A 




Glu 

446 

A 

A 




Lys 

447 

A 

A 




Ala 

448 

A 

A 








1.86 

-0.83 



F 

2.86 2.85 


C 1.90 

-0.83 



F 

2.66 2.10 

T 

C 1.44 

-1.21 

* 


F 

2.86 2.15 

T 

1.79 

-0.64 

* 

* 

F 

2.91 0.59 

T 

1.43 

-0.64 


* 

F 

3.40 1.16 

T 

1.37 

-0.39 


* 

F 

2.76 1.18 

T 

1.74 

-0.81 


* 

F 

2.72 1.52 

T 

1.10 

-0.81 


* 

F 

1.98 2.38 

T 

1.10 

-0.60 


* 

F 

1.49 0.88 

T 

0.59 

-0.17 


* 

F 

1.00 2.55 


0.66 

-0.10 


* 

F 

0.45 0.95 


1.01 

-0.10 


* 

F 

0.60 1.18 


1.01 

-0.37 


* 

F 

0.79 0.85 


0.96 

-0.37 


* 

F 

1.28 1.10 


0.96 

-0.41 


* 

F 

1.62 1.99 

T 

1.2S 

-1.06 

* 

* 

F 

2.66 2.58 

T 

1.24 

-1.49 

* 

* 

F 

3.40 6.80 

T 

1.24 

-1.44 

* 

* 

F 

3.06 2.43 

T 

1.40 

-1.01 


* 

F 

2.72 2.61 


1.40 

-0.71 


* 

F 

1.78 1.13 


1.50 

-0.71 

* 


F 

1.44 1.65 


0.64 

-0.67 

* 

* 


0.95 1.28 


0.36 

0.01 

* 

* 


-0.10 0.54 


0.36 

0.51 

* 

* 


-0.60 0.77 


-0.07 

-0.17 

* 

* 

F 

0.60 1.00 


-0.03 

-0.11 

* 

* 

F 

0.60 1.67 


-0.63 

-0.26 


* 

F 

0.45 0.63 


0.07 

0.43 

* 


F 

-0.45 0.34 


-0.86 

0.00 


* 


0.30 0.68 


-1.46 

0.00 

* 



0.30 0.56 


-0.60 

0.19 

* 



-0.30 0.23 


-0.01 

0.19 

* 



-0.30 0.32 


-0.06 

-0.30 

* 



0.30 0.74 


-0.04 

-0.54 



F 

0.90 1.56 


0.81 

-0.84 

* 


F 

0.90 1.25 


1.67 

-0.84 

* 


F 

0.90 2.63 


1.08 

-1.53 

* 


F 

0.90 6.38 


1.30 

-1.71 



F 

0.90 3.22 


2.22 

-1.21 



F 

0.90 1.88 


2.22 

-1.21 

* 


F 

0.90 2.13 


1.92 

-1.61 

* 


F 

0.90 3.17 


Ala 

449 

A 

A 





1.62 

-1.61 


F 

0.90 2.46 


Arg 

450 

A 

A 





1.62 

-1.23 


F 

0.90 1.65 


Lys 

451 

A 

A 





1.03 

-0.84 


F 

0.90 2.18 


Ser 

452 

A 




T 


0.68 

-0.84 


F 

1.30 2.18 


Ser 

453 

A 




T 


0.46 

-0.86 


F 

1.30 1.61 


Ser 

454 



B 


T 


0.46 

-0.17 


F 

0.85 0.66 


Ala 

455 



B 


T 


0.04 

0.33 


F 

0.25 0.50 


Thr 

456 



B 




-0.34 

0.33 



-0.10 0.50 


Leu 

457 



B 




-0.26 

0.37 



-0.10 0.37 


Ala 

458 



B 


T 


-0.54 

0.41 


F 

-0.05 0.57 


Ser 

459 



B 


T 


-0.24 

0.41 


F 

-0.05 0.40 


Gly 

460 





T 

C 

0.00 

0.33 


F 

0.45 0.83 


Pro 

461 





T 

C 

-0.50 

0.07 


F 

0.45 0.81 


Ala 

462 






C 

0.01 

0.26 


F 

0.25 0.50 


Gln 

463 

A 






0.60 

0.26 


F 

0.05 0.68 


Gly 

464 



B 




0.94 

-0.17 


F 

0.65 0.76 


Leu 

465 



B 




0.94 

-0.60 


F 

1.10 1.50 


Ser 

466 

A 






0.86 

-0.67 


F 

0.95 0.86 


Glu 

467 

A 






1.14 

-0.69 


F 

1.10 1.16 


Lys 

468 

A 






1.19 

-0.73 


F 

1.10 1.89 


Gly 

469 

A 



T 

T 


1.58 

-1.41 


F 

1.70 2.82 


Ser 

470 

A 




T 


1.58 

-1.80 


F 

1.30 3.26 


Ser 

471 

A 




T 


1.29 

-1.11 


F 

1.30 1.34 


Lys 

472 

A 




T 


0.99 

-0.61 


F 

1.30 1.37 


Lys 

473 



B 




0.73 

-0.66 


F 

1.10 1.37 


Leu 

474 



B 




0.77 

-0.61 


F 

1.10 1.58 


Ala 

475 



B 




0.77 

-0.51 


F 

1.40 1.14 


Ser 

476 



B 


T 


0.77 

-0.13 


F 

1.45 0.77 


Pro 

477 



B 


T 


0.77 

0.26 


F 

1.30 1.24 


Thr 

478 




T 

T 


0.72 

-0.43 


F 

2.60 2.46 


Ser 

479 





T 

C 

1.22 

-0.53 


F 

3.00 2.95 


Ser 

480 




T 

T 


1.00 

-0.43 

* * 

F 

2.60 2.76 


Lys 

481 



B 


T 


1.41 

-0.17 

* * 

F 

1.90 1.58 


Asn 

482 



B 


T 


1.28 

-0.66 

* * 

F 

1.90 2.30 


Thr 

483 



B 


T 


1.38 

-0.61 

* * 

F 

1.94 1.70 


Leu 

484 



B 




1.33 

-0.57 

* * 

F 

1.78 1.32 


Arg 

485 



B 




1.32 

-0.14 

* * 

F 

1.67 0.81 


Gly 

486 



B 


T 


1.32 

-0.06 

* 

F 

2.21 0.81 


Pro 

487 




T 

T 


1.37 

-0.54 

* 

F 

3.40 1.96 


Gly 

488 




T 

T 


1.72 

-1.23 

* 

F 

3.06 2.00 


Thr 

489 





T 

c 

1.94 

-1.23 

* 

F 

2.52 4.05 


Lys 

490 


A 

B 




1.94 

-1.16 


* 

F 

1.58 2.65 

Lys 

491 


A 

B 




1.43 

-1.59 

* 

* 

F 

1.24 5.24 

Lys 

492 


A 

B 




1.30 

-1.37 

* 

* 

F 

0.90 2.69 

Ala 

493 


A 

B 




1.43 

-1.43 

* 


F 

0.90 1.33 

Arg 

494 


A 

B 




1.71 

-1.00 

* 

* 

F 

0.90 1.03 

Val 

495 


A 

B 




0.81 

-0.50 

* 

* 

F 

0.75 0.70 

Gly 

496 



B 


T 


0.88 

0.14 


* 

F 

0.25 0.52 

Pro 

497 



B 


T 


0.83 

-0.36 

* 

* 

F 

0.85 0.52 

His 

498 



B 


T 


1.08 

0.04 

* 

* 

F 

0.74 1.20 

Val 

499 



B 


T 


1.01 

-0.17 

* 

* 

F 

1.68 1.20 

Arg 

500 



B 


T 


1.98 

-0.60 

* 

* 

F 

2.32 1.55 

Gln 

501 



B 


T 


2.43 

-1.03 

* 


F 

2.66 2.24 

Gly 

502 




T 

T 


2.69 

-1.53 

* 


F 

3.40 5.90 

Lys 

503 

A 




T 


2.42 

-2.17 



F 

2.66 6.03 

Arg 

504 

A 






2.47 

-1.79 


* 

F 

2.12 4.66 

Arg 

505 



B 




2.40 

-1.50 



F 

1.78 3.89 

Lys 

506 



B 




2.10 

-1.93 


* 

F 

1.44 3.89 

Ser 

507 



B 




2.41 

-1.54 



F 

1.44 2.66 

Leu 

508 



B 




2.07 

-1.04 



F 

1.78 1.85 

Lys 

509 



B 




1.61 

-0.66 



F 

2.12 1.24 

Ser 

510 







1.61 

-0.23 

* 

* 

F 

2.21 0.91 

His 

511 




T 

T 


0.97 

-0.61 



F 

3.40 2.17 

Ser 

512 





T 

c 

1.38 

-0.69 



F 

2.86 1.07 

Gly 

513 




T 

T 


1.98 

-0.69 


* 

F 

3.02 1.57 

Arg 

514 




T 

T 


1.63 

-0.64 


* 

F 

2.98 1.78 

Met 

515 






c 

1.34 

-0.76 


* 

F 

2.54 1.78 

Arg 

516 





T 

c 

1.38 

-0.64 


* 

F 

2.70 1.82 

Pro 

517 





T 

c 

1.68 

-1.07 


* 

F 

3.00 1.61 

Ser 

518 

A 




T 


2.07 

-0.67 


* 

F 

2.50 2.82 

Ala 

519 

A 




T 


2.07 

-1.29 


* 

F 

2.20 2.88 

Glu 

520 

A 

A 





2.08 

-1.29 

* 

* 

F 

1.50 3.65 

Gln 

521 

A 






1.62 

-1.21 

* 

* 

F 

1.51 2.75 

Lys 

522 

A 

A 





1.94 

-1.17 



F 

1.52 2.69 

Arg 

523 

A 

A 





1.94 

-1.67 

* 


F 

1.83 3.04 

Ala 

524 


A 


T 



1.72 

-1.29 

* 


F 

2.54 2.36 

Gly 

525 




T 

T 


1.51 

-1.00 

* 


F 

3.10 0.97 

Arg 

526 




T 

T 


1.12 

-0.57 

* 


F 

2.79 0.77 

Ser 

527 





T 

c 

0.69 

-0.14 

* 


F 

1.98 0.97 

Leu 

528 





T 

c 

0.19 

-0.21 

* 



1.67 1.25 

Pro 

529 



B 




0.39 

-0.21 

* 



0.81 0.82 

Ter 

530 




T 



0.34 

0.21 

* 



0.30 0.78 

Table II 


Res Position 

II 

III 

IV V 

VI 

VII 

VIII 

IX 

X 

XI 

XII 

XIII 

XIV 

Met 

1 





C 

0.69 

-0.24 


* 


1.19 1.74 

Arg 

2 





c 

0.38 

-0.24 


* 


1.53 1.34 

Pro 

3 




T 

c 

0.38 

0.11 




1.32 0.91 

Pro 

4 



T 

T 


1.27 

-0.31 




2.61 1.80 

Gly 

5 



T 

T 


0.96 

-0.53 



F 

3.40 1.48 

Phe 

6 



T 

T 


0.74 

0.26 



F 

2.01 0.83 

Arg 

7 

A 

B 




-0.18 

0.51 




0.42 0.44 

Asn 

8 

A 

B 




-0.78 

0.77 

* 



0.08 0.37 

Phe 

9 

A 

B 




-1.16 

1.03 

* 



-0.26 

0.35 

Leu 

10 

A 

B 




-1.11 

0.74 

* 



-0.60 

0.18 

Leu 

11 

A 




c 

-0.71 

1.13 

* 



-0.40 

0.15 

Leu 

12 

A 




c 

-1.63 

1.11 

* 



-0.40 

0.23 

Ala 

13 

A 




c 

-2.44 

1.01 




-0.40 

0.23 

Ser 

14 

A 




c 

-2.44 

1.01 




-0.40 

0.23 

Ser 

15 

A 




c 

-2.22 

1.11 




-0.40 

0.24 

Leu 

16 

A 

B 




-1.76 

0.93 




-0.60 

0.24 

Leu 

17 

A 

B 




-1.76 

0.86 




-0.60 

0.18 

Phe 

18 

A 

B 




-1.47 

1.16 




-0.60 

0.11 

Ala 

19 

A 




c 

-1.76 

1.16 




-0.40 

0.18 

Gly 

20 

A 




c 

-2.31 

0.97 




-0.40 

0.22 

Leu 

21 

A 




c 

-1.71 

0.93 

* 



-0.40 

0.19 

Ser 

22 

A 




c 

-0.90 

0.57 

* 



-0.40 

0.29 

Ala 

23 

A 




c 

-0.50 

0.47 

* 



-0.40 

0.51 

Val 

24 

A 




c 

-0.61 

0.43 

* 



-0.40 

0.83 

Pro 

25 




T 

c 

-0.57 

0.53 



F 

0.15 0.53 

Gin 

26 



T 

T 


0.03 

0.53 



F 

0.35 0.71 

Ser 

27 



T 

T 


0.03 

0.46 

* 


F 

0.50 1.48 

Phe 

28 




T 

c 

-0.19 

0.20 

* 

* 

F 

0.60 1.28 

Ser 

29 




T 

c 

0.78 

0.46 


* 

F 

0.15 0.61 

Pro 

30 




T 

c 

0.69 

0.06 

* 


F 

0.45 0.89 

Ser 

31 



T 

T 


0.40 

0.06 

* 

* 

F 

0.80 1.38 

Leu 

32 



T 

T 


0.49 

0.19 

* 

* 

F 

0.80 

.08 

Arg 

33 



T 



0.84 

0.23 

* 

* 

F 

0.60 

.08 

Ser 

34 



T 



0.56 

0.23 

* 


F 

0.45 0.80 

Trp 

35 




T 

c 

0.18 

0.34 

* 


F 

0.45 0.98 

Pro 

36 




T 

c 

-0.19 

0.16 

* 

* 

F 

0.45 0.50 

Gly 

37 



T 

T 


0.73 

0.73 


* 


0.20 0.20 


Ala 

38 




T 

T 


-0.19 

0.34 

* 

* 


0.50 0.38 


Ala 

39 






c 

-0.19 

0.11 

* 



0.10 0.20 


Cys 

40 


B 


T 



0.21 

0.07 

* 



0.30 0.27 


Arg 

41 

A 

B 





-0.17 

-0.36 

* 



0.30 0.53 


Leu 

42 

A 





c 

0.18 

-0.36 

* 



0.50 0.53 


Ser 

43 

A 





c 

0.47 

-0.86 

* 

* 


0.95 1.70 


Arg 

44 

A 





C 

1.06 

-1.04 

* 


F 

1.10 1.16 


Ala 

45 

A 





C 

1.83 

-1.04 


* 

F 

1.10 2.44 


Glu 

46 

A 



T 



1.83 

-1.73 


* 

F 

1.30 3.57 


Ser 

47 

A 



T 



1.98 

-2.11 


* 

F 

1.30 3.57 


Glu 

48 

A 



T 



2.39 

-1.54 


* 

F 

1.30 1.89 


Arg 

49 

A 



T 



1.69 

-2.04 



F 



Arg 

50 

A 



T 



2.07 

-1.54 

* 

* 

F 

1.58 1.62 


Cys 

51 

A 



T 



1.72 

-1.50 






Arg 

52 

A 



T 



2.02 

-1.07 



F 

1.99 0.73 


Ala 

53 





T 

c 

1.81 

-0.67 

* 

* 

F 

2.47 0.64 


Pro 

54 




T 

T 


1.49 

-0.24 

* 

* 

F 

ISA 1 
Z.OU l.Ou 


Gly 

55 




T 

T 


1.03 

-0.39 



F 

2.52 1.47 


Gln 

56 





T 

C 

1.11 

0.04 

* 


F 

1.44 1.44 


Pro 

57 





T 

C 

0.41 

0.04 

* 

* 

F 

1.01 0.94 


Pro 

58 








Oil 




0.93 0.96 


Gly 

59 




T 

T 


-0.27 

0.37 



F 

ft fis n Ad 


Ala 

60 





T 


0.04 

0,54 




-fi ?n ft i ft 


Ala 

61 


3 





-0.30 

0.61 


* 


-0.40 0.14 


Leu 

62 
63 




T 



0.02 
-0.11 

0.61 
0.19 


* 
* 


-0.40 0.14 
0.61 0.27 


His 

64 




T 

T 


0.34 

0.11 




1.12 0.26 


Gly 

65 




T 

T 


0.27 

-0.39 


* 

F 

2.18 0.63 


Arg 

66 




T 

T 


0.86 

-0.50 


* 

F 

2.49 0.63 


Gly 

67 




T 

T 


1.00 

-1.07 


* 

F 

3.10 0.77 


Arg 

68 




T 



1.32 

-1.00 



F 

2.59 0.42 


Cys 

69 




T 

T 


0.50 

-1.00 




i n f\ 1 1 


Asp 

70 




T 

T 


0.18 

-0.36 


* 


1.72 0.16 


Cys 

71 




T 

T 


-0.82 

-0.21 


* 


1.41 0.04 


Gly 

72 




T 

T 


-1.14 

0.47 


* 


0.20 0.06 


Val 

73 



B 

T 



-1.29 

0.47 


* 


-0.20 0.02 


Cys 

74 


B 

B 




-1.48 

0.97 




-0.60 0.05 


Ile 

75 


B 

B 




-1.79 

1.04 




-0.60 0.03 


Cys 

76 


B 

B 




-1.12 

1.10 




-0.60 0.07 


His 
Val 

77 
78 


B 

B 
B 

T 



-0.99 
-0.48 

0.46 
0.31 




-0.60 0.22 
0.10 0.48 




WO 00/29435 


PCT/US99/25031 



Thr 

79 


B 


C 

-0.41 

0.06 



F 

0.05 0.89 


Glu 

80 



T 

C 

-0.22 

0.10 

* 


F 

0.45 0.64 


Pro 

81 


T 

T 


-0.26 

0.39 



F 

0.65 0.75 


Gly 

82 


T 

T 


-0.57 

0.53 




0.20 0.45 


Met 

83 


T 

T 


0.08 

0.47 

* 



0.20 0.26 


Phe 

84 


T 



-0.42 

0.90 




0.00 0.26 


Phe 

85 


T 



-1.09 

1.16 




0.00 0.21 


Gly 

86 



T 

C 

-0.88 

1.30 




0.00 0.12 


Pro 

87 



T 

c 

-1.20 

0.69 




0.00 0.23 


Leu 

88 


T 

T 


-0.63 

0.47 

* 



0.20 0.14 


Cys 

89 


T 

T 


0.07 

0.19 

* 



0.50 0.20 


Glu 

90 

A 

T 



0.48 

-0.24 

* 



0.70 0.22 


Cys 

91 

A 

T 



-0.03 

0.24 




0.10 0.28 


His 

92 

A 

T 



-0.49 

0.20 

* 



0.10 0.39 


Glu 

93 

A 

T 



0.32 

0.20 

* 



0.10 0.12 


Trp 

94 

A 

T 



0.68 

0.20 

* 



0.10 0.39 


Val 

95 

A 

T 



0.43 

0.11 

* 



0.10 0.42 


Cys 

96 

A 

T 



1.10 

0.37 

* 



0.38 0.38 


Glu 

97 

A 

T 



0.79 

0.37 

* 



0.66 0.60 


Thr 

98 

A 

T 



0.49 

-0.11 

* 


F 

1.69 0.80 


Tyr 

99 


T 

T 


0.47 

-0.37 



F 

2.52 1.99 


Asp 

100 


T 

T 


0.66 

-0.46 



F 

2.80 1.66 


Gly 

101 


T 

T 


0.73 

0.11 



F 

1.77 0.62 


Ser 

102 


T 

T 


0.39 

0.13 



F 

1.49 0.40 


Thr 

103 


T 



0.67 

-0.20 



F 

1.61 0.24 


Cys 

104 


T 

T 


0.57 

0.30 


* 


0.78 0.32 


Ala 

105 


T 

T 


0.61 

0.30 


* 


0.50 0.24 


Gly 

106 


T 

T 


0.29 

-0.09 


* 


1.10 0.33 


His 

107 


T 

T 


0.59 

0.00 


* 


0.50 0.33 


Gly 

108 


T 



0.23 

-0.57 


* 

F 

1.35 0.55 


Lys 

109 


T 



0.56 

-0.50 


* 

F 

1.36 0.30 


Cys 

110 


T 

T 


1.19 

-0.50 




1.72 0.22 


Asp 

111 


T 

T 


0.87 

-1.00 




2.33 0.44 


Cys 

112 


T 

T 


0.94 

-0.86 




2.64 0.12 


Gly 

113 


T 

T 


0.62 

-0.86 


* 

F 

3.10 0.44 


Lys 

114 


T 



0.58 

-0.86 


* 

F 

2.59 0.14 


Cys 

115 


T 



1.24 

-0.86 


* 

F 

2.28 0.44 


Lys 

116 


T 



0.90 

-1.03 


* 

F 

1.97 0.76 


Cys 

117 


T 



1.28 

-1.03 



F 

1.94 0.38 


Asp 

118 


T 

T 


1.38 

-0.11 


* 

F 

1.81 0.74 


Gln 

119 


T 

T 


0.99 

0.07 


* 

F 

1.49 0.58 


Gly 

120 



T 

T 

1.66 

0.50 


* 

F 

1.62 1.07 


Trp 

121 



T 

T 

1.02 

-0.07 


* 

F 

2.80 1.07 


Tyr 

122 



T 


1.02 

0.43 




1.12 0.62 


Gly 

123 



T 


1.02 

0.60 




0.84 0.34 


Asp 
Ala 

124 
125 



T 
T 


0.78 

0.91 

0.57 
0.41 




0.56 0.56 
0.28 0.56 


Cys 

126 



T 


0.89 

0.09 

* 



0.30 0.87 


Gln 

127 



T 


1.13 

0.14 

* 



0.30 0.75 


Tyr 

128 





C 0.81 

0.54 




-0.05 1.20 


Pro 

129 



T 

T 

0.81 

0.61 


* 

F 

0.50 1.20 


Thr 

130 



T 

T 

0.59 

0.04 


* 

F 

0.80 1.15 


Asn 

131 



T 

T 

0.94 

0.33 


* 

F 

0.65 0.61 


Cys 

132 



T 

T 

0.99 

0.06 


* 


0.50 0.57 


Asp 

133 



T 


1.28 

-0.37 




0.90 0.79 


Leu 

134 



T 


1.53 

-0.86 



F 

1.69 0.98 


Thr 

135 



T 


1.54 

-1.26 
* 


1.35 1.49 

T 

1.34 

-0.11 


* 

F 

1.25 0.41 

T 

0.57 

-0.11 

* 


F 

1.25 0.95 

T 

0.46 

-0.19 

* 

* 

F 

1.59 0.33 

T 

1.31 

-0.94 

* 


F 

2.23 0.55 


1.67 

-1.37 

* 



2.22 0.46 

T 

2.12 

-1.37 

* 



2.76 0.62 

T 

1.41 

-1.86 

* 


F 

3.40 2.28 

T 

1.72 

-1.86 

* 


F 

2.91 0.70 


Arg 

366 



T 

T 


2.39 

-1.86 



F 

3.03 2.28 

Arg 

367 



T 



1.58 

-2.43 



F 

2.80 2.28 

Cys 

368 



T 



2.24 

-1.74 



F 

2.77 1.12 

Glu 

369 



T 



1.90 

-1.74 



F 

2.59 0.96 

Asp 

370 



T 

T 


1.04 

-1.31 

* 


F 

3.10 0.48

Leu 

371 



T 

T 


0.08 

-0.67 

* 


F 

2.79 0.67 

Asp 

372 



T 

T 


-0.70 

-0.60 



F 

2.48 0.29 

Gly 

373 



T 

T 


-0.38 

-0.03 

* 



1.72 0.09

Val 

374 


B 




-0.72 

0.40 

* 




Val 

375 


B 




-0.76 

0.14 




-U. IU \J.\J f 

Cys 

376 



T 

T 


-0.29 

0.64 




0.20 0.09 

Gly 

377 



T 

T 


-0.60 

0.64 





Gly 

378 



T 

T 


-0.92 

0.49 



p 

0.35 0.23 

His 

379 



T 

T 


-0.37 

0.41 



p 

0.35 0.23

Gly 

380 



T 



-0.18 

0.23 



p 

0.45 0.32 

Thr 

381 



T 



0.14 

0.37 

* 


p 


Cys 

382 



T 

T 


0.60 

0.37 

* 



U.JU U. \ £. 

Ser 

383 



T 

T 


0.28 

-0.13 

* 



i .iO U.ZJ 

Cys 

384 



T 

T 


-0.54 

0.01 

* 



n s/i n no 

Gly 

385 



T 

T 


-0.87 

0.17 

* 




Arg 

386 



T 



-0.56 

0.17 

* 




Cys 

387 


B 

T 



0.22 

-0.21 

* 



1 ,OKf U. lO 

Val 

388 


B 




0.18 

-0.79 

* 



1.52 0.32

Cys 

389 


B 




0.56 

-0.79 

* 




Glu 

390 



T 

T 


0.20 

0.13 

* 




Arg 

391 



T 

T 


-0.26 

0.34 




0.68 0.38 

Gly 

392 



T 

T 


0.46 

0.13 




0.50 0.69 

Trp 

393 



T 

T 


0.50 

-0.44 




1.10 0.80 

Phe 

394 



T



0.50 

0.24 

* 



0.30 0.34 

Gly 

395 



T 



0.50 

0.81 

* 



0.00 0.18 

Lys 

396 



T 



0.36 

0.79 

* 



0.00 0.30 

Leu 

397 



T 



0.49 

0.37 


* 


f| (LA n A~J 

Cys 

398 



T 



0.89 

0.01 

* 

* 


0.98 0.74 

Gln

399 



T 



1.63 

-0.41 

* 



1.92 0.72 

His 

400 




T 

C 

1.31 

-0.41 

* 

* 


2.41 1.75 

Pro 

401 



T 

T 


1.27 

-0.53 

* 


F 

3.40 1.75 

Arg 

402 



T 

T 


1.48

-0.70 

* 


F 

3.06 1.63 

Lys 

403 



T 

T 


1.83 

-0.49 



F 

2.42 1.18 

Cys 

404 



T 



1.83 

-0.50 

* 



2.03 1.11 

Asn 

405 

A 




C 

1.87 

-0.93 




1.14 0.98

Met 

406 

A 




c 

2.08 

-0.93 

* 


F 

0.95 0.85 
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229 



Thr 

407 


A 





C 

1.67 

-0.53 

* 


F 

1.44 2.73 


Glu 

408 

A 

A 






1.67 

-0.71



F 

1.58 2.28 


Glu 

409 

A 

A 






2.33 

-1.11 

* 


F 

1.92 4.61 


Gln

410 


A 



T 



1.52 

-1.33 

* 


F 

2.66 5.13 


Ser 

411 





T 

T 


1.46 

-1.13 

* 


F 

3.40 2.44 


Lys 

412 





T 

T 


1.77 

-0.56 

* 


F 

2.91 0.76 


Asn 

413 






T 

C 

1.47 

-0.56 

* 


F 

2.62 0.76 


Leu 

414 






T 

C 

0.8S 

-0.57 

* 



2.38 0.76 


Cys 

415 





T 



0.88 

-0.46 

* 



1.99 0.38 


Glu 

416 





T 



0.83 

-0.46 

* 


F 

2.05 0.40 


Ser 

417 





T 

T 


-0.10 

-0.43 

* 


F 

2.50 0.48 


Ala 

418 





T 

T 


-0.91 

-0.43 

* 


F 

2.25 0.62 


Asp 

419 





T 

T 


-0.77 

-0.31 

* 


F 

2.00 0.30 


Gly 

420 





T 

T 


-0.40 

0.26 

* 



1.00 0.12 


Ile

421 



B 





-0.74 

0.26 

* 



0.15 0.16 


Leu 

422 



B 





-0.40 

0.19 

* 

* 


0.15 0.09 


Cys 

423 





T 

T 


-0.16 

0.19 

* 

* 


1.00 0.19 


Ser 

424 





T 

T 


-0.46 

0.19 

* 

* 

F 

1.40 0.27 


Gly 

425 





T 

T 


-0.78 

-0.11 


* 

F 

2.25 0.43 


Lys 

426 





T 

T 


0.08 

-0.23 


* 

F 

2.50 0.43 


Gly 

427 





T 



0.22 

-0.30 


* 

F 

2.05 0.44 


Ser 

428 





T 



0.54 

-0.11 

* 

* 

F 

1.80 0.24 


Cys 

429 





T 



0.89 

-0.11 

* 

* 


1.40 0.12 


His 

430 





T 

T 


0.57 

-0.11 

* 



1.35 0.24 


Cys 

431 





T 

T 


-0.37 

0.03 

* 



0.50 0.10 


Gly 

432 





T 

T 


-0.69 

0.33 

* 



0.50 0.12 


Lys 

433 





T 

T 


-0.69 

0.33 

* 



0.50 0.05 


Cys 

434 





T 



-0.61 

0.21 

* 



0.30 0.12


Ile

435 


A 



T 



-0.58 

0.14 

* 



0.10 0.13 


Cys 

436 


A 

B 





0.09 

-0.29 

* 



0.30 0.11 


Ser 

437 


A 





c 

0.14 

-0.29 

* 



0.50 0.35 


Ala 

438 


A 





c 

-0.14 

0.06 

* 



-0.10 0.52 


Glu 

439 


A 



T 



-0.37 

0.13 




0.25 1.53 


Glu 

440 


A 


B 

T 



0.22 

0.24 




0.10 0.80 


Trp 

441 


A 


B 

T 



0.54 

0.24 


* 


0.25 1.06 


Tyr 

442 


A 


B 

T 



0.84 

0.17 




0.10 0.61 


Ile

443 




B 

T 



0.73 

0.17 




0.10 0.61 


Ser 

444 




B 

T 



0.07 

0.96 


* 


-0.20 0.50 


Gly 

445 





T 



0.07 

0.61 


* 

F 

0.15 0.17 


Glu 

446 





T 



-0.31 

-0.14 


* 


0.90 0.41


Phe 

447 





T 



-0.07 

-0.26 




1.24 0.16 
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Cys 

448 



T 

Asp 

449 


* 

T 

Cys 

450 



T 

Asp 

451 



T 


452 



T 

Arg 

453 



T 


454 



T 


455 



T 


456 



T 

Lys

457 



T 

His 

458 



T 


459 



T 

Gly 

460 



T 

Leu 

461 


B 


Ile

462 


B 


Cys 

463 



T 

Thr 

464 



T 

Gly

465 



T 


466 



T 

Gly

467 



T 

Ile

468 



T 

Cys 

469 



T 

Ser 

470 



T 

Cys 

471 



T 

Gly 

472 



T 


473 



T 


474 



T 

Glu

475 



T 

Cys 

476 



T 

Trp

477 



T 


478 



T 

Gly

479 



T 

Trp 

480 



T 

Asn 

481 




Gly 

482 




Asn 

483 




Ala 

484 




Cys 

485 

A 


T 

Glu 

486 

A 

B 


Ile

487 

A 


T 

Trp 

488 

A 


T 
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230 


T 


0.82 

-0.64 

* * 


2.08 0.28 

T 


1.23 

-1.07 

* 


2.42 0.27 

T 


1.58 

-1.07 

* 


2.76 0.60 

T 


0.91 

-1.86 

* 

p 

3.40 1.87 

T 


1.61 

-1.86 

* 

F 

2.91 0.60 

T 


2.32 

-1.86 

* * 

F 

2.72 1.87

T 


2.29

-2.43

* * 

p 

0 T> O OA 

T 


2.96 

-1.93






2.61 

-1.93 


F 

2.52 1.56 



1.80 

-1.50 



2.71 0.92

T 


0.80 

-0.81 




T 


0.13

-0.70



z.yi u.ou 




-0.13







0.36



0.78 0.17 







U.1U 







U.-ZU If- 1 o 



-1.10 

0.69 


F 

U.jj U.ZU 







ft ie n Oft 

T 


-0.91 

0.57 


F 

U.Jj U.zU 










-0.37 

0.47 



r\ /\/\ a t A - 

U.UU U.IU 

T 


-0.06 

0.47 



U.42 U.Uo 







A f^l A 1 ft 

T 


-0.38 

0.61 



A Of. ft ftfi 

U.OO U.Uo 







1.98 0.24

T 


-0.10

-0.07



2.20 0.10 

T 


0.57 

0.46 



1.08 0.19 

T 


0.52 

-0.11



1.76 0.32 

T 


0.90 

-0.11



1.54 0.20

T 


1.24 

0.40 



0.42 0.39 

T 


0.90 

0.23 



0.50 0.36 

T 


1.57 

0.66 


F 

0.35 0.67 

T 


0.98 

0.49 


F 

0.50 1.02 

T 

C 

0.98 

0.07 


F 

0.45 0.62 

T 

C 

1.27 

0.64 

* 

F 

0.15 0.33 

T 

C 

0.38 

0.21 

* 


0.30 0.55 

T 

C 

0.43 

-0.01 



0.90 0.24 



-0.09 

0.50 



-0.20 0.25 



-0.43 

0.76 



-0.60 0.13 



-0.39 

0.79 



-0.20 0.13 



-0.39 

0.67 



-0.20 0.32 
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231 


Leu 

489 

A 



C 

-0.04 

0.10 



-0.10 0.32 

Gly 

490 


T 

T 


0.41

0.86 


F 

0.56 0.72 

Ser 

491 



T 

C 

0.02 

0.60 


F 

0.72 1.05 

Glu 

492 



T 

C 

0.52 

0.11 


F 

1.23 1.63 

Tyr 

493 



T 

C 

0.42 

-0.14 



1.89 2.11

Pro 

494 


T 



0.84 

-0.14 



2.10 2.01 

Ter 

495 


T 



0.80 

-0.10 



1.89 1.48 
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Table III 


Res Position 

I II 

III 

IV 

V 

VI 

VII 

VIII 

IX 

X 

XI 

XII 

XIII XIV

Met 

1 

A A 






0.10 

-0.19 




0.30 0.92 

Glu 

2

A A 






-0.32 

-0.11 

* 

* 


0.30 0.72 

Thr 

3 

A A 






0.18 

0.14 

* 



-0.30 0.47 

Gly 

4 

A A 






0.68 

-0.29 

* 



0.30 0.93 

Ala 

5 

A A 






0.86 

-0.90 

* 


p 

0.90 1.05 

Leu 

6 

A A 






1.46 

-0.47 



F 

0.60 1.12 

Arg 

7 

A 

B 





0.64 

-0.56 



F 

0.90 1.96 

Arg 

8 


B 





0.14 

-0.30 

* 


F 

0.80 1.60 

Pro 

9 

A 

B 





0.28 

-0.11 



F 

0.60 1.60 

Gln

10 

A 

B 





0.06 

-0.37 



F 

0.60 1.26 

Leu 

11 

A 

B 





0.06 

0.31 

* 



-0.30 0.53 

Leu 

12 

A 

B 





-0.87 

1.00




m fin n oh 

Pro 

13 

A 

B 





-1.79 

1.26 




-0.60 0.14 

Leu 

14 

A 

B 





-2.39 

1.54 




-O fid oid. 

Leu 

15 

A 

B 





-3.06 

1.54 




-0.60 0.14

Leu 

16 

A 

B 





-2.59 

1.43 




-0.60 0.05 

Leu 

17 

A 

B 





-2.12 

1.43 




-0.60 0.06 

Leu 

18 

A 

B 





-2.58 

1.17 




-0.60 0.07 

Cys 

19 


B 



T 


-1.98 

1.06 

* 

* 


-0.20 0.04 

Gly 

20 




T 

T 


-1.06 

0.80 

* 

* 


0.20 0.08 

Gly 

21 




T 

T 


-0.83 

0.11 


* 

F 

0.65 0.20 

Cys 

22 


B 



T


-0.37 

-0.07 


* 

F 

0.85 0.37 

Pro 

23 


B 





0.10 

-0.21 


* 

F 

0.96 0.37 

Arg 

24 




T

T 


0.10 

-0.21 



F 

1.87 0.37 

Ala 

25 




T 

T 


0.44 

-0.07


* 

F 

2.18 0.37 

Gly 

26 




T 

T 


0.79 

-0.24 


* 

F 

2.49 0.38 

Gly 

27 




T 

T 


1.14 

-0.67 


* 

F 

3.10 0.34

Cys 

28 




T 



1.01 

-0.19 



F 

2.29 0.48 

Asn 

29 





T 

C 

0.30 

-0.26 



F 

1.98 0.48 

Glu 

30 


B 



T 


0.08 

-0.07 



F 

1.47 0.48 

Thr 

31 


B 



T 


0.42 

0.19 

* 


F 

0.56 0.74 

Gly 

32 


B 



T 


0.88 

-0.39 

* 


F 

0.85 0.80 

Met 

33 

A A 






0.73 

-0.79 

* 



0.60 0.91 

Leu 

34 

A A 






0.52 

-0.10 


* 


0.30 0.52 

Glu 

35 

A A 






-0.29 

-0.16 


* 


0.30 0.81 

Arg 

36 

A A 






-0.64 

0.10 


* 


-0.30 0.67 
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Leu 

37 

A 

A 





Pro 

38 

A 

A 





Leu 

39 

A

A 





Cys 

40 

A 

A 





Gly

41 

A 

A 





Lys 

42 

A 

A 





Ala 

43 

A 

A 





Phe 

44 

A 

A 





Ala 

45 

A 

A 






46 

A 

A 





Met 

47 

A 

A 





Met 

48 

A 

A 





Gly 

49 

A 

A 





Lys 

50 

A 



B 



Val 

51 

A 






Asp 

52 

A 



g 



Val 

53 

A 



B 



Trp

54 

A 



B 



Lys 

55 

A 



g 



Trp 

56 

A 






Cys 

57 

A 



g 



Asn 

58 







Leu 

59 




B 



Ser 

60 




B 



Glu 

61 




B 



Phe 

62 



B 

g 



Ile

63 



B 

B 



Val 

64 



B 

B 



Tyr 

65 



B 

B 



Tyr 

66 



u 

g 



Glu 

67 




B 

T 


Ser 

68 




B 

T 


Phe 

69 





T 


Thr 

70 





T 


Asn 

71 







Cys 

72 

A 






Thr 

73 

A 

A 





Glu 

74 

A 

A 





Met 

75 

A 

A 





Glu 

76 

A 

A 





Ala 

77 

A 

A 









-0.64 

0.06 

* 

* 


ft 1ft A AA 



0.00 

-0.20 

* 

* 


0.30 0.25



0.22 

-0.20 

* 

* 


ft in n 'ic 
yj.jyj U.ZO 



-0.48 

0.30 

* 






-1.18 

0.40 




f\ 1A A 1 O 



-0.37 

0.47 

* 



ft An ATi 



-0.76

-0.21




0.30 0.67 



-0.54





0.30 0.67 



-0.22 





-0.30 0.33 








-0.60 0.32 








0.30 0.75




-0.20 




0.30 0.55 








0.60 0.55








0.30 0.41 



0.04

0.24




-0.30 0.44 








0.30 0.89 








-0.30 0.47 




0.69




-0.60 0.34 








-0.60 0.32 



0.33 

1.13 




-0.60 0.36 



0.33 

0.87 




-0.60 0.46 


C 

0.49 

-0.04 




0.50 0.40 


C 

-0.11

0.74 




-0.40 0.33 




0.51 




-0.40 0.43 



-0.97 

0.59 




-0.60 0.20 








-0.60 0.38 



-0.54

1.01




-0.60 0.44 



-0.03

0.63




n &n ft a a 
-U.OU 0.44 



-0.43 

1.01




-U.OU U.Oo 



-0.74

1.01




A Af\ A OA 



-0.04 

0.81 




-0.05 1.63 



0.18 

0.57 




-0.05 1.68

T 


0.72 

0.39 

* 



0.50 0.57 

T 


0.97 

0.11 

* 


F 

0.65 0.48 

T 

c 

0.61

0.11



F 

0.45 0.62 

T 


0.61 

0.34 


* 

F 

0.25 0.71 



0.32 

-0.44 


* 

F 

0.45 0.85 



1.02 

-0.43 




0.30 0.53 



0.48 

-0.43 


* 


0.45 1.60 



-0.38 

-0.36 




0.30 0.82 



-0.06 

-0.20 


* 


0.30 0.35 
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Asn 

78 


Val 

79 


Val 

80


Gly 

81 


Cys 

82 


Tyr 

83 


Trp

84 


Pro 

85 


Asn 

86 


Pro 

87 


Leu 

88 


Ala 

89 


Gln

90 


Gly 

91 


Phe 

92 


Ile

93 


Thr 

94 


Gly 

95 


Ile

96 


His 

97 


Arg 

98 


Gln

99 


Phe 

100 


Phe 

101 


Ser 

102 


Asn 

103 


Cys 

104 


Thr 

105 


Val 

106 


Asp 

107 


Arg 

108 


Val 

109 


His 

110 


Leu 

111 


Glu 

112 


Asp 

113 


Pro 

114 


Pro 

115 


Asp 

116 


Glu 

117 


Val 

118 
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234 


A 

A 





-0.41 

0.23 




-0.30 0.35 


A 

B 




-0.66 

0.19 


* 


-0.30 0.11 


A 

B 




-0.34 

0.94 


* 


-0.60 0.17 


A 

B 




-0.56 

1.36 


* 


-0.60 0.11 





T 


0.03

1.39 

* 



0.00 0.23 





T 


-0.18 

1.14 

* 



0.00 0.50 



B 



T 

-0.13 

0.93 




-0.20 0.78 






T 

C 0.13 

1.19 

* 


F 

0.30 1.20 






T 

C 0.48 

1.11 

* 


F 

0.15 0.77 






T 

C 0.80 

0.76 

* 


F 

0.30 1.27 







C 0.34 

0.27 

* 


F 

0.25 0.81 







C -0.26 

0.63 

* 


F 

-0.05 0.44 



B 

B 



-0.36 

0.91 

* 



-0.60 0.20 



B 

B 



-0.70 

0.97 

* 



-0.60 0.35 



B 

B 



-1.38 

0.71 

* 



-0.60 0.34 



B 

B 



-0.60 

0.90 




-0.60 0.14 



B 

B 



0.10 

1.00 

* 



-0.60 0.19 



B 

B 



0.10 

0.57 

* 



-0.60 0.43 



B 

B 



-0.26 

0.19 




-0.15 1.06 



B 

B 



-0.26 

0.29 

* 



-0.30 0.64 




B 

T 


0.33 

0.59 

* 



-0.20 0.56 




B 

T 


0.64 

0.54 

* 



-0.05 1.06 




B 

T 


0.32 

0.26 

* 



0.25 1.26 





T 

T 

0.90 

0.33 

* 



0.50 0.34 





T 

T 

0.08 

0.81 




0.20 0.29 





T 

T 

-0.03 

1.06 




0.20 0.25 





T 

T 

0.08 

0.27 




0.50 0.47 





T 


-0.08

-0.51




1.20 0.69 


A 



T 


0.59 

-0.26 


* 


0.70 0.32 


A 

B 




0.08 

-0.16 


* 


0.30 0.81 


A 

B 




0.08

-0.04 




0.30 0.46 


A 

B 




0.74 

-0.53 


* 


0.75 1.08 


A 

B 




0.84 

-1.17 


* 


0.75 1.08 


A 





C 1.49 

-0.74 


* 


0.80 0.85 


A 





C 1.49 

-0.31 


* 

F 

0.80 1.78


A 





C 1.38 

-0.96 

* 

* 

F 

1.10 2.18 






T 

C 1.38 

-1.46 

* 

* 

F 

1.50 4.58 





T 

T 

0.60 

-1.50 



F 

1.70 1.96 

A 





T 

0.52 

-0.81 

* 


F 

1.15 0.97 

A 





T 

0.31 

-0.13 

* 

* 


0.70 0.44

A 



B 



-0.50 

-0.13 

* 



0.30 0.44 
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Leu 

119 


B 

B 



-1.18 

0.13 




-0.30 

0.22 

Ile

120 


B 

B 



-1.82 

0.81 




-0.60 

0.09 

Pro 

121 


B 

B 



-2.71 

1.46 




-0.60 

0.09 

Leu 

122 


B 

B 



-2.92 

1.50 




-0.60 

0.07 

Ile

123 


B 

B 



-2.92 

1.24 




-0.60 

0.16 

Val 

124 


B 

B 



-2.97 

1.20 




-0.60 

0.08 

Ile

125 


B 

B 



-2.89 

1.41 




-0.60 

0.07 

Pro 

126 


B 

B 



-2.99 

1.41 


* 


-0.60 

0.08 

Val 

127 


B 

B 



-3.03 

1.21 




-0.60 

0.16 

Val

128 


B 

B 



-2.73 

1.21 


* 


-0.60 

0.17 

Leu 

129 


B 

B 



-2.48 

1.03 




-0.60 

0.11 

Thr 

130 


B 

B 



-2.18 

1.21 


* 


-0.60 

0.15 

Val 

131 


B 

B 



-2.31 

1.07 




-0.60 

0.20 

Ala 

132 

A 


B 



-2.27 

0.86 




-0.60 

0.25 

Met 

133 

A 


B 



-2.27 

0.86 




-0.60 

0.14 

Ala 

134 

A 


B 



-2.31 

1.01 




-0.60 

0.14 

Gly 

135 

A 


B 



-2.29 

1.01 

if 

* 


-0.60 

0.10 


136 

A 


B 



-1.32 

1.43 

* 

* 


-0.60 

0.11 

Val 

137 

A 


B 



-1.03 

0.81 


* 


-0.60 

0.21 

Val 

138 

A 


B 



-0.39 

0.70 

* 



-0.26 

0.29 

Trp

139 


B 

B 



0.31

0.27

* 



0.38 0.70 

Arg 

140 


B 

B 



0.34 

-0.41 



F 

1.62 1.84 

Ser 

141 


B 



T 

1.16 

-0.57 



F 

2.66 3.58 

Lys 

142 




T 

T 

1.70 

-1.21 


* 

F 

3.40 5.68 

Arg 

143 




T 

T 

1.74 

-1.64 

* 


F 

3.06 4.19 

Thr 

144 




T 

T 

1.22 

-0.96 


* 

F 

2.72 2.58 

Asp 

145 

A 


T 


0.72 

-0.66 



F 

1.98 1.06 

Thr 

146 

A B 




0.63 

-0.23 

* 


F 

0.79 0.69 

Leu 

147 

A B 




0.20 

0.20 

*



-0.30 

0.61 

Leu 

148 

A B 




-0.30 

0.14 


* 


-0.30 

0.47 

Ter 

149 

A B 




-0.38 

0.57 




-0.60 

0.42 
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Res Position 

I 

II 





VII 

VIII 

IX 

X XI XII 

XIII 

XIV


Met 

1 


A 






-0.76 

0.27 



-0.30 

0.49 


Arg 

2 

A 







-1.07 

0.34 

* * 


-0.30 

0.39 



3 








-1.49 

0.70 



-0.60 

0.26 


Leu 

4 








-1.40 

0.96 



-0.60 

0.22 


Ala 

5 


^ 






-1.82 

0.73 



-0.60 

0.15 


Phe 

6 








-2.03 

1.41



-0.60 

0.15 



7 

A 

A 






-2.73 

1.41 



-0.60 

0.15 


Ser 

8


A 






-2.73 

1.23 



-0.60 

0.15 



9 


A 






-2.78 

1.41 



-0.60 

0.14 


Leu 

10 








-3.00 

1.27 



-0.60 

0.13 


Ala 

11








-2.30 

1.27 



-0.60 

0.08 



12 

A 

A 






-1.49 

1.29



-0.60 

0.17 


Val 

13 

A 

A 






-1.50 

0.60 



-0.60 

0.35 



14 








-1.03 

0.40 



-0.02 

0.50 


Gln

15 



B 





-0.53 

0.33 


F 

0.41 0.60 


Glu 

16 






T 


-0.53 

0.13 


F 

1.24 1.17 


Thr 

17 






T 


-0.02 

-0.01 



2.12 1.43 


Gly 

18 





T 

T 


0.02 

-0.31 


F 

2.80 1.11


Thr 

19 






T 


0.62 

-0.03 



1.97 0.53


Ala 

20 








0.73 

0.40 


p 

0.89 0.57 


Ser 

21 








0.78 

-0.09 


F 

1.56 1.12


Leu 

22 







c 

1.09 

-0.51 


F 

1.38 1.55 


Pro 

23 

A 

A 






1.54 

-1.00

* 

F 

0.90 2.66 


Arg 

24 

A 

A 






1.90 

-1.50 

* * 

F 

0.90 3.88 


Lys 

25 

A 

A 






2.60 

-1.89 


F 

0.90 9.42 


Glu 

26 

A 

A 






3.01 

-2.57 


F 

0.90 1 

.93 


Arg 

27 

A 

A 






3.82 

-3.00 


F 

0.90 11.93


Lys 

28 

A 

A 






4.03 

-3.00 


F 

0.90 10.33 


Arg 

29 

A 

A 






3.92 

-3.00 


F 

0.90 10.33 


Arg 

30 

A 

A 






3.28 

-2.60 


F 

0.90 9.13 


Glu 

31 

A 

A 






3.07 

-1.99 


F 

0.90 4.52 


Glu 

32 

A 

A 






3.07

-1.56 


F 

1.24 3.57 


Gln

33 

A 

A 






3.02 

-1.56 

* 

F 

1.58 3.57 


Met

34 


A 





c 

2.57 

-1.56 

* 

F 

2.12 3.57 


Pro 

35 

A 





T 


2.46 

-1.13 

* 

F 

2.66 2.04 


Arg 

36 





T 

T 


2.16 

-1.13 

* 

F 

3.40 1.97 
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Glu 

37 







38 






Asp 

39 






Ser 

40 






Phe 

41 


B 





42 


B 




Val 

43 


B 





44 


B 





45 

A 





Leu 

46

A 





Arg 


A 





Asn 

48 


B 


T 


Asp 

49 


B 




Val 

50 


B 




Leu 

51 


B 





52 


B 




Pro 

53 






Asp 

54 




T 


Asn 

55 





ICS 

Tyr 

56 


B 

B 



Gly 

57 


B 

B 



Glu 

58 


B 

B 



Val 

59 


B 

B 



Ile

60 


B 

B 



Asp 

61 


B 

B 




62 


B 




Ser 

63 






Asn 

64 






Tyr 

65 

A 





Glu 

66 

A 





Glu 


A 





Leu 

68 


B 




Thr 

69 




T 


Asp 

70 




T 


Tyr 

71 




T 


Gly 

72 




T 


Asp 

73 

A 





Gln

74 

A 





Leu 

75 


B 




Pro 

76 


B 




Glu 

77 


B 

B 
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T 


1.46 

-1.14 



F 

2.66 2.66 

T 


1.46 

-0.97 



F 

2.72 1.49 



1.20 

-1.40 



F 

1.78 1.32 



0.60 

-0.76 



F 

1.29 0.57 



0.28 

-0.07 




0.50 0.47 



-0.53 

-0.07 




0.50 0.44 



-0.08 

0.61 




-0.40 0.27



-0.08 

0.23 




-0.10 0.61 



0.22 

-0.16 


* 


0.50 0.56 

T 


0.07 

-0.16 




0.85 1.27 

T 


-0.74 

-0.16 



F 

1.00 1.14 

T 


0.11 

-0.16 


* 

F 

1.25 0.61 

T 


0.71 

-0.19 


* 

F 

1.00 1.19 



0.92 

-0.44 


* 

F 

0.65 0.94 



1.73 

-0.44 


* 

F 

0.95 0.97 

T 


1.38 

-0.44 



F 

1.45 0.94 

T 

C 

1.03 

0.31 



F 

1.50 1.98 

T 


1.03 

0.10 



F 

2.00 2.37 

T 


1.03 

-0.59 



F 

3.00 2.56 



0.96 

-0.34 



F 

1.80 1.23 



0.96 

-0.09 




1.20 0.52 



0.36 

-0.09

* 



0.90 0.53 



0.06 

0.20 




0.00 0.28 



0.06 

-0J7 




0.30 0.38 



0.06 

-0.20 




0.30 0.35 

T 


0.40 

0.56 




-0.20 0.75 

T 

C 

0.40 

-0.09 




1.05 1.85 

T 

C 

0.44 

-0.77 



F 

1.50 1.91 

T 


1.02 

-0.09 



F 

1.34 1.91 



1.02 

-0.29 



F 

1.48 2.06 



1.59 

-0.67 

* 


F 

2.12 2.14 



1.54 

-0.31 

* 


F 

2.16 2.14

T 


1.54 

-0.64 

* 


F 

3.40 1.22 

T 


1.79 

-0.64 

* 


F 

3.06 1.18 

T 


0.98 

-0.24 

* 


F 

2.42 2.48 

T 


0.77 

-0.24 

* 


F 

2.08 1.42 



1.58 

-0.30 



F 

1.23 1.31 



1.03 

-0.30 

* 

* 

F 

0.98 1.45



1.08 

-0.41 


* 

F 

1.07 1.09 



0.47 

-0.84 



F 

1.46 1.30 



0.50 

-0.20 


* 

F 

0.90 0.56 
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Val 

78 

Lys 

79 

Val 

80 

Thr 

81 

Ser 

82 

Leu 

83 

Ala 

84 

Pro 

85 

Ala 

86 

Thr 

87 

Ser 

88

Ile

89 

Ser 

90 

Pro 

91 

Ala 

92 

Lys 

93 

Ser 

94 

Thr 

95 

Thr 

96 

Ala 

97 

Pro 

98 

Gly 

99 

Thr 

100 

Pro 

101 

Ser 

102 

Ser 

103 

Asn 

104 

Pro 

105 

Thr 

106 

Met 

107 

Thr 

108 

Arg 

109 

Pro 

110 

Thr 

111 

Thr 

112 

Ala 

113 

Gly 

114 

Leu 

115 

Leu 

116 

Leu 

117 

Ser 

118 


B B 



0.20 

-0.11 

* 

F 

0.81 0.98 

B B 



-0.61 

-0.41 

* 

F 

0.72 0.85 

B B 



-0.39 

-0.16 

* 

F 

0.63 0.40 

B B 



-0.39 

0.34 

* 

F 

-0.06 0.55 

B 



-0.98 

0.13 

* 

F 

0.05 0.42 

B 



-0.43 

0.63 

* 


-0.40 0.58 

B 



-0.78 

0.47 

* 


-0.40 0.58 




-0.81 

0.37 



-0.10 0.58 

B B 



-o.so 

0.67 


F 

-0.45 0.49 

B B 



-0.71 

0.37 


F 

-0.15 0.65 

B B 



-0.49 

0.30 


F 

0.13 0.65 

B B 



0.14 

0.37 


F 

0.41 0.65 

B 

T 


0.06 

-0.13 


F 

1.69 0.90 


T 


0.33 

-0.23 


F 

2.17 0.90 


T T 


0.33 

-0.13 


F 

2.80 1.86 

B 

T 


0.04 

-0.33 


F 

2.12 2.00 

B 



0.72 

-0.21 


F 

1.64 1.31 

B 



0.68 

-0.21 


F 

1.36 2.00 

B 



0.58 

-0.29 


F 

0.93 0.99 

B 



0.96 

0.20 


F 

0.20 1.07 

B 



0.61 

0.24 


F 

0.48 1.14 




0.61 

0.14 


F 

1.16 1.06 


T 

C 

0.92 

0.04 


F 

1.44 1.41 


T 

C 

1.02 

-0.06 


F 

2.32 1.46 


T T 


1.30 

-0.06 


F 

2.80 2.28 


T 

c 

0.91 

0.00 


F 

2.32 2.28 


T 

c 

0.94 

0.13 


F 

1.44 1.46 


T 

c 

1.37 

0.19 


F 

1.36 1.57 


T T


1.37 

-0.20 


F 

2.08 2.30 

B 

T 


1.36 

-0.16


F 

1.60 2.21 

B 



1.34 

-0.07


F 

1.60 2.07 

B 

T 


0.76 

-0.01 


F 

2.00 2.07 

B 

T 


0.62 

0.00 * 


F 

1.80 2.11 

B 

T 


0.12 

-0.19 


F 

1.60 1.45

B 

T 


-0.09 

0.01 


F 

0.65 0.61 

B 



-0.59 

0.70 


F 

-0.25 0.32 

B 



-1.00 

0.96 



-0.60 0.19 

B 



-1.09 

0.86 



-0.60 0.17 

B 



-0.78 

0.76 



-0.60 0.23 

B 



-0.68 

0.66 

* 

F 

-0.45 0.40 

B 



-0.09 

0.66 


F 

-0.29 0.75 
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Ser 

119 


B 




Gln

120 


B 




Pro 

121 




T 


Asn 

122 




T 


His 

123 




T 


Gly 

124 




T 


Leu 

125 






Pro 

126 




T 


Thr 

127 




T 


Cys 

128 






Leu 

129 


B 

B 



Val 

130 


B 

B 



Cys 

131 


B 

B 



Val 

132 


B 

B 



Cys 

133 


B 

B 



Leu 

134 


B 

B 



Gly 

135 




T 


Ser 

136 




T 


Ser 

137 


B 




Val 

138 


B 




Tyr 

139 


B 




Cys 

140 


B 




Asp 

141 


B 




Asp 

142 


B 




Ile

143 


B 




Asp 

144 

A 

B 




Leu 

145

A 

B 




Glu 

146 

A 

B 




Asp 

147 

A 

B 




Ile

148 

A 

B 




Pro 

149 






Pro 

150 






Leu 

151 






Pro 

152 


B 




Arg 

153 




T 


Arg 

154 


B 




Thr 

155 


B 

B 



Ala 

156 


B 

B 



Tyr 

157 


B 

B 



Leu 

158 


B 

B 



Tyr 

159 


B 

B 





0.22 

0.37 



p 

0.52 1.46 

T 


0.69 

0.19 



F 

0.88 2.41 

T 


0.69 

-0.07 



p 

2.04 1.78 

T 


1.29 

0.23 



p 

1.60 1.10 

T 


1.28 

0.27 



p 

1.29 0.98 



0.91 

0.36 




0.78 0.91

T 

Q 

0.10 

0.50




U..JZ U.JU 

T 


-0.54 

0.79 




0.36 0.18 

T 


-1.21 

0.93 




rv nft ft i a 

U.zU U.14 

T 


-2.03 

1.07 







-2.36 

1.03 




-U.OU U.U4 



-2.36 

1.17




-U.oU 0.02 



-2.49 

1.37 




ft £ft A /VI 



-2.48 

1.23 




ft ezn n fii 



-2.11 

0.93 




a fin n 



-2.16 

0.67 




-0.60 0.13 

T 






F 

0.35 0.13 

T 






F 

u.jd o.3y 

T 


-0.69 

0.86



F 

ft ne ri i« 

T 







ft 1ft ft A 1 

0.10 0.43 








0.50 0.53 

T 







ft 1 ft ft i o 

0.10 0.26

T 







ft "7ft ft 

O. /O U.OJ 

T 


0.03 

-0.30 



F 

ft oc n n 
O.DO U.J J 

T 










0.24 

-1.63 



p 

0.90 1.07 



0.70 

-0.94 



p 

0.75 0.45 



0.49 

-0.51 



p 

0.75 0.99



-0.32 

-0.77 


* 

p 

0.99 0.92



0.36 

-0.09 

* 


p 

0.93 0.92


Q 

0.47 

-0.34 

* 


p 

1.57 0.82 


C 

1.39 

-0.34 



F 

1.81 0.96

T 

c 

1.08 

-0.34 

* 


F 

2.40 2.68 

T 


0.49 

-0.54 



F 

2.26 2.50 

T 


1.13 

-0.47 

* 


F 

2.12 1.63 

T 


0.53 

-0.14 

* 


F 

1.48 3.11 



0.50 

-0.14 

* 



0.69 1.66 



0.72 

0.19 

* 

* 


-0.15 1.33 



1.04 

0.69 
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-0.10 0.62 
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-0.30 0.36 

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IV 

V 

VI 
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IX 

X 

XI 
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Met 

1 

A 

B 





-1.30 

0.70 




-0.60 0.39 

Leu 

2 

A 

B 





-1.72 

0.96 




-0.60 0.25 

Leu 

3 

A 

B 





-2.14 

1.21 




-0.60 0.16 

Pro 

4 

A 

B 





-2.06 

1.47 




-0.60 0.14 
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5 
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-0.60 0.22 
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0.91 

* 


-0.60 0.36 



489 



g 

B 




-0.74

0.63 

* 


-0.60 0.84 



490 



B 



T 


-0.52 

0.37 


F 

0.25 0.62 


Ser 







T 

c 

-0.33 

0.49 

* 

F 

0.15 0.43 


Asn 

492 






T 

C 

-0.84 

0.79 


F 

0.15 0.37 


Gly 

493 






T 


-0.88 

0.97 


F 

0.15 0.35 


Ala 

494 




B 



Q 

-1.18 

0.61 



-0.40 0.26 


Phe 

495 



B 





-0.82 

0.91 



-0.60 0.11 


Leu 

496 



B 





-1.41 

0.94 



-0.60 0.11 


Gly 

497 



B 

B 




-1.72 

1.20 



-0.60 0.08 


Ile

498 








-1.97 

1.19 



-0.60 0.13 


Gly 

499 



B 

B 




-2.19 

0.90 



-0.60 0.16 


Ile

500 


A 

B 





-2.30 

0.90 



-0.60 0.13 


Thr 

501 


A 

B 





-2.19 

1.16 



-0.60 0.16 


Ala 

502 


A 

B 





-2.66

1.26 



-0.60 0.14 


Leu 

503 


A 

B 





-2.43

1.51 



-0.60 0.16 


Leu 

504 


A 

B 





-2.90 

1.40 



-0.60 0.06 


Phe 

505 


A 

B 





-2.60 

1.60 



-0.60 0.05 


Leu 

506 


A 

B 





-3.10

1.60 



-0.60 0.06 


Cys 

507 

A 

A 






-3.40

1.60 



-0.60 0.06 


Leu 

508 

A 

A 






-3.48

1.60



-0.60 0.05 


Ala 

509 

A 

A 






-3.27 

1.50 

* 


-0.60 0.04 


Leu 

510 

A 

A 






-2.52 

1.43 

* 


-0.60 0.08 


Ile

511 


A 

B 





-2.60 

0.86 

* 


-0.60 0.19 


Ile

512 


A 

B 





-2.74 

0.86 

* 


-0.60 0.13 


Met 

513 


A 

B 





-2.14 

1.04 

* 


-0.60 0.13 


Lys 

514 


A 

B 





-1.51 

0.79 

* 


-0.26 0.28 


Ile

515 


A 

B 





-0.59 

0.10



0.3S 0.81 


Leu 

516 


A 






0.41 

-0.59 



1.77 1.61 


Pro 

517 






T 

Q 

0.99 

-1.20 


F 

2.86 1.57 


Lys 

518 





T 

T 


1.59 

-0.71 

* F 

3.40 3.24 


Arg 

519 





T 

T 


1.23 

-1.00

* F

3.06 6.80 


Arg 

520 





T 

T 


2.12 

-1.20 


F 

2.72 6.34 


Thr 

521 





T 



2.62 

-1.63 


F 

2.18 5.49 


Gln

522 



B 





2.62 

-1.14 

* 

F 

1.44 4.05 


Thr 

523 



B 





2.69 

-0.71 

* 

F 

1.10 3.20 


Glu

524 



B 





2.37 

-0.71 

* 

* F 

1.10 4.34


Thr 

525 



B 



T 


2.37 

-0.77 


* F 

1.30 3.87 


Pro 

526 






T 

C 

1.98 

-1.17 


* F 

1.50 5.26 


Arg 

527 






T 

c 

1.6S 

-0.87 


* F 

1.84 2.63 


Pro 

528 





T 

T 


2.10

-0.49 


* F 

2.08 2.44 
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Arg 

529 



T 


2.07 

-0.97 



F 

2.52 3.09 

Phe 

530 



T 


2.08 

-0.90 

* 


F 

2.86 2.15 

Ser 

531 



T 

T 

1.98 

-0.51 


* 

F 

3.40 1.86 

Arg 

532 

B 



T 

0.98 

-0.46 


* 

F 

2.36 1.37 

His 

533 

B 



T 

0.38 

0.23 

* 


F 

1.42 1.11 

Ser 

534 

B 



T 

0.27 

0.13 

* 


F 

0.93 0.68 

Thr 

535 

B 

B 



0.72 

-0.26 

* 



0.64 0.58 

Ile 

536 

B 

B 



0.13 

0.50 

* 



-0.60 0.67 

Leu 

537 

B 

B 



0.02 

0.69 




-0.60 0.35 

Asp 

538 

B 

B 



-0.80 

0.70 

* 



-0.60 0.39 

Tyr 

539 

B 

B 



-1.36 

0.86 

* 



-0.60 0.41 

Ile

540 

B 

B 



-1.26 

0.81 

* 



-0.60 0.37 

Asn 

541 

B 

B 



-0.68 

0.56 

* 



-0.60 0.34 

Val 

542 

B 

B 



-0.46

1.04 




-0.60 0.32

Val 

543 

B 

B 



-0.80 

0.79 

* 



-0.60 0.46 

Pro 

544 

B 




-0.77 

0.53 

* 


F 

-0.25 0.28 

Thr 

545 

B 



T 

-0.69 

0.56 

* 


F 

-0.05 0.59 

Ala 

546 

B 



T 

-1.28 

0.60 

*


F 

-0.05 0.65 

Gly 

547 

B 



T 

-0.42 

0.46 

* 


F 

-0.05 0.43 

Pro 

548 

B 



T 

0.48 

0.43 

* 


F 

0.21 0.51 

Leu 

549 

B 




0.80 

-0.06 



p 

1.32 1.01 

Ala 

550 

B 




1.11 

-0.56 



F 

1.88 2.00 

Gln

551 

B 




1.70 

-0.59 



F 

2.14 2.08 

Lys 

552 

B 



T 

2.09 

-0.61 

* 


F 

2.60 4.38 

Arg 

553 

B 



T 

1.71 

-1.30 



F 

2.34 8.66 

Asn 

554 

B 



T 

2.21 

-1.30 



F 

2.08 5.05 

Gln

555 

B 



T 

2.59 

-1.21 

* 


F 

2.10 3.65 

Lys 

556 

B 




2.59 

-0.79 

* 


F 

1.92 2.88 

Ala 

557 





C 2.24 

-0.39 

* 


F 

1.84 2.88 

Thr 

558 




T 

C 1.92 

-0.40 

* 


F 

2.32 2.23 

Pro 

559 



T 

T 

2.03 

-0.37 

* 


F 

2.80 1.72 

Asn 

560 



T 

T 

1.72 

-0.37 

* 


F 

2.52 3.34 

Ser 

561 




T 

C 1.47 

-0.39 

* 


F 

2.04 3.34 

Pro 

562 



T 


1.24 

-0.44 

* 


F 

1.76 3.34 

Arg 

563 



T 


1.34 

-0.19 

* 


F 

1.48 1.71 

Thr 

564 

B 




1.34 

-0.16 

* 


F 

0.80 1.98 

Pro 

565 

B 




1.00 

-0.11 



F 

0.80 1.98 

Leu 

566 

B 




0.71 

-0.11 


* 

F 

0.65 1.00 

Pro 

567 

B 



T 

0.71 

0.39 


* 

F 

0.25 0.70 

Pro 

568 



T 

T 

0.30 

0.33 



F 

0.65 0.70 

Gly 

569 




T 

C 0.40 

0.29 



F 

0.60 1.14 
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Ala 

570 





T 

C 

0.61 

0.03 


F 

0.94 1.14 

Pro 

571 






C 

1.12 

-0.40 


F 

1.68 1.27 

Ser 

572 





T 

C 

1.38 

-0.44 


F 

2.22 1.72 

Pro 

573 





T 

c 

1.63 

-0.87 


F 

2.86 3.41 

Glu 

574 




T 

T 


1.98 

-1.37 


F 

3.40 4.41 

Ser 

575 




T 

T 


2.57 

-1.40 


F 

3.06 5.29

Lys 

576 




T 

T 


2.82 

-1.39 


F 

2.94 5.93 


577 




T 

T 


3.17 

-1.81 


F 

2.82 6.84 

Asn 

578 




T 

T 


3.38 

-1.81 


F 

2.70 10.21 

Gln

579 




T 

T 


3.13

-1.80 

* 

F 

2.58 8.84 


580 



B 




3.43 

-1.04 

* 

F 

2.20 6.93 

Lys 

581 



B 




2.58 

-0.64 

* 

F 

1.98 7.46 

Gln

582 



B 




2.32 

-0.36 

* 

F 

1.46 3.55 

Tyr

583 



B 




2.02 

-0.33 



1.09 2.75 

Gln

584 



B 




1.32 

0.06 



0.27 1.84 


585 



B 


T 


1.07 

0.84 



-0.20 0.92 

Pro 

586 



B 


T 


1.02 

0.87 


F 

-0.05 0.91

Ser 

587 



B 


T 


0.81 

0.11 


F 

0.59 0.91 

Phe 

588 



B 


T 


1.10 

0.14 


F 

1.08 1.70 

Pro 

589 







0.80 

-0.54 


F 

2.32 2.20 

Glu 

590 






Q 

1.31 

-0.59 


p 

2.66 2.20 

Pro 

591 




T 

T 


1.21 

-0.59 


F 

3.40 3.41 

Lys

592 




T 

T 


1.51 

-0.89 


F 

3.06 3.18 

Ser 

593 





T 

Q 

1.62 

-0.91 


F 

2.52 3.18 

Ser 

594 





T 

c 

1.62 

-0.41 

* 

F 

1.88 2.08 

Thr 

595 






c 

1.62 

-0.41 

* 

F 

1.64 1.61 

Gln

596 






c 

1.53 

-0.41 

* 

F 

1.60 2.08 

Ala 

597 





T 

c 

1.49 

-0.41 

* 

F 

2.10 2.08 

Pro 

598 





T 

c 

1.79 

-0.40 


F 

2.40 2.49 

Glu 

599 





T 

c 

1.79 

-0.89 


F 

3.00 2.49 

Ser 

600 





T 

c 

2.10 

-0.90 


F 

2.70 3.31 

Gln

601 






Q 

2.10 

-1.00 


p 

2.00 3.70 

Glu 

602 


A 




c 

2.69 

-1.43 


F 

1.70 3.70 

Ser 

603 


A 




c 

2.09 

-1.43 


F 

1.40 4.79 

Gln

604 

A 

A 





2.06 

-1.13 


F 

0.90 2.28

Glu 

605 

A 

A 





2.11 

-1.03 

* 

F 

0.90 1.79 

Glu 

606 

A 

A 





1.52 

-0.27 

* 

F 

0.60 2.09 

Leu 

607 


A 

B 




1.21 

-0.16 



0.45 1.22 

His 

608 


A 

B 




0.70 

-0.07 



0.45 1.02 

Tyr 

609 


A 

B 




0.70 

0.61 



-0.60 0.48 

Ala 

610 


A 

B 




0.00 

1.01 

* 


-0.60 0.95 
Thr

611 

A 

B 





-0.21 



* 


-0.60 0.60 

Leu 

612 

A 



T 



0.26 

1.04 





Asn 

613 

A 

B 





-0.57 

0.71 




-VF.OU U.JO 

Phe 

614 


B 

B 




-0.21 

0.86 




-U.OU U.JU 

Pro 

615 



g 

T 



0.17 

0.37 


* 


0.25 0.71 

Gly 

616 



g 

T 



0.59 

0.11




KJ.J.D U.Oo 

Val 

617 



g 



Q 

1.19 

-0.29



F 

0.80 1.54 


618 





T 


1.19

-0.64



F 

1.50 1.54 

Pro 

619 





T 


1.30




F 

l.DU 2./K) 

Arg

620 





T 


1.62

-1.00



F 

1.50 3.68 

Pro 

621 


g 



T 






F 

1.30 3.68 

Glu 

622 




T 



2.01 

-1.03



F 

1.84 2.35

Ala 

623 


g 






-1.03 



F 

1.78 1.86 

Arg 

624 


g 










1.97 2.40 

Met 

625 


g 



T 



-1.03 



F 

2.66 1.37

Pro 

626 




T 



1.60 

-0.54 



F 

3.40 1.96 

Lys 

627 




T 

T 


1.01 

-0.64



F 

3.06 1.74

Gly

628 






C 

1.60 

-0.14 



F 

2.22 1.77 

Thr 

629 








-0.76 



F 

1.78 1.91

Gln

630 


g 





1.26 

-0.43 



F 

0.94 1.50

Ala 

631 

A 

B 





1.47 

0.07 



F 

0.00 1.53 

Asp 

632 

A 

B 





0.57 

-0.36 




0.45 1.84 

Tyr 

633 

A 

B 





0.96 

-0.20 


* 


0.30 0.79 

Ala 

634 

A 

B 





0.57 

-0.60 


* 


0.75 1.56 

Glu 

635 

A 

B 





0.57 

-0.31 


* 


0.30 0.81 

Val 

636 

A 

B 





0.77 

0.09 




-0.30 0.89 

Lys 

637 

A 

B 





0.38 

-0.24 




0.45 1.13 

Phe 

638 

A 

B 





0.23 

-0.31 


* 


0.30 0.83 

Gln

639 

A 

B 





0.43 

0.11 


* 


-0.15 1.44

Ter 

640 

A 

B 





0.04 

-0.10 


* 


0.30 0.92 
Table VI


Res Position 


III 

IV V 

VI 

VII 

VIII 

IX 

X 

XI 

XII 

XIII XIV

Met 

1 


B 




-1.90 

0.96 




-0.60 0.21 

Leu 

2 

A 

B 




-2.32 

1.21 




-0.60 0.13 

lIu 

3 


B 




-2.74 

1.47 




-0.60 0.09 


4 


B 




-2.57 

1.73 




-0.60 0.07 

Leu 

5 


B 




-2.99 

1.54 




-0.60 0.14 




B 




-3.20 

1.54 




-0.60 0.14 

Leu 

7 

A 

B 




-2.68 

1.54 




-0.60 0.14 

Pro 

8

A 

B 




-2.21 

1.77 




-0.60 0.17 


9 

A 

B 




-1.29 

1.51 




-0.60 0.21 

Leu 

10 

A 

B 




-0.48 

0.83 




-0.60 0.49 



A 




C 

0.44 

0.14 




-0.10 0.55 

Gly

12 

A 




c 

0.40 

-0.29 


* 

F 

0.80 1.31 

Anr 

13 

A 




c 

0.61 

-0.33 



F 

0.80 1.18 

Glu

14 

A 

B 




1.08 

-1.01 



F 

0.90 1.94 


15 

A 

B 




1.89 

-1.50 



F 

1.24 1.94 

Val 

16 

A 

B 




2.22 

-1.53 



F 

1.58 1.71 

Glu 

17 

A 


T 



2.27 

-1.53 



F 

2.32 1.98 

Gly

18



T 



2.16 

-1.14 



F 

2.86 1.35 

Gln

19 



T 

T 


2.27 

-0.74 



F 

3.40 2.93 

Lys 

20 




T 

c 

2.20 

-1.39 



F 

2.86 3.32 

Ser 

21 




T 

c 

3.06 

-1.39 



F 

2.86 6.70 


22 



T 

T 


2.81

-1.81 



F 

3.06 6.46 

Arg 

23 



T 

T 


2.86 

-1.46 



F 

3.06 5.06 

Lys 

24 



T 

T 


2.04 

-1.07 



F 

3.06 5.06 

Asp 

25 



T 

T 


1.69 

-0.77 

* 


F 

3.40 2.60 

Tyr 

26 


B 


T 


1.39 

-0.69 




2.51 1.91 

Ser 

27 


B 

B 



1.39 

-0.07 


* 


1.32 0.95

Leu 

28 


B 

B 



0.98 

0.33 


* 


0.38 0.98 

Thr 

29 


B 

B 



0.63 

0.71 


* 


-0.26 0.84 

Met 

30 


B 


T 


-0.22 

0.34 


* 


0.10 0.84 

Gln

31 


B 


T 


-0.29 

0.60 


* 

F 

-0.05 0.76 

Ser 

32 


B 


T 


-0.84 

0.40 


* 

F 

0.25 0.76 

Ser 

33 


B 


T 


-0.03 

0.56 


* 

F 

-0.05 0.57 

Val 

34 


B 

B 



0.28 

0.34 


* 

F 

-0.15 0.57

Thr 

35 


B 

B 



0.53 

-0.06 


* 

F 

0.45 0.73 

Val 

36 


B 

B 



-0.07 

-0.01 



F 

0.45 0.54 
Gln

37 


B 

B 


Glu 

38 


B 

B 


Gly 

39 



B 

T 

Met 

40 


B 

B 


Cys 

41 


B 

B 


Val 

42 


B 

B 


His 

43 


B 

B 


Val 

44 


B 

B 


Arg 

45 


B 

B 


Cys 

46 


B 



Ser 

47 




T 

Phe 

48 




T 

Ser 

49 


B 



Tyr 

50 


B 



Pro 

51 


B 



Val 

52 




T 

Asp 

53 


B 



Ser 

54 


B 



Gln

55 


B 



Thr 

56 




T 

Asp 

57 




T 

Ser 

58 





Asp 

59 


B 



Pro 

60 


B 



Val 

61 




T 

His 

62 


B 



Gly 

63 


B 



Tyr 

64 


B 



Trp 

65 


B 



Phe 

66 


B 



Arg 

67 


B 



Ala 

68 




T 

Gly 

69 




T 

Asn 

70 




T 

Asp 

71 





Ile

72 





Ser 

73 

A 



T 

Trp 

74 

A 

B 



Lys 

75 

A 

B 



Ala 

76 

A 

B 



Pro 

77 


& 
-0.43

0.21 




-0.15 0.72



-0.99 

0.14 



F 

-0.15 0.27



-0.17 

0.30 


* 


0.10 0.27 



-0.71 

0.16

* 

* 


_n "\n (191 

-U.JV V.Z1 



0.26 

0.40 

* 

* 


n f\ no 



-0.41 

0.40 

* 

* 


ft 'lA A 1 0 
•U.JU U. 10 



-0.71 

0.54 

* 



-U.oU U.IU 



-1.07

0.31 




-U..JU U.Z4 



-0.77 

0.53




-U.OU U.Zo 

T 


-0.34 

0.27 




U. 11) U.zo 

T 


0.30 

0.53 




ft tft ft crv 

T 


-0.52

0.31

* 



ft <ft ft /f £1 

T 


0.33 

0.96 




ft oft ri £>i 
-U.ZU U.64 



-0.08 

0.39 




-0.10 0.80

T 


0.59 

0.39 




0.25 1.24 

T 


0.58 

0.00



F 

1.40 1.60 

T 


1.28




F 

0.74 1.47 

T 


1.28

-0.66



F 

1.98 1.59 



1.52




F 

Z.IZ z.o7 







F 

Z-OO Z.87 

T 


1.52




F 

3.40 3.31

T 


1.49

-0.66



F 

2.86 1.42 

T 






F 

2.32 1.34 

T 


1.20

-0.61



F 

1.83 0.79 



1.22

0.14




ft £.A A til 

T 


0.52 

0.67 

* 



ft Oft ft « 
-U.ZU U.JO 

T 


0.93 

1.46 

* 



ft Oft ft "ll 
-U.ZU U.JJ 

T 


0.34 

1.03 

* 
B



Glu 
A

B 



Leu 
A

B
A
311

A 

B 
312
Gly
T

Glu 

315 




T 

Gly 

316 




T 

Glu 

317 

A 



T 

Phe 

318 

A 

B 



Thr 

319 

A 

B 



Cys 

320 

A 

B 



Arg 

321 

A 



T 

Ala 

322 

A 



T 

Gln

323 

A 



T 




1.19 

-0.17 


* 

F 

2.40 1.44 



0.90 

-0.17 


* 

F 

2.10 1.20 



0.89 

0.34 

* 

* 


0.30 0.82 



0.60 

0.40 

* 

* 


-0.30 0.76 



0.92 

0.89 

* 

* 


-0.60 0.79 



0.83 

0.39 

* 

* 


-0.30 0.79 



0.61 

0.77 

* 



-0.45 1.29



0.19 

0.77 

* 

* 


-0.60 0.79 



0.19 

0.87 


* 


-0.45 1.09



0.24 

0.64 

* 



-0.60 0.62 



0.32 

0.91 

* 



-0.20 0.93 



0.33 

0.43 

* 



-0.20 0.73 


c 

0.62 

0.81 




-0.40 0.73 

T 


0.30 

0.83 

* 


F 

0.10 1.53 

T 


0.30 

0.57 



F 

0.62 1.64 

T 


1.11 

0.47 



F 

0.74 2.67 

T 

c 

1.21 

0.19 



p 

0.96 2.74 

T 


1.21 

-0.14



F 

1.88 2.74 

T 

c 

0.60 

0.11 



F 

1.20 1.69 

T 

c 

0.00 

0.37 



p 

0.93 0.72 

T 


0.30 

0.66 

* 


p 

0.31 0.39 



-0.51 

0.23 

* 

* 

F 

0.09 0.50 



-0.30 

0.53 


* 


-0.48 0.26 



-0.86 

0.53 


* 


-0.60 0.29 



-0.89 

0.74 


* 


-0.60 0.26 



-1.49 

0.56 


* 


-0.60 0.47 



-1.02 

0.60 

* 

* 


-0.60 0.47 



-0.17 

0.34 


* 


-0.30 0.27 



0.64 

0.34 


* 


-0.30 0.55 


c 

0.30 

-0.34 

* 

* 


0.84 0.55 

T 

c 

1.11 

-0.31 


* 

F 

1.73 0.73

T 
T


0.96 

-0.67 


* 

F 

2.91 0.97 

T 


0.32 

-0.87 

* 

* 

F 

3.40 1.42 



1.24 

-0.73 

* 

* 

F 

2.51 0.46 



1.00 

-0.73 


* 


1.62 0.52 



1.00 

-0.23 

* 

* 


0.98 0.53 



1.00 

-0.26 

* 

* 


0.89 0.53 



1.04 

0.14 

* 

* 


0.60 0.98 



0.23 

-0.26 

* 

* 

F 

1.60 0.91 



0.59 

-0.06 

* 

* 

F 

2.00 1.40 



324 




T 

T 


0.60 

-0.20 



p 

2.50 0.71 

Ser 

325 





T 

c 

1.27 

0.19 

* 

* 

F 

1.45 0.94

Leu 

326 





T 


1.12 

0.09 

* 

* 

p 

1.20 0.94

Gly 

327 




T 

T 


0.86 

0.19 



p 

1.15 0.79 

Ser 

328 


B 

B 




0.56 

0.43 



p 

-0.20 0.44 

Gln

329 


g 

B 




-0.26 

0.43 



F 

-0.45 0.71










0.43




n An n so 









0.04

0.40




n fin n "7i 

Ser 

332 


3 





0.09 

0.70 




_rt An ft "\a. 









-0.42

0.69


* 


ft rtrt ft 


334 







-0.42 

0.87 




-ft rtft ft "V7 

Leu 

335 







-0.39

0.63




ft in ft as 

Ser 

336 







0.47 

0.64 


* 


-0.25 1.01 









0.52 

-0.04 




0.45 1.09

Gln








1.02 

0.31 


* 

p 















q 24 2 22 

Glu 

340 

A 

B 





1.53

0.16


* 

p 

U.DO Z.OD 

Tyr 









-0.53




2.32 3.08

Thr 

342 




T 

T 


2.16 





2.76 1.76

Gly 

343 




T 

T 


1.94 

-0.71 





Lys 

344 




T 

T 


1.09 

-0.29 



F 

2.76 1.96

Met 

345 


B 






-0.40



p 

1.82 1.01

Arg 

346 


B 










1.48 1.37

Pro 

347 


B 



T 


0.14 

-0.50 


* 

p 

1.19 0.68 












* 

p 

0.25 0.51 

Ser 






T 


-1.18 

0.21 

* 

* 

p 

0.25 0.21

Gly 






T 


-0.92

0.90 

* 



-0.20 0.11

Val 

351 



g 




-1.62 

0.90 

* 



-0.60 0.15 


352 


g 

B 




-2.27 

0.76 




-0.60 0.11 

Leu 

353 


g 

B 




-1.76 

1.01 




-0.60 0.09 

KMiy 

354 


g 

g 




-1.80 

1.01 




-0.60 0.11

Ala 

355 


& 

B 




-2.04 

0.80 




n £.t\ ft i A 

Val 

356 


B 

B 




-1.53 

0.61 




-0.60 0.17 

Gly 

357 


B 





-1.31 

0.36 




-0.10 0.17 

Gly 

358 





T 

c 

-0.81 

0.43 



F 

0.15 0.17 

Ala 

359 


B 



T 


-1.06 

0.41 



F 

-0.05 0.33 

Gly 

360 


B 



T 


-1.28 

0.27 




0.10 0.33 

Ala 

361 


B 



T 


-1.28 

0.53 




-0.20 0.28 

Thr 

362 


B 

B 




-1.63 

0.74 




-0.60 0.20 

Ala 

363 


B 

B 




-2.10 

1.03 




-0.60 0.18 

Leu 

364 


B 

B 




-1.81 

1.29 




-0.60 0.15 
365
B




-2.17 

1.17 




-0.60 0.14 

Phe 

366 


B 

B 




-2.24 

1.47 




-0.60 0.12 

Leu 

367 


B 

B 




-2.79 

1.54 




-0.60 0.08 

Ser 

368 


B 

B 




-3.09 

1.50 




-0.60 0.08 

Phe 

369 


B 

B 




-2.98 

1.54 




-0.60 0.06 

Cys 

370 


B 

B 




-3.01 

1.54 




-0.60 0.06 

Val 

371 


B 

B 




-3.17 

1.54 




-0.60 0.03 

Ile

372 


B 

B 




-3.21 

1.80 

* 



-0.60 0.03 

Phe 

373 


B 

B 




-2.80 

1.66 

* 



-0.60 0.04 

Ile

374 


B 

B 




-2.40 

1.09 

* 



-0.60 0.11 

Val 

375 


B 

B 




-2.40 

0.83 




-0.26 0.20 

Val 

376 


B 

B 




-1.43 

0.71 

* 



0.08 0.12 

Arg 

377 


B 



T 


-0.50 

-0.07 

* 



1.72 0.35 

Ser 

378




T 

T 


0.24 

-0.76 

* 


F 

2.91 0.94 

Cys 

379 




T 

T 


0.83 

-1.40 



F 

3.40 2.53 

Arg 

380 




T 

T 


1.10 

-1.66 

* 


F 

3.06 1.73 

Lys 

381 

A 



T 



2.07 

-1.16 

* 


F 

2.32 1.30 

Lys 

382 

A 



T 



1.74

-1.54 
F

1.98 4.76 

Ser 
C

1.46 

-1.69 

* 


F 

1.44 3.76 

Ala 
1.53

-1.19 

* 
F

0.90 1.90 

Arg 

385 

A 

B 





1.42 

-0.69 

* 

* 
0.75 0.96
A

B 





0.52 

-0.69 

* 


F 

0.90 1.20 

Ala 

387 

A 

B 





0.13 

-0.43 

* 



0.30 0.88 

Ala 

388 

A 

B 





0.43 

-0.50 

* 

* 


0.30 0.44 

Asp 

389 


B 



T 


0.13 

-0.50 

* 



0.70 0.48 

Val 

390 


B 



T 


-0.32 

-0.24 

* 



0.70 0.33 

Gly 

391 


B 



T 


-0.71 

-0.31 



F 

0.85 0.33 

Asp 

392 


B 



T 


-0.08 

-0.20 



F 

0.85 0.19 

Ile

393 

A 

B 





0.51 

-0.20 



F 

0.45 0.52 

Gly 

394 

A 

B 





-0.08 

-0.84 



F 

0.75 0.88 

Met 

395 

A 

B 





0.78 

-0.77 



F 

1.01 0.53 

Lys 

396 

A 

B 





0.81 

-0.37 

* 


F 

1.12 1.22 

Asp 
T
*

* 

F 

2.08 1.78 
T
2.04 1.26
*
T


1.22 
Ile
T
*
F
Gly

403 


B 



T 
Ser
T

C 

0.53 

-0.21 
1.92 1.40
c
*
C
Glu
Ser

413 


B 


T 


1.20 

-0.03 



F 

2.36 1.36 

Trp 

414 



T 

T 


1.20 

-0.71 
Pro

419 




T 

c 
Arg

420 



T 

T 
F
421




T 
1.59

-0.06 




2.41 1.05 

His 
423

A 

B 
A
-0.26 0.35

Ala 

425 

A 
0.47
-0.30 0.35

Ala 

426 
c

0.16 

0.39 




0.50 0.47 
427




T 

c 

0.19 

0.39 




1.20 0.57 

Ser 

428 




T 

c 

0.53 
F

2.25 0.97 

Ser 

429 




T 

c 

1.46 
c

2.04 
2.70 2.39

Glu 

431 
c

1.74 
F

2.00 3.09 

Glu 

432 
F

1.70 1.62 

Arg 
A

B 
-1.49
Glu
A
Ile
Gln
0.87

-0.23
Tyr
A
0.46
Ala
Ser
Res Position


II III 

IV 

V 

VI 

VII 

VIII 

IX 

X 

XI 

XII 

XIII XIV

Met 

1 


B 





0.43 

0.01 




-0.10 0.79


2 


B 





0.48 

-0.41 




0.65 1.21 

Leu 

3 


B 



T 


0.06 

-0.41 




0.70 0.94 

Pro 

4 

A 




T 


-0.41 

-0.16 




0.70 0.78 

Arg

5 


B 



T 


-0.88 

-0.13 



F 

0.85 0.35 

Gly 



B 



T 


-0.87 

0.51 



F 

-0.05 0.31 

Leu 

7 


B 

B 




-1.16 

0.33 




-0.30 0.20 

Val 

8


B 

B 




-0.93 

0.81 




-0.60 0.11 

Val 

9 


B 

B 




-1.53 

1.31 




-0.60 0.11 

Ala 

10 


B 

B 




-1.94 

1.57 




-0.60 0.11 

Trp

11


B 

B 




-2.41 

1.27 




-0.60 0.20 

Ala 

12 


B 

B 




-1.89 

1.31 




-0.60 0.22 

Leu 

13 


B 

B 




-1.24 

1.59 




-0.60 0.23 

Ser 

14 






c 

-0.73 

1.51 




-0.20 0.34

Leu 

15 






c 

-0.84 

1.03 




-0.20 0.34 

Tm 
lip 

16 





T 

c 

-0.87 

1.31 




0.00 0.35 

Pro 

17 





T 

c 

-0.28

1.11 




0.00 0.38 

Gly 

18 




T 

T 


0.22 

0.73 



F 

0.35 0.77 

Phe 

19 




T 

T 


-0.18 

0.53 



F 

0.50 1.06 

Thr 

20 






c 

0.63 

0.40 



F 

0.25 0.59 

Asp 

21 






c 

0.32 

0.37 



F 

0.25 O.9o 

Thr 

22 






c 

0.53 

0.56 





Phe 

23 


B 





0.57 

-0.23 

* 

* 


1.33 1.27 

Asn 

24 


B 



T 


1.38 

-0.23 

* 

* 


1.87 1.10 

Met 

25 




T 

T 


1.73 

-0.23 

* 

* 


2.61 1.49 

Asp 

26 




T 

T 


1.52 

-0.71 

* 


F 

3.40 3.44 

Thr 

27 




T 

T 


1.94 

-1.07 


* 

F 

3.06 3.31 

Arg 

28




T 



1.79 

-1.47

* 


F 

2.52 6.55 

Lys 

29 


B 





0.90 

-1.44 

* 


F 

1.78 2.91 

Pro 

30 


B 

B 




1.29 

-0.76 

* 


F 

1.24 1.41 

Arg 

31 


B 

B 




0.94 

-0.81 



F 

0.90 1.12 

Val 

32 


B 

B 




0.96 

-0.39 


* 


0.30 0.55 

Ile

33 


B 



T 


0.96 

0.00 


* 

F 

0.22 0.48 

Pro 

34


B 



T 


0.60 

-0.43 


* 

F 

0.79 0.48 

Gly 

35 




T 

T 


0.22 

0.06 

* 

* 

F 

0.56 0.93 

Ser 

36 


B 



T 


-0.59 

-0.09 

* 

* 

F 

0.88 1.34 
37


g 





-0.43 

0.01 


F 

-0.30 0.75 

Thr 

38 



B 




0.11

0.37 


F 

-0.27 0.66 


39 


B 

B 




0.08 

0.37 



-0.39 0.49 

Phe 



B 

B 




0.11 

0.74 



-0.66 0.39 

Phe

41 


B 

B 




-0.44 

1.23 



-0.63 0.39 

Gly 

42 


B 

B 




-0.56 

1.39 



-0.60 0.29 

Tyr 

43 


B 

B 




-0.24 

1.29 



-0.60 0.57 

Thr 

44 
T


1.49 

-0.96 

* 

* 

F 

2.91 0.89 

T 


1.38 

-1.46 

* 


F 

3.40 2.12 

T 


1.10 

-1.36 

* 


F 

3.06 1.10 



1.91 

-0.87 

* 


F 

2.37 0.92 



2.02 

-0.47 


* 

F 

1.48 1.49 



1.43 

-0.90 

* 

* 

F 

1.44 2.95 



0.92 

-0.83 

* 

* 

F 

1.10 1.79 



0.46 

-0.19 

* 

* 

F 

0.45 0.68 



-0.36 

0.50 

* 

* 

F 

-0.45 0.64 



-0.77 

0.40 

* 

* 


-0.60 0.37 



-0.37 

0.30 




-0.06 0.31 



-0.40 

0.29 




0.18 0.21 



-0.40 

0.71 



F 

0.27 0.21 

T 

c 

-0.51 

0.61 



F 

1.11 0.48 

T 

c 

-0.73 

-0.03 



F 

2.40 1.01 

T 

c 

-0.54 

-0.03 

* 


F 

2.16 1.01 

T 


0.27 

-0.14 

* 


F 

1.57 0.40 



1.19 

-0.53 

* 


F 

1.23 0.52 



0.60 

-0.91 

* 

* 

F 

1.14 1.03 



0.90 

-0.66 

* 

* 


0.60 0.42 



0.54 

-0.66 

* 

* 


0.60 0.39 



0.51 

0.13 

* 



-0.30 0.41 



-0.34 

-0.06 

* 

* 


0.45 1.03



-0.34 

0.01 




-0.30 0.44 
A
A
-0.60 0.40
A

B 





0.20 

0.60 




-0.60 0.46 

Asp 

738 

A 

A 
-0.19



F 

0.62 0.57 

Thr 

739 

A 





T 


-0.20 

0.07 



F 

0.74 1.20 
740

A 





T 


-0.12 

0.07 



F 

0.91 1.08 

Asp 

741 

A 





T 


0.52 

-0.61 




1.83 1.30 

Tyr 

742 



B 



T 


0.48 

-0.19 




1.70 1.39 

Val 

743 



B 

B 




0.17 

-0.03 




1.13 1.02 

Lys 

744 



B 

B 




-0.22 

-0.04 



F 

0.96 0.88 
745



B 

B 




0.07 
*


F 
746



B 

B 




-0.79 
Thr

747 



B 
-0.54

0.37 




-0.30 0.33 

Phe 

748 



B 

B 




0.07 

0.37 


* 


-0.30 0.37 

Ser 

749 



B 

B 




-0.28 

0.70 


* 


-0.60 0.77 

Val 

750 



B 

B 




-0.88 

0.44 


* 


-0.60 0.72 

Glu 

751 



B 

B 




-0.02 

0.64 


* 


-0.60 0.68 

Tyr 

752 



B 





0.29 

-0.14 


* 


0.50 0.88 

Ser 

753 







C 

0.78 

-0.53 


* 


1.49 1.99 

Leu 

754 





T 



1.08 

-0.74 

* 



2.03 1.78 

Glu 

755 

A 







1.90 

-0.74 

* 


F 

2.12 1.89 

Asp 

756 

A 





T 


1.56 

-1.00 



F 

2.66 1.92 

Pro 

757 





T 

T 


1.59 

-0.96 



F 

3.40 2.31 

Asp 

758 





T 

T 


1.29 

-1.21 



F 

3.06 2.06 

His 

759 






T 

C 

1.29 

-0.60 



F 

2.52 1.22 

Gly 

760 







C 

1.29 

0.09 



F 

0.93 0.65 

Pro 

761 



B 





1.29 

-0.34 



F 

0.99 0.65 

Met 

762 



B 





1.16 

-0.34 




0.50 0.80 

Leu 

763 



B 





0.87

-0.41 



F 

0.89 0.80 

Asp 

764 





T 

T 


0.69 

0.07 



F 

1.13 0.54 

Asp 

765 





T 

T 


0.72 

0.07 



F 

1.37 0.85

Gly 

766 





T 

T 


0.62 

-0.06 



F 

2.36 1.48 

Trp 

767 






T 

C 

0.41 

-0.26 

* 

* 

F 

2.40 1.28 

Pro 

768 




B 



C 

1.33 

0.43 

* 

* 

F 

0.71 0.63 

Thr 

769 



B 

B 




0.48 

0.43 

* 

* 

F 

0.42 1.25 

Thr 

770 



B 

B 




0.18 

0.64 


* 

F 

0.03 0.89 

Leu 

771 



B 

B 




-0.33 

0.11 


* 


-0.06 0.77 

Arg 

772 



B 

B 




-0.26 

0.33 

* 

* 


-0.30 0.39 

Val 

773 



B 

B 




-0.74 

0.27 

* 



-0.30 0.42 

Ser 

774 



B 

B 




-0.72 

0.57 

* 



-0.60 0.44 
775
-0.41

0.80 

* 

* 


-0.60 0.24 
776



B 

B 




0.06 

1.20 

* 



-0.60 0.52 

Phe 

777 




B 

T 



-0.72 

0.99 

* 

* 


-0.20 0.38 

Trp

778 





T 

T 


0.13 

1.17 




0.20 0.28 


779 






T 

c 

0.43 

0.93 




0.00 0.29 

Gly 

780 





T 

T 


1.29 

0.50 




n in ft ^7 

Cys 

781 





T 

T 


1.50 

-0.29 



p 

1.25 0.91 


782 





T 



2.17 

-1.20 



p 

1.15 0.98

Glu 

783 





T 



1.79 

-1.10 



p 

1.30 1.35 

Asp 

784 


A 



T 



0.93 

-0.96 



p 

1.30 1.35 

Glu 

785 





T 



1.07 

-0.89 



p 

1.15 0.62 

His 

786 





T 



1.73 

-0.86 

* 




Cys 

787 








0.92 

-0.86 

* 



0.60 0.56 










0.07 

-0.17 





Pro 

789 








-0.74 

0.47 



p 

-0.25 0.14 










-0.74 

0.66 



p 

-0.45 0.22 

iZ 









-1.30

0.09





Val 

792 








-0.52

-0.06 

* 

* 


0.30 0.33 


793 








0.03 

-0.49

* 

* 


kJ.JV/ U.JO 

Asp 

794 


A 











n •art rt #19 

Ala 

795 

A 

A 











n on i 

Arg 

796 






T 


0.03

-0.74 


* 

p 

1.30 1.40 








T 


0.58 

-1.00


* 

p 

1.30 1.29 

Asp 

798 

A 





T 

Q 

0.80 

-0.51 


* 

p 

1.50 1.85 

Leu 

799 






T 


0.20 

-0.51 



p 

1.35 0.95 

Pro 

800 







Q 

0.79 

0.10 

* 

* 

F 

0.05 0.70 

Thr 

801 








0.43 

-0.29 


* 


0.30 0.73 

Ala 

802 








0.07 

0.47 




-0.45 1.39 

Met 

803 








0.07 

0.36 

* 



-0.30 0.48 

Glu 

804 








0.99 

0.33

* 



-0.30 0.58 

Tyr

805 




g 




0.34 

-0.16 

* 



0.45 1.12 

Cys 

806 

A 



B 




-0.16 

-0.01 




ft in n ha 

Gln

807 

A 



B 




0.54 

0.06 

* 



-0.30 0.40 

Arg 

808

A 



B 




1.19 

0.06 

* 



-0.30 0.50 

Val 

809 

A 



B 




0.98

-0.70 




0.75 1.86 

Leu 

810 



B 

B 




0.63 

-0.84 

* 


F 

1.20 1.66 

Arg 

811



B 

B 




1.30 

-0.74 

* 


F 

1.35 0.86 

Lys 

812 



B 

B 




1.30 

-0.34 

* 


F 

1.50 2.00 

Pro 

813 





T 



0.52 

-0.99 

* 


F 

2.70 4.06 

Ala 

814 





T 



1.08 

-1.10 



F 

3.00 1.11 

Gln

815 



B 



T 


1.30 

-0.71 

* 


F 

2.35 0.74 
Asp

816 



B 


T 

0.94 

-0.21 

* 


p 


Cys 

817 



B 


T 

0.59 

0.11 




0.70 0.75

Ser 

818 



B 


T 

-0.01 

0.10 




n a(\ ft 

U.4U U.OJ 

Ala 

819 



B 

B 


0.28 

0.39 




-U.jU U.J 1 

Tyr 

820 



B 

B 


-0.42

0.77




-0.60 0.78

Thr 

821 



B 

B 


-0.42





-0.60 0.50 

Leu 

822 



B 

B 



0.60 




-0.60 0.83 

Ser 

823 



B 

B 



0.59 




-0.60 0.76 

Phe 

824 



B 

B 







-0.30 0.76 

Asp 

825 



B 

B 


-0.80 

0.47 



F 

-0.45 0.69 

Thr 

826 



B 

B 


-1.38 

0.57 



F 

-0.45 0.44 

Thr 

827 




B 






F 

-0.45 0.36 

Val 

828 



B 

B 







-0.60 0.15 

Phe 

829 



B 

B 







-0.60 0.18

Ile

830 



B 

B 


-1.07

0.67 




-0.60 0.17 

Ile

831 



B 

B 


-0.64 

0.67 




-0.60 0.33 

Glu 

832 

A 



B 


-0.33 

0.03 



F 

-0.15 0.74 

Ser 

833 

A 





0.63 

-0.36 



F 

1.00 1.83 

Thr 

834 

A 




T 





F 

1.30 5.10 

Arg 

835 

A 




T 

0.78 

-1.09 



F 

1.30 2.19 

Gln

836 

A 




T 

0.81 

-0.59 



F 

1.30 1.65 

Arg 

837 

A 

A 




0.81 

-0.33 



F 

0.45 0.85 

Val 

838 


A 

B 



0.52 

-0.81




0.60 0.75 

Ala 

839 


A 

B 



0.52 

-0.31 




0.30 0.44 

Val 

840 


A 

B 



-0.40 

-0.23 




0.30 0.32 

Glu 

841 

A 

A 









-0.60 0.36

Ala 

842 

A 

A 




-0.51





ft 1A ft CI 

Thr 

843 

A 

A 




0 46 

-0.29




0.45 1.33 


844 

A 

A 








F 

0.90 1.50 

Glu 

845 

A 

A 




1 

-0.50



F 

0.60 1.47 

Asn 

846 

A 




T 

1.56 

-1.00 


* 

p 

1.60 1.77

Arg 

847 

A 





1.56 

-1.09 



F 

1.90 3.45

Gly 

848 

A 




T 

1.62 

-1.27 

* 

* 

F 

2.20 2.01 

Glu 

849 

A 




T 

2.13 

-0.51 

* 

* 

F 

2.50 1.96 

Asn 

850 





T 

C 1.82 

-0.53 

* 


F 

3.00 1.34 

Ala 

851 

A 




T 

0.97 

-0.04 

* 


F 

2.20 1.95 

Tyr 

852 



B 


T 

0.04 

0.17 

* 



1.00 0.84 

Ser 

853 



B 


T 

0.39 

0.86 

* 



0.40 0.43 

Thr 

854 



B 

B 


-0.50 

0.86 

* 



-0.30 0.68 

Val 

855 



B 

B 


-0.80 

1.04 




-0.60 0.31 

Leu 

856 



B 

B 


-0.21 

0.67 




-0.60 0.31 
Asn

857 



B 

B 




-0.27 

0.69 

* 



-0.60 0.37 

Ile

858 



B 

B 




-0.56 

0.59 

* 


F 

-0.45 0.66 

Ser 

859 



B 





-0.24 

0.44 


* 

F 

-0.25 0.81 

Gln

860 



B 




C 

-0.20 

0.16 


* 

F 

0.25 0.81 

Ser 

861 






T 

C 

0.61 

0.44 


* 

F 

0.15 0.96 

Ala 

862 



B 



T 


-0.09 

0.16 



F 

0.40 1.23 

Asn 

863 



B 



T 


0.21 

0.56 


*


-0.20 0.62 


864 

A 





T 


0.21 

0.66 




-0.20 0.47 

Gln

865 

A 

A 






-0.60 

0.66 


* 


-0.60 0.62 

Phe 

866 

A 

A 






-1.19 

0.84 


* 


-0.60 0.32 

Ala 

867 

A 

A 






-0.60 

1.13 

* 

* 


-0.60 0.27 

Ser

868 

A 

A 






-0.56 

0.84 

* 

* 


-0.60 0.27 


869 

A 

A 






0.26 

0.44 

* 



-0.60 0.62 

Ile

870 

A 

A 






0.26 

-0.34 

* 



0.45 1.07 

Gln

871 

A 

A 






0.66 

-0.84 

* 


F 

1.24 1.33


872 

A 

A 






1.24 

-0.84 

* 


F 

1.58 2.16 

Glu 

873 

A 

A 






1.20 

-1.53 



F 

1.92 5.14 


874 





T 

T 


1.71 

-1.79 


* 

F 

3.06 2.94 

Ser 

875 





T 

T 


1.71 

-1.80 


* 

F 

3.40 1.97 


876 





T 

T 


1.71 

-1.11 


* 

F 

2.91 0.80 

Gly

877 





T 

T 


1.00 

-1.11 


* 

F 

2.57 0.83 

Ser 

878 

A 

A 






0.14 

-0.54



F 

1.43 0.33 

Ile

879 

A 

A 






0.14 

-0.29 




0.64 0.15 

Glu 

880 

A 

A 






0.44 

0.11




-0.30 0.24 

Cys 

881 


A 

B 





0.44 

-0.31 




0.30 0.31 

Val 

882 

A 

A 






0.90 

-0.70 

* 



0.60 0.76 


883 

A 

A 






1.31 

-1.39 

* 


F 

0.75 0.86 

Glu 

884 

A 

A 






1.39 

-1.39 

* 


F 

0.90 3.15 

Glu 

885 

A 

A 






1.39 

-1.27 

* 


F 

0.90 3.50 

Arg 

886 

A 

A 






2.10 

-1.51 

* 


F 

0.90 3.77 

Arg 

887 

A 

A 






2.96 

-1.91 

* 


F 

0.90 4.35

Leu 

888 

A 

A 






2.10 

-1.51 

* 


F 

0.90 4.35

Gln

889 

A 

A 






1.43 

-0.87 

* 


F 

0.90 1.65 

Lys 

890 

A 

A 






1.43 

-0.30 

* 


F 

0.45 0.45 

Gln

891 


A 

B 





0.47 

0.10

* 



-0.30 0.88 

Val 

892 


A 

B 





0.06 

0.06 

* 



-0.30 0.38

Cys 

893 


A 

B





0.62 

0.04 

* 



-0.30 0.25 

Asn 

894 



B 



T 


0.41 

0.80 

* 



-0.20 0.23 

Val 

895 



B 



T 


-0.33 

0.83 

* 



-0.20 0.48 

Ser 

896 



B 



T


-1.03 

0.97 
-0.20 0.77

Tyr 

897 
T


-0.07 

1.19 




-0.20 0.41 
899

A 

A 
A
A
-0.57
*
A
A

A
*
A
-0.43
*
Ala
A
-0.30 0.48

Phe 

908 

A 

A 
-0.59

-0.39 


* 
Arg

909 

A 

A 
0.00
*

* 


-0.60 0.47 

Leu 

910 

A 
-0.74

-0.24 

* 

* 


0.30 0.80 

Asp 

911 

A 
-0.19
*

* 


-0.30 0.80 

Phe 

912 

A 

A 




0.44 
0.30 0.55

Glu 

913 

A 
0.84
*

* 


0.45 1.33 

Phe 

914 

A 

A 




-0.16 

-0.66 


* 


0.75 1.07 

Ser 

915 

A 



T 


-0.04 

0.03 


* 

F 

0.25 0.87 

Lys 

916 

A 



T 


-0.86 

0.03 


* 

F 

0.25 0.43 

Ser

917 

A 



T 


-0.19 

0.71 




-0.20 0.41 

Ile

918 

A 



T 


-0.22 

0.43 




-0.20 0.42 

Phe 

919 

A 

A 




-0.33 

0.54 




-0.60 0.28 

Leu 
A

A 




-0.03 

1.23 




-0.60 0.18 

His 

921 

A 

A 




-0.97 

0.84 




-0.60 0.43 

His 

922 

A 

A 




-0.67 

0.84 


* 


-0.60 0.35 

Leu 

923 

A 

A 




-0.59 

0.06 


* 


-0.30 0.74 

Glu 

924 

A 

A 




-0.48 

0.06 


* 


-0.30 0.45 

Ile

925 

A 

A 




-0.26 

0.06 


* 


-0.30 0.33 

Glu 

926 

A 

A 




-0.57 

0.06 


* 


-0.30 0.41 

Leu 

927 

A 

A 
*
Ala
A

A
Ala
A
T


0.57 

-0.11 


* 

F 

0.85 0.69 
931




T 

C 

0.57 
*

* 

F 

1.20 1.11 
F
Ser
T

C 
F
F
Glu
35
Leu
40
Thr
g

B 





g 

3 


B 










B 



B 



B 

A 

B 

B 

A 

B 

B 

A 

B 

B 

A 
A


B 

A 


B 

A 
A


B 

A 


B 

A 


B 

A 


B 

A 


B 

A 
A
A


B 



B 


B 

B 


B 

B 


B 

B 
T
T 

T 
C
-3.16

1.61 



-3.62 

1.57 



-3.40

1.57 



-3.40 

1.39 



-2.88 

1.39 



-2.02 

1.80 



-2.24 

1.11 



-1.78 
-1.51
c
c
T

c 
-1.46
-1.46

T 
T

c 

1.51 

-0.54 

T 

c 

1.44 
T

c 
T

c 
0.33


-0.60 

0.28 


-0.60 

0.29 


-0.60 

0.55 


-0.30 

0.37 


-0.60 

0.45 


0.30 0.89 

F 1.10 2.02 

F 1.30 4.57 

F 1.30 4.57 

F 1.84 5.90

F 2.18 9.03 

F 2.52 4.56 

F 2.86 1.92 

F 3.40 1.64 

F 3.06 1.29 

F 2.69 1.08 

F 2.22 1.08 

F 2.35 2.18 

F 1.88 1.96 
Pro

1185 



B 

T 

0.33

-0.33 

* 

F 

1.70 0.97 

Lys 

1186


A 

B 


0.76 

-0.33 

* * 

F 

1.28 1.08 

Val 

1187 


A 

B 


0.37 

-0.33 

* 

F 

0.96 0.96 

Leu 

1188 

A 

A 



0.19 

-0.39 

* 


0.64 0.79 

Glu 

1189 

A 

A 



0.11 

-0.39 

* 


0.47 0.51
A
-0.07

0.04
Res Position

I 

II

III 

IV 

V 

VI 

VII 

VIII 

IX 

X 

XI 

XII 

XIII 

XIV

Met 

1 

A 

A 






-1.47 

0.70 




-0.60 

0.31 

Ala 

2 

A 

A 






-1.38

0.96 




-0.60 

0.20 

Leu 

3 

A 

A 






-1.80 

0.91 




-0.60 

0.21 

Met 

4 

A 

A 






-2.27 

1.17 




-0.60 

0.17 

Leu 

5 

A 

A 






-2.69 

1.20 




-0.60 

0.13 

Ser 

6 

A 

A 






-2.39 

1.39 




-0.60 

0.13 

Leu 

7 

A 

A 






-2.61 

1.09 




-0.60 

0.17 

Val 

8 

A 

A 






-2.61 

1.16 

* 



-0.60 

0.17 

Leu 

9 

A 

A 






-1.97 

1.16 

* 



-0.60 

0.11 

Ser 

10 

A 

A 






-1.97 

0.77 

* 



-0.60 

0.26 

Leu 

11 


A 

B 





-2.01 

0.77 

* 



-0.60 

0.28 

Leu 

12 


A 

B 





-1.50 

0.56 

* 



-0.60 

0.34 

Lys 

13 


A 

B 





-0.99 

0.26 

* 


F 

-0.15 

0.34 

Leu 

14 


A 





c 

-0.18 

0.30 



F 

0.05 0.41 

Gly 

15 





T 

T 


-0.17 

0.01 

* 


F 

0.65 0.86 

Ser 

16 






T 

c 

0.64 

0.24 

* 


F 

0.45 0.45 

Gly 

17 






T 

c 

0.60 

0.64 

* 


F 

0.15 0.95 

Gln

18 



B 



T 


-0.14 

0.60 



F 

-0.05 

0.71 

Trp 

19 



B 

B 




0.32 

0.96 




-0.60 

0.46 

Gln

20 



B 

B 




0.46 

1.00 


* 


-0.60 

0.46 

Val 

21 



B 

B 




0.76 

1.00 




-0.60 

0.41 

Phe 

22 



B 

B 




1.14

0.60 


* 


-0.30 

0.65 

Gly 

23 






T 

c 

0.93 

-0.31 




1.50 0.75 

Pro

24 





T 

T 


0.37 

-0.29 



F 

2.30 1.57 

Asp 

25 






T 

c 

0.37 

-0.29 



F 

2.40 1.34 

Lys 

26 






T 

c 

0.63 

-0.67 

* 


F 

3.00 2.35 

Pro 

27 




B 



c 

0.52 

-0.60 

* 


F 

2.30 1.54

Val 

28 



B 

B 




0.01 

-0.34 

* 



1.20 0.76 

Gln

29 



B 

B 




-0.12

0.30 

* 



0.30 0.28 

Ala 

30 



B 

B 




-0.12 

0.73 




-0.30 

0.18

Leu 

31 



B 

B 




-0.17 

0.30 




-0.30 

0.42 

Val 

32 



B 

B 




-0.54 

-0.34 




0.30 0.41 

Gly

33 

A 



B 




-0.28 

-0.24 



F 

0.45 0.41 

Glu 

34 

A 

A 






-0.98 

-0.24 



F 

0.45 0.50 

Asp 

35 

A 

A 






-0.69 

-0.14 



F 

0.45 0.58 

Ala 

36 

A 

A 






-0.54 

-0.40 




0.30 0.79 
Ala

37 

A 

A 


B 




-0.39 

-0.26 



[j.jv 

Phe 

38 

A 

A 


B 




-0.86 

0.53 



-0.60 0.13 

Ser 

39 

A 

A 


B 




-1.16 

1.21 




Cys 

40 

A 

A 


B 




-1.37 

1.10 



-u.ol) U.14 

Phe 

41 

A 

A 


B 




-0.73 

1.03 



-0.60 0.24 

Leu 

42 


A 


B 



C 

-0.46 

0.24 



-0.10 0.36 

Ser 

43 






T 

C 

0.24 

0.34 


F 

0.45 0.98 

Pro 

44 






T 

C 

-0.04 

0.17 


F 

0.60 1.82 

Lys 

45 






T 

C 

0.62 

-0.11 


F 

1.20 2.23 

Thr 

46 

A 





T 


0.73 

-0.80 


F 

1.30 2.88 

Asn 

47 

A 

A 






0.94 

-0.69 


F 

0.90 1.88 

Ala 

48 

A 

A 






1.24 

-0.50 



0.30 0.93 

Glu 

49 

A 

A 






0.60 

-0.50



0.45 1.12 

Ala 

50 

A 

A 






0.67 

-0.34 



0.30 0.52 

Met 

51 

A 

A 






0.28 

-0.74 



0.60 1.00 

Glu 

52 

A 

A 






-0.42 

-0.46 



0.30 0.50 

Val 

53 

A 

A 






0.28 

0.33 



-0.30 0.43 

Arg 

54 

A 

A 






-0.07 

-0.17 



0.30 0.85 

Phe 

55 

A 

A 






0.52 

-0.36



0.30 0.48

Phe 

56 

A 





T 


0.42 

0.04 



0.25 1.13 

Arg 

57 

A 





T 


0.12 

0.19 



0.10 0.50 

Gly 

58 





T 

T 


0.68 

0.57 


F 

0.35 0.77 

Gln

59 





T 

T 


-0.29 

0.17 


F 

0.80 1.20 

Phe 

60 




B 



C 

-0.44 

0.03 


F 

0.05 0.45 

Ser 

61 




B 



c 

0.22 

0.67 


F 

-0.25 0.34 

Ser 

62 



B 

B 




-0.70 

0.74 



-0.60 0.27 

Val 

63 



B 

B 




-0.60 

1.03 



r\ JTfh c\ n c 

-U.oU u./d 

Val 

64 



B 

B 




-0.49 

1.00 



-U.OU OoU 

His 

65 



B 

B 




0.21 

0.61 



-0.26 0.43 

Leu 

66 



B 

B 




0.17 

0.23 



nion no 

u.jo u.ys 

Tyr 

67 



B 



T 


0.51 

0.01 



1.27 1.30 

Arg 

68 





T 

T 


1.37 

-0.63 


F 

3.06 1.92 

Asp 

69 





T 

T 


2.22 

-1.13 

* 

F 

3.40 3.88 

Gly 

70 





T 

T 


2.04 

-1.41 

* 

F 

3.06 4.29 

Lys 

71 





T 



2.16 

-1.74 

* 

F 

2.52 3.39 

Asp 

72 







c 

1.80 

-0.96 


F 

1.98 1.76 

Gln

73 







c 

1.69 

-0.34 


F 

1.34 1.76 

Pro 

74 



B 





1.09 

-0.37 


F 

0.80 1.52 

Phe 

75 



B 





1.22 

0.24 



-0.10 0.90 

Met 

76 



B 





1.18 

0.67 



-0.40 0.80 

Gln

77 



B 





0.93 

0.67 



-0.40 0.90 
0.09 2.85
T

1.61 

0.43 



0.98 1.63 

Tyr 

81 





T T 

1.90 

0.03 




Gln

82 

A 




T 

1.94 

-0.10 


F 

2.36 3.01 

Gly 

83 





T T 

1.73 

-0.53 


F 

3.40 3.48 

Arg 

84 



B 


T 

-1.09 

-0.24 


F 

2.36 1.83 

Thr 

85 



B 



1.13 

-0.36 


F 

1.90 0.78

Lys 

86 



B 



1.38 

-0.76 


F 

2.24 1.59 

Leu 

87 



B 



1.08 

-1.19 


F 

2.13 1.35 

Val 

88 



B 


T 

0.53 

-0.80 


F 

2.22 1.26 

Lys 

89 



B 


T 

-0.17 

-0.60 


F 

2.30 0.44 

Asp 

90 



B 


T 

0.14 

-0.10 


F 

1.77 0.54

Ser 

91 



B 


T 

-0.24 

-0.79 



1.34 1.26 

Ile

92 

A 

A 




0.68 

-1.00 



1.06 0.62 

Ala 

93 

A 

A 




0.64 

-1.00 


F 

0.98 0.73 

Glu 

94 

A 

A 




0.30 

-0.31 


F 

0.45 0.38 

Gly 

95 

A 

A 




-0.51 

-0.31 


F 

0.45 0.73

Arg 

96 

A 

A 




-0.10 

-0.31 


F 

0.45 0.60 

Ile

97 

A 

A 




-0.02 

-0.81 


F 

0.75 0.67 

Ser 

98 

A 

A 




0.57 

-0.13 



0.30 0.56 

Leu 

99 

A 

A 




0.57 

-0.56 



0.60 0.50 

Arg 

100 

A 

A 




0.02 

-0.16 



0.45 1.14 

Leu 

101 

A 

A 




-0.40 

-0.16 



0.30 0.60 

Glu 

102 


A 

B 



-0.37 

-0.06 



0.45 1.04 

Asn 

103 


A 

B 



-0.88 

-0.10 



0.30 0.40 

Ile

104 


A 

B 



-0.07 

0.59 



-U.Olf U.40 

Thr 

105 


A 

B 



-0.77 

-0.10 



n in ft ifl 
0.3U U.jo 

Val 

106 


A 

B 



-0.30 

0.40 



_n An ft ")A 

Leu 

107 


A 

B 



-1.11 

0.43



ft An n 

Asp 

108 


A 

B 



-1.36 

0.43 



ft Aft ft tft 

Ala 

109 


A 

B 



-0.81 

0.70 



-U.dU U.41 

Gly 

110 





T 

-1.17 

0.49 

* 


0.00 0.49 

Leu 

111 



B 


T 

-0.20 

0.37 

* 


0.10 0.16 

Tyr 

112 



B 


T 

-0.28 

0.37 

* * 


0.10 0.30 

Gly 

113 



B 


T 

-0.58

0.56 

* *


-0.20 0.22 

Cys 

114 



B 


T 

-6.29 

0.51 

* * 


-0.20 0.35 

Arg 

115 



B 

B 


0.06 

0.21 

* * 


-0.30 0.30 

Ile

116 



B 

B 


0.57 

-0.14 

* * 

F 

0.45 0.52 

Ser 

117



B 

B 


0.57 

-0.19 

* * 

F 

0.76 1.31 

Ser 

118 



B 


T 

0.67 

0.00 

* * 

F 

1.32 1.05 
T


1.33 

0.76 


* 

p 

0.58 2.35 


Ser 

120 





T 

T 


1.27 

0.47 


* 

F 

1.14 3.03 


Tyr 

121 





T 

T 









Tyr 

122 


A 



T 



0.98 

0.20






Gln

123 


A 

B 





0.99 

0.49 




U.U3 I. its 


Lys 

124 


A 

B 





0.99 





-0.28 0.93


Ala 

125 


A 

B 





0.48 

0.26 




0.01 1.02 


Ile

126 


A 

B 





0.72 







Trp

127 


A 

B 





0.11

0.19 




ft 1f\ ft yll 


Glu

128 

A 

A 






-0.19 

0.83 




-U.OU U.Jl 


Leu 

129 

A 

A 






-0.82 

0.71 




n £.t\ ft «o 


Gln

130 


A 

B 





-1.04 

0.53 




-0.60 0.57 


Val 

131 


A 

B 





-0.50 

0.30 






Ser 

132 


A 





C 

-0.51 

0.73 




-U.4U U.J 3 


Ala 

133 


A 





C 

-1.37 

0.43 




n An n 


Leu 

134 


A 

B 





-0.77 

0.67 




-U.OU 


Gly

135 


A 






-1.58 

0.46 






Ser 

136 



B 

B 




-1.61 

0.76 




-U.OU 


Val 

137 



B 

B 




-1.61 

0.94 




-U.OU U.ZU 


Pro 

138 



B 

B 




-1.91 

0.64 




n Aft ft n 
-U.OU XJ.J. S 


Leu 

139 



B 

B 




-1.69 

0.90 




r\ £i[\ ft \ A 

-U.OU U.14 


Ile

140 



B 

B 




-1.69 

1.01 




t\ tin n in 
-U.OU U. IV 


Ser 

141 



B 

B 




-1.63 

0.80 




-U.OU U.1Z 


Ile

142 



B 

B 




-1.63 

1.13 




ft ft. OA 
-U.OU U./4 


Ala 

143 



B 

B 




-1.42 

1.09




ft f.f\ CI n 


Gly 

144 



B 

B 




-0.50 







Tyr 

145 



B 

B 




0.39

0.01 




0.22 0.87


Val 

146 



B 

B 




-0.20 

-0.67 






Asp 

147 



B 



T 


0.69 





2.04 1.02


Arg 

148



B 



T 


0.47 

-0.51

* 


p 

2.60 1.13 


Asp 

149 



B 



T 



-0.59

* 


p 

2.34 1.25 


Ile

150 



B 





-0.42 

-0.54 

* 



1.78 0.62 


Gln

151 


A 

B 





0.43 

0.03 

* 



0.22 0.17 


Leu 

152 


A 

B 





0.13 

0.43




-0.34 0.18 


Leu 

153 


A 

B 





-0.28 

0.81 

* 



-0.60 0.34 


Cys 

154 


A 

B 





-0.62 

0.51 


* 


-0.60 0.26 


Gln

155 


A 



T 



-0.02 

0.54 

* 

* 

F 

-0.05 0.31 


Ser 

156 





T 

T 


-0.72 

0.77 



F 

0.35 0.40 


Ser 

157 





T 

T 


-0.12 

0.87 

* 


F 

0.35 0.64 


Gly 

158 





T 

T 


0.80 

0.73 



F 

0.35 0.57 


Trp 

159 





T 

T 


1.26 

0.33 

* 


F 

0.65 0.84 
Phe

160 




T 

c 

0.94 

0.37 

* 


F 

0.45 0.97 

Pro 

161 




T 

c 

0.66 

0.47 

*

* 

F 

0.30 1.41 

Arg 

162 




T 

c 

1.00 

0.54 

* 

* 

F 

0.30 1.36 

Pro 

163 



T 

T 


1.06 

-0.37 


* 

F 

1.40 3.13 

Thr 




T 



1.39 

-0.24 

* 

* 

F 

1.20 2.13

Ala 




T 



1.74 

-0.67 

* 

* 

F 

1.50 2.17 

Lys 

166 



T 



1.74 

-0.24 


* 

p 

1.20 1.39 

Trp 

167 






1.63 

-0.24 


* 

p 

1.54 1.49 

Lys 

168 






1.50 

-0.33 


* 

F 

1.68 2.55

Gly 

169 





Q 

1.81 

-0.40 



p 

2.02 1.26

Pro 

170 





c 

2.40 

0.00 


* 

F 

1.96 2.08 

Gln

171 



T 

T 


1.54 

-0.91 


* 

F 

3.40 1.74 

Gly 

172 






1.53 

-0.23 



F 

2.56 1.45 

Gln

173 


B 




1.18 

-0.27 



p 

2.02 1.26 

Asp 

174 






1.52 

-0.21 



p 

1.82 1.05 

Leu 

175 






1.43 

-0.61 

* 

* 

F 

2.12 1.77 

Ser 

176 


B 




1.54 

-0.66 

* 

* 

p 

2.32 1.37 

Thr 

177 


B 





-1.06 

* 

* 

p 

2.66 1.60 

Asp 

178 




T 


1.58 

-0.57 

* 

* 

p 

3.40 2.80 

Ser 

179 




T 


1.69

-0.86

* 

* 

p 

2.86 3.37 

Arg 

180 



T 

T 




* 

* 

p 

3.06 4.57 

Thr 

181 



T 

T 


2.20 

-1.73 

* 


p 

3.06 4.57 

Asn 

182 



T 

T 



-1.11 

* 


p 

3.06 3.37 

Arg 

183 


B 




2.13 

-1.00 

* 


F 

2.66 2.34 

Asp 

184 



T 

T 


1.62

-0.57 

* 


p 

3.40 1.61 

Met 

185 




T 


0.81 

-0.37 

* 



2.06 0.82 

His 

186 




T 


1.12 

0.01 

* 



1.12 0.36 

Gly 

187 




T 


0.27 

0.01 

* 

* 


0.78 0.36 

Leu 

188 






0.16 

0.66 

* 

* 


-0.26 0.27 

Phe 

189 A 





-0.73 

0.04 


* 


-0.30 0.35 

Asp 

190 A 


B 



-0.43 

0.23 


* 


-0.30 0.25 

Val 

191 A 


B 



-1.21 

0.19 


* 


-0.30 0.40 

Glu 

192 A 


B 



-1.18 

0.19 


* 


-0.30 0.38 

Ile

193 A 


B 



-1.22 

-0.11 


*


0.30 0.33 

Ser 

194 A 


B 



-0.52 

0.53 


* 


-0.60 0.33

Leu 

195 A A 


B 



-0.52

0.29 


* 


-0.30 0.33 

Thr 

196 A A 


B 



0.33 

0.29 




-0.30 0.81 

Val 

197 A A 


B 



-0.26 

0.00 




0.55 0.98 

Gln

198 A A 


B 



0.29 

0.11 


* 

F 

0.50 1.20 

Glu 

199 

\ A 


B 



0.29 

-0.14 



F 

1.20 0.82 

Asn 

200 



T 

T 


0.21 

-0.24 



F 

2.40 1.48 
Ala

201 



T 

T 

0.22 

-0.20 


F 

2.50 0.60 


202 



T 

T 

0.41 

-0.21 


F 

2.25 0.46 


203 



T 

T 

0.11 

0.36 


F 

1.40 0.15 


204 





-0.49

0.34 

* * 


0.40 0.21 

Ser 

205 





-0.38 

0.46 

* * 


-0.15 0.21 

Cys 

206 


B 



0.18 

0.03 

* * 


-0.10 0.30

Ser 



A B 



-0.07 

0.14 

* * 


-0.30 0.58 

Met 

208 

A 




0.20 

-0.04 

* *


0.30 0.44 

Arg 

209 





0.28 

0.07 

* * 


-0.15 1.11 

His 

210 

A 




0.28 

0.19 



-0.30 0.69 

Ala 

211 

A 




1.06 

0.19 



-0.30 0.93 

His 

212 





1.36 

-0.43 

* 


0.30 0.93 

Leu 

213 

A 




1.10 

-0.43 

* 


0.45 1.18 

Ser 

214 





0.99 

-0.29 

* 


0.30 0.87

Arg 

215 


A 



0.72 

-0.79 

* * 

F 

0.90 1.10 

Glu 

216 

A 

A 



1.42 

-0.90 

* * 

F 

0.90 1.79 

Val 

217 



B 


0.60 

-1.59 

* * 

F 

0.90 2.62 

Glu 

218

A 


B 


1.41 

-1.33 

* * 

F 

0.75 0.99 

Ser 

219 

A 


B 


0.82 

-0.93 

* * 

F 

0.75 0.99 

Arg 

220 


B 



0.37 

-0.24 

* * 

F 

0.45 0.94 

Val 

221 


B 

B 


0.37 

-0.46 

* 

F 

0.45 0.54 

Gln

222 

A 


B 


0.93 

-0.46 

* 


0.64 0.67 

Ile

223 

A 



T 

1.04 

0.07 

* * 


0.78 0.36 

Gly 

224 

A 



T 

1.46 

0.07 

* * 

F 

1.27 0.95 

Asp 

225 



T 

T 

1.39 

-0.57 

* 

F 

3.06 1.07 

Trp 

226 



T 

T

2.21 

-0.97 


F 

3.40 3.06 

Arg 

227 





1.87 

-1.16 

* 

F 

2.46 4.20

Arg 

228 



T 

T 

2.76 

-1.16 

* 

F 

2.72 2.49 

Lys 

229 



T

T 

2.51 

-0.76 

* 

F 

2.38 4.10 

His 




T 

T 

2.17 

-1.17 

* 

F 

2.38 2.12 

Gly 

231 




T 

C 2.50 

-0.74 

* 

F 

2.18 1.07 

Gln

232 



T 


2.50

-0.74 

* 

F 

2.52 1.07 

Ala 

233 





C 2.43 

-0.74 

* 

F 

2.66 1.54 

Gly 

234 



T 

T 

2.14 

-1.24 

* 

F 

3.40 3.11 

Lys 

235 


B 


T 

1.88 



p 

2.66 2.81 

Arg 

236 



T 

T 

1.92 

-0.93 

* 

F 

2.77 3.73 

Lys 

237 



T 

T 

1.62

-1.04 

* 

F 

2.48 5.05 

Tyr 

238 


B 


T 

2.18 

-1.09 


F 

1.79 3.39 

Ser 

239 


B 


T 

1.63 

-0.59 


F 

1.50 2.35 

Ser 

240 


B 


T 

1.34 

0.10 


F 

0.50 0.82 

Ser 

241 


B 


T 

1.23 

0.86 


F 

0.15 0.82 
0.10



0.20 1.03 

Ile

243 








0.43 

0.10 



0.15 1.03 

Tyr

244 



B 





0.52 

0.50 



-0.35 0.66 


245 








0.52 

0.54 



-0.40 0.75 

Ser 

246 



B 





0.01 

0.43 



F -0.10 1.44 

Phe 




B 



T 


-0.26 

0.43 



F -0.05 0.76 

Pro 

248 






T 

C 

-0.07 

0.06 



F 0.45 0.61 


249 






T 

C 

-0.42 

0.84 



F 0.15 0.39 








T 

c 

-0.42 

1.07 



0.00 0.45 

Ser 

251 



B 





-0.82 

0.29 



-0.10 0.49 

Phe 

252 



B 

B 




-0.37 

0.64 



-0.60 0.31 


253 



B 

B 




-1.04 

1.01 



-0.60 0.60 

Asp 

254 



B 

B 




-1.56 

1.01 



-0.60 0.31 

Phe 

255 



B 

B 




-0.63 

1.31 



-0.60 0.30 

Tyr 

256 



B 

B 




-0.54 

0.53 



-0.60 0.59 

Ile

257 



B 

B 




-0.70 

0.34 



-0.30 0.54 

Leu 

258 



B 

B 




-0.44 

0.99 



-0.60 0.47 

Arg 

259 



B 

B 




-0.66 

0.63 



-0.35 0.29 

Pro 

260 




B 

T 



-0.62 

0.30 



F 0.75 0.65 

Val 

261 




B 

T 



-0.27 

0.19 

* 

* 

F 1.00 0.42 

Gly 

262 






T 

c 

0.03 

-0.50 

* 

* 

F 2.35 0.42 

Pro 

263 





T 

T 


0.89 

0.00 

* 

* 

F 2.50 0.28 

Cys 

264 



B 



T 


-0.03 

-0.43 

* 

* 

F 1.85 0.74

Arg 

265 



B 



T 


-0.68 

-0.39 



1.45 0.62 

Ala 

266 


A 

B 





-0.42 

-0.17 


* 

0.80 0.30 

Lys 

267 


A 

B 





-0.42

0.01 



-0.05 0.55 

Leu 

268 


A 

B 





-0.52 

-0.13 



0.30 0.28 

Val 

269 


A 

B 





-0.67 

0.36 



-0.30 0.40 


270 

A 

A 






-0.73 

0.54 



-0.60 0.16 

Gly 

271 

A 

A 






-0.96 

0.54 



-0.60 0.40 

Thr 

272 

A 

A 






-1.00 

0.54 



-0.60 0.44 

Leu 

273 

A 

A 






-1.08 

0.30 


* 

-0.30 0.77 

Lys 

274 

A 

A 






-1.03 

0.37 


* 

-0.30 0.55 

Leu 

275 

A 

A 






-0.78 

0.63 


* 

-0.60 0.31 

Gln

276 

A 

A 






-0.43 

0.57 


* 

-0.60 0.37 

Ile

277 


A 

B 





-0.98 

-0.11 


* 

0.30 0.32 

Leu 

278 

A 

A 






-0.20 

0.53 


* 

-0.60 0.29 

Gly 

279 

A 

A 






-0.94 

0.34 



-0.30 0.23 

Glu 

280 

A 

A 






-0.99 

0.73 


* 

-0.60 0.28 

Val 

281 

A 

A 






-0.99 

0.69 

* 

* 

-0.60 0.26 

His 

282 

A 

A 






-0.06 

0.00 

* 


0.30 0.45 
0.54

-0.43 



0.30 0.52 

Val 

284 








0.86 

0.00 

* 


0.45 1.07 



A 







0.56 

-0J4 


p 

0.60 1.07 

Lys 

286 

A 





T 


0.60 

-0.26 

* 

p 

1.00 1.66 

Pro 

287 








-0.18 

-0.36 

* 

p 

1.00 1.85 

His 

288 






T 


0.52

-0.31 

* 

p 

n d« n se 
U.OJ u.oo 

Ser 

289 

A 





T 


0.49 

0.09



0.10 0.76 

Leu 

290 

A 



B 








-0.60 0.35 

Leu 

291 



B 

B 




-0.20 

0.73 



-U.OU 

Gln

292 



B 

B 




-0.33 




n fin n 

Ile

293 



B 

B 








n ak ft 

Ser 

294 



B 



T 






0.25 0.49

Gly 

295 





T 

T 


-0.11




0.65 0.41 

Gly 

296 





T 

T 


-0.11

0.29 


F 

n ft &.& 

Ser 

297 






T 

C 

-0.07 





Thr 

298 



B 

B 








0.90 1.04

Thr 

299 



B 

B 




0.82 

-0.53 


F 


Leu 

300 



B 

B 




0.96 

-0.53 


F 

1.80 1.56 

Lys 

301 




B 

T 



1.30

-0.49 


F 

2.20 1.67 

Lys 

302 





T 



1.39 

-0.57 


F 

3.U0 l.oO 

Gly 

303 






T 

c 

1.41 

-0.63 


F 

2.70 3.49 

Pro 

304 






T 

c 

1.42

-0.40 




Asn 

305 






T 

c 

1.53 

-0.01 


F 

1.80 1.23 

Pro 

306 





T 

T 


1.28 

0.77 


F 

U.oU l.Uo 

Trp 

307 





T 



0.93 

0.77 




Ser 

308 







c 





-0.20 0.90 

Phe 

309 



B 





0.61 

0.76


p 

n-is n on 

Pro 

310 



B 





0.02 

0.90 


F 

-0.25 0.46 

Ser 

311 






T 

c 

-0.58 

0.49 


F 

0.15 0.34 

Pro 

312 





T 

T 


-0.99 

0.79 


F 

0.35 0.33 

Cys 

313 





T 

T 


-0.90 

0.79 



0.20 0.18 

Ala 

314 



B 


T 

T 


-0.51 

0.79 



0.20 0.21 

Leu 

315 



B 





-0.69 

0.89 



-0.40 0.20 

Phe 

316 



B 





-0.78 

0.89 



-0.40 0.47 

Pro 

317 



B 





-0.96 

0.74 



-0.40 0.60 

Thr 

318 



B 





-0.68 

0.67 



-0.40 0.93 

Ter 

319 



B 





-0.48 

0.41 



-0.25 1.37 
Table IX


Res Position 

I

II 

III 

IV 

V 

Met 

1 



B 



Ala 

2 



B 



Gly 

3 



B 



Ile

4 



B 



Pro 

5 



B 



Gly 

6 



B 

B 


Leu 

7 



B 

B 


Leu 

8



B 

B 


Phe 

9 



B 

B 


Leu 

10 



B 

B 


Leu 

11 



B 

B 


Phe 

12 



B 

B 


Phe 

13 



B 

B 


Leu 

14 



B 

B 


Leu 

15 



B 

B 


Cys 

16 



B 

B 


Ala 

17 



B 

B 


Val 

18



B 

B 


Gly 

19 



B 

B 


Gln

20 



B 

B 


Val 

21 



B 

B 


Ser 

22 



B 



Pro 

23 



B 



Tyr 

24 





T 

Ser 

25 






Ala 

26 






Pro 

27 





T 

Trp

28 





T 

Lys 

29 






Pro 

30 





T 

Thr 

31 





T 

Trp

32 



B 



Pro 

33 



B 



Ala 

34 





T 

Tyr 

35 





T 

Arg 

36 



B 




VI 

VII 

VIII 

IX 

X 

XI 

XII 

XIII XIV 



-0.69 

0.51 




-0.40 0.26 



-0.64 

0.51 




-0.40 0.31 



-1.07 

0.51 




-0.40 0.24 



-1.49 

0.77 




-0.40 0.20 



-1.80 

0.84 



F 

-0.25 0.17 



-2.01 

1.13 




-0.60 0.14 



-2.23 

1.39 




-0.60 0.17 



-2.59 

1.39 




-0.60 0.09 



-2.40 

1.74 




-0.60 0.08 



-3.00 

2.10 




-0.60 0.08 



-3.47 

2.10 




-0.60 0.08 



-3.32 

2.10 




-0.60 0.08



-3.10 

1.89 




-0.60 0.05 



-3.26 

1.70 




-0.60 0.06 



-2.79 

1.66 




-0.60 0.05 



-1.98 

1.30 




-0.60 0.06 



-2.13 

0.91 




-0.20 0.13 



-1.73 

0.87 




-0.60 0.12 



-1.13 

0.57 




-0.60 0.29 



-0.57 

0.43 



F 

-0.45 0.45 



-0.20 

0.69 



F 

-0.45 0.95 

T 


-0.20 

0.43 



F 

0.10 1.28 

T 


0.44 

0.50 



F 

-0.05 0.75 

T 


0.50 

0.53 


* 


0.35 1.56 

T 

C 

0.54 

0.80 


* 


0.15 1.22 


c 

1.19 

0.41 




-0.05 1.58 



1.18 

0.41 


* 


0.15 1.56



1.10 

0.14 


* 


0.45 1.68 

T 

c 

1.13 

0.67 


* 

F 

0.30 1.75 

T 


0.84 

0.60 


* 

F 

0.50 1.75 

T 


1.19 

0.67 

* 

* 

F 

0.50 1.68 

T 


1.51 

0.51 

* 

* 


-0.05 1.32 

T 


0.99 

0.51 


* 


-0.05 1.67

T 


0.73 

0.77 

* 

* 


0.20 0.95 

T 


0.09 

0.71 




0.35 1.40 

T 


-0.46 

0.44 


* 


-0.20 0.67 
B

Q 
B



Gln

43 



B 



Ser 

44 



B 



Thr 

45 






Leu 

46 



B 



Asn 

47 



B 



Leu 

48 



B 



Ala 

49 






Lys 

50 






Pro 

51 






Asp 

52 






Phe 

53 



B 



Gly

54 

A 

A 




Ala 

55 

A 

A 




Glu 

56 

A 

A 




Ala 

57 

A 

A 




Lys 

58 

A 

A 




Leu 

59 


A 

B 



Glu 

60 


A 

B 



Val 

61 


A 

B 



Ser 

62 





T 

Ser 

63 





T 

Ser 

64 





T 

Cys 

65 





T 

Gly 

66 






Pro 

67 





T 

Gln

68 





T 

Cys 

69 






His 

70 





T 

Lys 

71 





T 

Gly 

72 





T 

Thr 

73 






Pro 

74 






Leu 

75 






Pro 
-0.98

0.66 


* 


-0.60 0.49 



-0.98 

0.84 


* 


-0.60 0.26 



-0.39 

0.51 

* 

* 


-0.60 0.21 



-0.44 

0.91 


* 


-0.60 0.43 



-0.87 

0.61 

* 

* 

F 

-0.45 0.37 

T 


-0.87 

0.67 



p 

-0.05 0.73 

T 


-0.66

0.71


* 

p 

-0.05 0.81 

T 



0.47



p 

0.10 1.57 

T 


-0.34 

0.47

* 


p 

-fi n ft4 

- U*uJ Uilrr 




0.54

* 


p 

-0.25 0.49 




0.14




0.14 0.73 



0.51 

0.19

* 

* 


n 0 7ft 

V.JO v. f o 




-0.30




1.37 1.59 

T 

C 

0.08

-0.20



p 


T 

c 


-0.17



p 

2.40 1.02 

T 

c 

0.30 

-0.36


* 

F 

2.16 1.02 

T 


0.52





1.72 0.89



1.16 



* 


ft 7R ft 1R 



0.30 

-0.79 




ft 5td ft AO. 



0.51 





ft "Wl ft fifi 



-0.34 

-0.89 



p 

0.90 1.16



0.06 

-0.67 



p 

0.75 0.85



0.10 

-0.79 




n an ft 




-0.40 


* 


0.30 0.87




-0.51


* 

p 

1.00 0.58




0.06 


* 

p 

0.75 0.38 

T 


-0.29

-0.20 



p 

2.00 0.22 

T 



0.23 



p 

1.65 0.45

T 



-0.01 



p 

2.50 0.58 

T 


0.68

0.17 



p 

1.45 0.23 



1.02

0.29 



F 

1.45 0.24 



0.98 

-0.10 




1.90 0.89



0.97 

-0.24 




1.50 0.89 

T 


1.42 

-0.19 



F 

2.25 0.83 

T 


0.96 

-0.19 



F 

2.50 0.74 

T 


0.96 

0.10 



F 

1.80 1.14 

T 

c 

0.64 

-0.04 



F 

1.95 1.29 


c 

1.07 

-0.06 



F 

1.35 0.93 


c 

1.10 

0.70 



F 

0.61 1.48 

T 

c 

1.06 

0.27 



F 

1.12 1.77 

T 


0.81 

-0.21 



F 

1.78 1.98 
Tyr

78 


B 



T 


1.17 

-0.14 



F 

2.04 2.43 

Glu 

79 


B 



T 


1.38 

-0.83 



F 

2.60 3.14 

Glu 

80 

A 

B 





1.94 

-0.86 



F 

1.94 3.77 

Ala 

81 A 

A 






1.34 

-0.59 



F 

1.68 3.77 

Lys 

82 

A 

B 





1.36 

-0.66 



F 

1.42 1.79 

Gln

83 

A 

B 





1.36 

-0.27 



F 

0.86 1.39 

Tyr 

84 

A 

B 





1.36 

0.49 




-0.45 2.15 

Leu 

85 


B 

B 




1.04 

-0.01 




0.45 1.87 

Ser 

86 


B 

B 




0.82 

0.47 




-0.45 1.55 

Tyr 

87 

A 

B 

B 




0.53 

0.76 




-0.60 0.82 

Glu 

88 

A 

B 

B 




-0.06 

0.76 




-0.45 1.55 

Thr 

89 

A 

B 

B 




0.19 

0.57 




-0.45 1.17 

Leu 

90 

A 

B 

B 




0.66 

0.59 




-0.45 1.20 

Tyr 

91 


B 



T 


0.66 

0.26 




0.36 0.69 

Ala 

92 





T 

C 

1.01 

0.64 




0.52 0.64 

Asn 

93 





T 

C 

0.70 

0.16 



F 

1.38 1.52 

Gly 

94 





T 

C 

1.01 

-0.04 



F 

2.24 1.40 

Ser 

95 






C 

1.51 

-0.80 



F 

2.60 2.39 

Arg 

96 



B 



C 

1.76 

-0.81 



F 

2.14 2.15 

Thr 

97 


B 

B 




1.49 

-0.81 

* 


F 

1.68 3.76 

Glu 

98 


B 

B 




1.14 

-0.60 



F 

1.42 2.08 

Thr 

99 


B 

B 




0.60 

-0.56 



F 

1.16 1.05 

Gln

100 


B 

B 




0.66 

0.13 

* 

* 

F 

-0.15 0.51 

Val 

101 


B 

B 




-0.34 

0.40 




-0.60 0.46 

Gly 

102 


B 

B 




-0.84 

1.09 




-0.60 0.22 

Ile

103 


B 

B 




-1.14 

1.29 




-0.60 0.11

Tyr 

104 


B 

B 




-1.13 

1.27 




-0.60 0.19 

Ile

105 


B 

B 




-1.43 

1.01 




-0.60 0.26 

Leu 

106 


B 

B 




-0.92 

0.97 




-0.35 0.50 

Ser 

107 


B 



T 


-0.58 

0.71 



F 

0.45 0.32 

Ser 

108 





T 

C 

-0.03 

-0.04 



F 

t on n tc l 
I.BU 0. tZ> 

Ser 

109 





T 

c 

-0.38 

-0.30 



F 

2.05 0.90 

Gly 

110 




T 

T 


0.51 

-0.49 

* 


F 

2.50 0.68 

Asp 

111 

A 



T 



1.29

-0.47 



F 

1.85 0.88 

Gly 

112 

A 





c 

1.70 

-0.36 



F 

1.40 0.89 

Ala 

113 

A 

B 





2.00 

-0.74 



F 

1.40 1.77 

Gln

114 

A 

B 





2.00 

-1.17 




1.34 1.77 

His 

115 

A 

B 





2.00 

-0.79 



F 

1.58 2.39 

Arg 

116 

A 

B 





1.70 

-0.79 



F 

1.92 2.34 

Asp 

117 




T 

T 


1.74 

-0.90 



F 

3.06 1.81 

Ser 

118 




T 

T 


1.99 

-0.91 


* 

F 

3.40 1.79 
119



T 

T 


2.03 

-0.99 

* 

F 

2.91 0.90 

Ser 

120 



T 

T 


1.77 

-0.99 

* 

F 

3.02 1.08 


121 





c 

1.77 

-0.60 


F 

2.58 1.08 

Gly 

122 




T 

c 

1.88 

-0.99 

* 

F 

2.74 2.14 

Lys 

123 




T 


2.22 

-1.41 


F 

2.90 3.13 

Ser 





T 

Q 

2.68 

-1.80 


F 

3.00 4.66 

Arg 

125 




T 


2.98 

-2.19 


p 

2.50 9.23 

Arg 

126 

B 





2.39

-2.21 


p 

2.00 7.99 

Lys 

127 

B 

B 




2.49 

-1.53 


F 

1.50 4.18 

Arg 

128 

B 

B 





-1.16



1.20 3.35

Gln

129 

B 

B 





-0.73 



0.75 1.69 

Ile

130 

B 

B 





0.03



-0.15 1.32 

Tyr 

131 

B 

B 





0.03 



-0.15 1.13 

Gly 

132 






1.70

0.41 

* * 


-fi an 0 R7 

Tyr 

133 

B 





0.89

0.01 

* * 


0.05 2.44 

Asp 

134 

B 





0.59

0.11

* 

p 

0.40 1.35 

Ser 

135 

B 



T 


0.59 




1.00 1.83

Arg 

136 

B 



T 







Phe 

137 

B 



T 



0.03



\r. IV U.Hi 

Ser 

138 

B 

B 








-0.47 0.31

Ile

139 

B 

B 






* * 


-0.04 0.32 

Phe 

140 

B 

B 




0.02 




n no ft f>> 

Gly 

141 



T 

T 


-0.90 





Lys 

142 



T 

T 


-1.01 

0.37



1.30 0.47 

Asp 

143 



T 

T 



0.37


p 

1.17 0.45 

Phe 

144 

B 





-0.07

-0.01 



1.09 0.73 

Leu 

145 

B 





0.42

0.31 



0.16 0.57 

Leu 

146 

B 





0.07 

0.74 



-0.27 0.53 

Asn 

147 






-0.28 

1.53 



-0.40 0.53 

Tyr 

148 




T 


-0.59 

1.13 

* 


0.00 0.85 

Pro 

149 



T 

T 


-0.19 

0.93 

* 


0.35 1.50 





T 

T 


-0.23 

0.63 

* * 


0.35 1.25 

Ser 

151 

B 



T 


0.62 

0.87 

* * 

F 

-0.05 0.59 

Thr 

152 

B 

B 




-0.19 

0.11 

* 

F 

-0.15 0.76 

Ser 

153 

B 

B 




-0.24 

0.37 

* 

F 

-0.15 0.73 

Val 

154 

B 

B 




-0.34 

-0.03 

* 

F 

0.45 0.73

Lys 

155 

B 

B 




0.01 

0.07 

* 

F 

-0.15 0.73 

Leu

156 

B 

B 




-0.36 

0.01

* 

F 

-0.15 0.54 

Ser 

157 

B 



T 


-0.36 

0.20 

* 

F 

0.25 0.39 

Thr 

158 

B 



T 


-0.40 

0.04 

* 

F 

0.25 0.28 

Gly 

159 



T 

T 


0.14 

0.47 

* 

F 

0.35 0.34 
Cys

160 





T 

T 

-0.71 

0.27 



F 

0.65 0.36 

Thr 

161 



B 




-0.76 

0.57 



F 

-0.25 0.21 

Gly 

162 



B 




-1.04 

0.73 



F 

-0.25 0.16 

Thr 

163 


A 

B 




-0.73 

0.80 



F 

-0.45 0.29 

Leu 

164 


A 

B 




-0.34 

0.23 




-0.30 0.35 

Val 

165 


A 

B 




0.29 

-0.26 




0.30 0.71 

Ala 

166 


A 

B 




-0.26 

-0.19 




0.30 0.67 

Glu 

167 

A 

A 





-0.72 

-0.03 




0.30 0.60 

Lys 

168 

A 

A 





-0.72 

-0.03 




0.30 0.67 

His 

169 

A 

A 





-0.50 

-0.19 




0.30 0.95 

Val 

170 

A 

A 





-0.23 

-0.19 

* 



0.30 0.56 

Leu 

171 

A 

A 





0.32 

0.31 

* 



-0.30 0.28 

Thr 

172 

A 

A 





-0.34 

0.81 

* 



-0.60 0.28 

Ala 

173 

A 

A 





-1.28 

0.89 

* 



-0.60 0.20 

Ala 

174 

A 

A 





-1.28 

0.93 

* 



-0.60 0.17 

His 

175 


A 

B 




-0.42 

0.74 




-0.32 0.16 

Cys 

176 


A 

B 




0.04 

0.26 




0.26 0.27 

Ile

177 


A 

B 




0.40 

0.19 




0.54 0.26 

His 

178 





T 

T 

0.68 

-0.31 

* 



2.22 0.39 

Asp 

179 





T 

T 

1.02 

-0.33 

* 


F 

2.80 1.04 

Gly 

180





T 

T 

0.20

-0.14 

* 

* 

F 

2.52 2.33 

Lys 

181 





T 

T 

0.91 

-0.19 

* 

* 

F 

2.24 1.27 

Thr 

182 



B 

B 



1.46 

-0.69 

* 

* 

F 

1.46 1.52 

Tyr 

183 



B 

B 



1.18 

-0.26 

* 

* 

F 

0.88 1.52 

Val 

184 



B 

B 



1.18 

-0.20 



F 

0.60 1.10 

Lys 

185 



B 

B 



1.57 

0.20 

* 

* 

F 

0.12 1.32 

Gly 

186 



B 




0.71 

-0.29 

* 

* 

F 

1.04 1.68 

Thr 

187 



B 




1.13 

-0.36 


* 

F 

1.16 1.87 

Gln

188 



B 




0.52 

-1.00 

* 

* 

F 

1.58 1.83 

Lys 

189 



B 

B 



1.03 

-0.36 

* 

* 

F 

1.20 1.37 

Leu 

190 



B 

B 



0.29 

-0.36 

* 

* 

F 

0.93 0.94 

Arg 

191 



B 

B 



-0.18 

-0.06 

* 

* 


0.66 0.47

Val 

192 



B 

B 



0.18 

0.23 


* 


-0.06 0.19 

Gly 

193 



B 

B 



-0.03 

0.23 


* 


-0.18 0.47 

Phe 

194 



B 

B 



-0.03 

-0.03 


* 


0.64 0.37 

Leu 

195 



B 

B 



0.08 

-0.03 


* 


1.13 1.00 

Lys 

196 



B 



T 

0.01 

0.11


* 

F 

1.27 0.88 

Pro 

197 



B 



T 

0.87 

-0.31 

* 

* 

F 

2.36 2.02 

Lys 

198 





T 

T 

0.87 

-1.10 

* 

* 

F 

3.40 4.10 

Phe 

199 



B 



T 

1.22 

-1.36 

* 


F 

2.66 2.03 

Lys 

200 



B 




2.14 

-0.93 



F 

2.12 1.30 


Asp 

201 





T 

T 


1.76 

-1.36 

* 

* 

F 

2.38 1.27 

Gly 

202 





T 

T 


1.3S 

-0.93 

* 

* 

F 

2.04 1.45 

Gly 

203 





T 

T 


1.33 

-1.21 



F 

1.85 0.73 

Arg 

204 






T 

C 

2.03 

-0.81 

* 

* 

F 

1.95 0.71 

Gly 

205 







c 

1.69 

-0.81 

* 


F 

2.20 1.19 

Ala 

206 







c 

1.3S 

-0.86 

* 


F 

2.50 1.62 

Asn 

207 






T 

c 

1.42 

-0.80 



F 

3.00 1.19 

Asp 

208 






T 

c 

1.18 

-0.41 

* 


F 

2.40 1.61 

Ser 

209 






T 

c 

0.47 

-0.34 

* 


F 

2.10 1.61 

Thr 

210 






T 

c 

0.60 

-0.23 

* 


F 

1.65 0.99 

Ser 

211 







c 

1.19 

-0.20 



F 

1.15 0.92 

Ala 

212 


A 





c 

1.19 

-0.20 



F 

0.80 1.19 

Met 

213 


A 

B 





0.59 

-0.19 

* 



0.45 1.42 

Pro 

214 


A 





c 

0.93 

-0.06 


* 

F 

0.80 1.05 

Glu 

215 

A 

A 






0.54 

-0.44 


* 

F 

0.60 2.08 

Gln

216 

A 

A 






0.84 

-0.16 


* 

F 

0.60 1.82 

Met 

217 

A 

A 






1.14 

-0.37 


* 

F 

0.60 2.04 

Lys 

218 

A 



B 




0.86 

0.11 

* 

* 


-0.15 1.24

Phe 

219 

A 



B 




1.18 

0.80 

* 



-0.60 0.50 

Gln

220 



B 

B 




0.32

0.40 


* 


-0.60 0.99 

Trp

221 



B 

B 




0.37 

0.43 

* 

* 


-0.60 0.37 

Ile

222 



B 

B 




1.08 

0.43 

* 



-0.37 0.85 

Arg 

223 



B 

B 




0.72 

-0.36 




0.76 0.96 

Val 

224 



B 

B 




1.39 

-0.27 

* 



1.14 1.32 

Lys 

225 




B 

T 



0.53 

-0.69 

* 


F 

2.22 2.56 

Arg 

226 




B 

T 



0.61 

-0.73 

* 


F 

2.30 0.97 

Thr 

227 




B 

T 



1.54 

-0.30 

* 


F 

1.92 2.02 

His 

228 




B 



c 

1.09 

-0.94 



F 

1.79 2.02 

Val 

229 




B 



c 

1.66 

-0.51 



F 

1.56 1.02 

Pro 

230 



B 



T 


0.72 

0.40 



F 

0.18 0.74 

Lys 

231 





T 

T 


0.66 

0.60 


* 

F 

0.35 0.38 

Gly 

232 





T 

T 


0.62 

0.10 

* 

* 

F 

0.80 1.03 

Trp

233 



B 



T 


0.66 

-0.11 


* 

F 

0.85 0.66 

Ile

234 



B 





0.92 

-0.14 

* 

* 

F 

0.92 0.53 

Lys 

235 



B 





1.13 

0.36 



F 

0.59 0.54 

Gly 

236 







c 

1.09 

0.33 

* 

* 

F 

1.06 0.83 

Asn 

237 






T 

c 

0.54 

-0.59 


* 

F 

2.58 1.98 

Ala 

238 






T 

c 

0.49 

-0.59 

* 

* 

F 

2.70 0.69 

Asn 

239 






T 

c 

0.78 

-0.16 

* 


F 

2.13 0.69 

Asp 

240 





T 

T 


0.73 

0.03 


* 

F 

1.46 0.43 

Ile

241 



B 





0.83 

-0.37 




1.04 0.71

Gly

242 








0.83 

-0.11


* 


0.77 0.69 

Met 

243 



B 





1.18

-0.51




U.OV U.D7 

Asp 

244 



B 



T 


0.59 






Tyr 

245 



B 



T 


-0.22 

0.06 


* 


0.25 1.57

Asp 

246 



B 



T 


-0.14 

0.31 




0.25 1.31 

Tyr 

247 



B 



T 


0.20 

0.39 




0.10 0.65 

Ala 

248 

A 

A 






-0.01 

0.39 




-0.30 0.71 

Leu 
Leu 

249 
250 

A 
A 

A 
A 






0.03 
0.32 

0.31 
0.31 




-0.30 0.35 
ft it\ n 

Glu 

251 

A 

A 






0.11

-0.44 




0.30 U.oy 

Leu 

252 

A 

A 






0.32 

-0.51 



F 

0.90 1.67 

Lys 

253 

A 

A 






0.96 

-0.70 



F 

o.yu A./b 

Lys 

254 

A 





T 


1.88 

-1.39 



F 

1.30 3.18 

Pro 

255 

A 





T 


2.73 

-1.39 




1.30 7.56 

His 

256 

A 





T 


2.03 

-2.07 



F 

1.30 7.56 

Lys 

257 

A 





T 


2.24 

-1.29 



F 

1.30 3.27 

Arg 

258 

A 







2.24 

-0.67 



F 

1.10 2.09

Lys 

259 



B 





1.31 

-1.10 



F 

1.10 3.08 

Phe 

260 



B 

B 




1.18 

-0.91 




0.75 1.08 

Met 

261 



B 

B 




0.36 

-0.49 




0.30 0.55 

Lys 

262 



B 

B 




0.01 

0.16 




-0.30 0.20 

Ile

263 



B 

B 




-0.31 

0.54 




-0.60 0.31

Gly 

264 



B 

B 




-0.57 

0.19 




ft m r\ Af\ 

-0.02 0.4V 

Val 

265 




B 



C 

-0.46 

0.00 



F 

0.61 0.38 

Ser 

266 




B 



c 

0.19 

0.50 



F 

0.59 0.55 

Pro 

267 






T 

c 

0.14 

-0.19 





Pro 

268 





T 

T 


0.22 

-0.21 



F 


Ala 

269 





T 

T 


0.36 

-0.17 





Lys 

270 



B 



T 


0.87 

-0.13 





Gln

271 



B 





0.82 

-0.13



p 

1.36 1.01

Leu 

272 



B 



T 


1.14 

-0.13 




1.13 0.99

Pro 

273 



B 



T 



-0.63 


* 

p 

1.15 0.97 

Gly 

274 





T 

T 


1.02 

0.06 


* 

F 

0.65 0.39 

Gly 

275 



B 



T 


0.28 

0.16 



F 

0.25 0.65 

Arg 

276 



B 

B 




-0.02 

0.26 



F 

-0.15 0.36 

Ile

277 



B 

B 




0.44 

0.21 


* 


-0.14 0.49 

His 

278 



B 

B 




0.41 

0.21 

* 

* 


0.02 0.49 

Phe 

279 



B 



T 


0.76 

0.54 

* 

* 


0.28 0.39 

Ser 

280 



B 



T 


1.10 

0.54 


* 


0.44 0.94 

Gly 

281 





T 

T 


0.99 

0.26 



F 

1.60 1.11 

Tyr 

282 





T 

T 


1.99 

-0.24 


* 

F 

2.04 2.14

Asp

283 




T 



1.81 



p 

2.32 3.12 

Asn 

284 




T 



2.17

-0.99


p 

2.50 4.88 

Asp 

285 




T 



2.47 

-0.99 



Z.Do j.\Jo 

Arg 

286 





T 

C 

2.00 

-1.34 



2.86 2.97

Pro 

287 




T 

T 


1.39 

-0.66 


F 

3.40 1.52 

Gly 

288 




T 

T 


1.14 

-0.41 


F 

2.61 0.68 

Asn 

289 


B 



T 


1.26 

0.34 


F 

1.27 0.54 

Leu 

290 


B 

B 




0.56 

0.34 



ft IS A fiQ 

Val 

291 


B 

B 




-0.22 

0.70 



-O.2o U.oU 

Tyr 

292 


B 

B 




-0.01 

0.84 



ft tzn ft 1ft 

Arg 

293 

A 

B 

B 




-0.52 

0.44 



r\ £-f\ f\ Af\ 

-O.oU 

Phe 

294 

A 

B 

B 




-0.48 

0.40 



n £ft ft Af\ 

Cys 

295 

A 

B 

B 




0.33 

-0.24 



O.JO O.jZ 

Asp 

296 

A 

B 

B 




1.19 

-1.00 



0.60 0.44 

Val 

297 

A 

B 





1.12 

-1.00 



ft fift ft 9S 
U.bU U.OO 

Lys 

298 

A 



T 



0.77 

-1.30 


F 

1.3U A.J f 

Asp 

299 

A 



T 



1.47 

-1.11 


F 

t -art *> ti 
1 .30 Z.Zj 

Glu 

300 

A 

B 


T 



1.32 

-1.11 


F 

1.30 5.01 

Thr 

301 

A 


B 

T 



0.51 

-1.07 


F 

1 .30 z.uo 

Tyr 

302 

A 


B 

T 



3.12 

-0.39 



0.85 1.02

Asp 

303 

A 

B 

B 




1.08 

0.37 



-0.30 0.92 

Leu 

304 

A 

B 

B 




1.08 

0.77 



-0.45 1.11 

Leu 

305 

A 

B 

B 




0.41 

0.69 



-0.45 1.22

Tyr 

306 

A 

B 

B 




0.72 

0.50 



-0.60 0.39 

Gln

307 

A 

B 

B 




0.67 

0.50 



f% Ct\ ft Qft 

-U.OU U.BU 

Gln

308 

A 

B 

B 




0.67 

0.20 


F 

ft nft i ?o 

Cys 

309 

A 

B 

B 




1.27 

-0.09 


F 


Asp 

310 



B 

T 



1.73 

-0.41 


F 

1.25 1.28

Ser 

311 


B 





1.39 

-0.39 


F 

1.15 0.73

Gln

312 


B 



T 


1.09 

-0.29 



1.75 1.37 

Pro 

313





T 

c 

0.74 

-0.47 



2.20 1.10

Gly 

314 





T 


1.11 

-0.04 


F 

2.50 0.81 

Ala 

315 




T 

T 

c 

0.77 

-0.04 


F 

2.25 0.63 

Ser 

316 





T 

c 

0.21 

-0.01 


F 

1.80 0.40 

Gly 

317 





T 

c 

-0.03 

0.20 


F 

0.95 0.30 

Ser 

318 


B 



T 


-0.68 

0.53 

* 

F 

0.20 0.47 

Gly 

319 


B 



T 


-0.22 

0.67 

* 

F 

-0.05 0.26 

Val 

320 


B 

B 




-0.23 

0.29 

* * 


-0.30 0.51 

Tyr 

321 


B 

B 




-0.22 

0.47 



-0.60 0.38 

Val 

322 


B 

B 




0.17 

1.00 

* * 


-0.60 0.40 

Arg 

323 


B 

B 




0.58 

0.57 

* * 


-0.45 1.09



Met 

324 


B 

B 




0.92 

-0.07 

* 

* 


0.45 1.36 

Trp 

325 


B 

B 




1.74 

-0.43 

* 



0.45 3.17 


326 


B 

B 




1.99 

-0.57 

* 

* 


0.75 2.20 

Arg 

327 




T 



2.89 

-0.17 

* 

* 

F 

1.00 3.85 


328

\ 






2.49 

-0.79 





His 

329 






Q 

3.09 

-0.79 

* 


p 

1.10 3.85

Gln

330 







3.49

-0.79

* 


F 

1.10 3.41 

Lys 

331 











F 

1.30 3.85 

Trp

332 




T 



2.49

-1.19



F 

1.30 5.66 

Glu 

333 







1.60




F 

1 1 rt OQ 

l- 1\) z.zy 

Arg 

334 











F 

n tc n oft 

Lys 

335 



B 




0.69 

-0.29 




0.30 0.76 

Ile

336 












U.OU U.4.J 

Ile

337 

A 

B 

B 




-0.26 

0.10 




ft 1ft ft 1 c 

-U.JO U.15 









-0.60 

0.49 




ft £.ft ft 1 ft 

-o.oo u. iu 

Met 



B 





-0.74 

0.91 




-0.60 0.15 


340 


B 

B 




-0.79 

0.73 




ft Aft ft TO 

Ser 

341 





T 

c 

-0.19 

0.44 




ft Aft ft Cft 

U.UU D.jU 

Gly 

342 





T 

c 

-0.16 

0.93 




ft ftft ft CI 

U.UU U.O.J 

His 

343 





T 

c 

0.19 

0.96 




ft ftft ft C£. 
U.UU U.DO 

Gln

344 


B 



T 


0.19 

0.27 




0.10 0.70 

Trp 

345 


B 





1.08 

0.50 




ft Aft ft *TA 

-U.40 U.7U 

Val 

346 


B 





1.03 

0.07 




n '■jft ft o £ 
U.ZU U.BO 

Asp 



B 





1.08 





ft "7ft ft ylft 

Met 

348 




T 

T 


0.90 

-0.01 



F 

2.15 0.62 














2.60 1.30

Gly 

350 





T 

Q 

1.19 

-0.74 


* 

p 

3.00 1.35 














2.70 2.35

Pro 

352 





T 

Q 

1.03

-0.57 

* 

* 

p 

2.40 1.22 

Gln

353 




T 

T 


1.74

-0.09 



p 

2.25 1.78 

Glu 

354 





T 


1.40 

-0.51 



p 

2.10 2.60 

Phe 

355 


B 





1.08 

-0.47 



F 

1.55 1.67 

Thr 

356 




T 

T 


1.08 

-0.33 

* 


F 

2.25 0.52 

Arg 

357 




T 

T 


1.29 

-0.34 

+ 

* 

F 

2.50 0.40 

Gly 

358 




T 

T 


0.40 

-0.34 

* 


F 

2.25 0.80 

Cys 

359 




T 

T 


0.09 

-0.44 

* 

* 

F 

2.00 0.39 

Ser 

360 



B 



c 

0.58 

-0.44 

* 


F 

1.15 0.29 

Glu 

361 



B 

T 



0.08 

-0.01 

* 

* 

F 

1.10 0.45 

Ile

362 


B 

B 




-0.03 

0.24 

* 

* 

F 

-0.15 0.69 

Thr 

363 


B 

B 




0-07 

0.07 



F 

-0.15 0.89 

Pro 

364 


B 

B 




-0.16 

0.44 



F 

-0.45 0.80

365

B 

Gln

366 

B 

Tyr 

367 

B 

Ile

368 

B 

Pro 

369 

B 

Asp 

370 

B 

Ile

371 

B 

Ser 

372 

B 

Ile

373 

B 

Gly 

374 

B 

Val 

375 

B 

Ter 

376 

B

B -0.07

B -0.07 

B -0.07 

B -0.06 

B -0.73 

B -0.27 

B -1.12 

B -1.27 

B -0.77 

B -0.94 

B -1.33

B -0.83 


1.13 

* 



-0.60 

0.80 

0.87 

* 



-0.60 

0.86 

0.39 




-0.30 

0.93 

0.64 


* 


-0.60

0.79 

0.34 


* 


-0.30 

0.61 

0.63 


* 

F 

-0.45 

0.27 

0.30 


* 

F 

-0.15 

0.39 

0.26 


* 


-0.30 

0.19 

0.26 


* 


-0.30 

0.14 

0.69 




-0.60 

0.26 

0.43 


* 


-0.60 

0.25 

0.47 


* 


-0.60 

0.45

Table X

Res Position I II III IV V VI VII VIII IX X XI XII XIII XIV

Met 

1 



B 

B 



-1.26 

0.43 




-0.60 

0.37 

Ala 

2 



B 

B 



-1.68 

0.64 




-0.60 

0.22 

Ser 

3 



B 

B 



-1.50

0.90 




-0.60 

0.14 

Val 

4 



B 

B 



-1.41

0.90 




-0.60 

0.22 

Val 

5 



B 

B 



-1.37 

0.67 




-0.60 

0.29 

Leu 

6 



B 



T 

-1.07 

0.60 



F 

-0.05 

0.21 

Pro 

7 





T 

T 

-0.48 

0.60 



F 

0.35 0.39 

Ser 

8 





T 

T 

-0.84 

0.36 



F 

0.65 0.90 

Gly 

9 





T 

T 

-0.58 

0.29 



F 

0.65 0.58 

Ser 

10 






T 

C -0.31 

0.10 



F 

0.45 0.38 

Gln

11 

A 





T 

-0.09 

0.17 



F 

0.25 0.29 

Cys 

12 

A 





T 

-0.47 

0.29 




0.10 0.29 

Ala 

13 

A 





T 

-0.76 

0.36




0.10 0.22 

Ala 

14 

A 

A 





-1.00 

0.47 




-0.60 

0.13 

Ala 

15 

A 

A 





-1.29 

0.57 




-0.60 

0.24 

Ala 

16 

A 

A 





-1.88 

0.50 




-0.60 

0.24 

Ala 

17 

A 

A 





-1.42 

0.50 




-0.60 

0.24 

Ala 

18

A 

A 





-1.04 

0.43 




-0.60 

0.37 

Ala 

19 

A 

A 





-0.80 

0.36 




-0.30 

0.57 

Ala 

20 

A 

A 





-1.02 

0.29 




-0.30 

0.56 

Pro 

21 

A 





T 

-0.32 

0.47 


* 

F 

-0.05 

0.46 

Pro 

22 

A 





T 

. -0.54 

-0.03 


* 

F 

0.85 0.89 

Gly 

23 

A 





T 

0.16 

0.16 


* 

F 

0.25 0.72 

Leu 

24 

A 





T 

-0.07 

-0.34


* 


0.70 0.92 

Arg 

25 


A 

B 




-0.29 

-0.09 




0.30 0.49 

Leu 

26 


A 

B 




-0.89 

0.17 


* 


-0.30 

0.41 

Arg 

27 


A 

B 




-1.49 

0.43 


* 


-0.60 

0.41

Leu 

28 


A 

B 




. -1.96 

0.43 


* 


-0.60 

0.17 

Leu 

29 


A 

B 




-1.84 

1.11 


* 


-0.60 

0.17 

Leu 

30 


A 

B 




-2.26 

1.21 




-0.60 

0.08 

Leu 

31 

A 

A 





-2.03 

1.60 

* 

* 


-0.60 

0.12 

Leu 

32 

A 

A 





. -2.73 

1.41 

* 

* 


-0.60 

0.15 

Phe 

33 

A 

A 





-2.51 

1.23 




-0.60 

0.18 

Ser 

34 

A 

A 





-2.51 

1.04 




-0.60 

0.23 

Ala 

35 

A 

A 





-2.59 

1.04 




-0.60 

0.23 

Ala 

36 

A 

A 





-1.99 

1.04 




-0.60 

0.18

Ala

37 

A 

A 





-1.49 

0.69 




-0.60 0.21 

Leu 

38 


A 

B 




-1.13 

0.79 




-0.60 0.30 

Ile

39 


A 

B 




-0.83 

0.71 




-0.32 0.30 

Pro 

40 


A 

B 




-0.59 

0.21 



F 

0.41 0.49 

Thr 

41 





T 


0.00 

0.14 



F 

1.29 0.59 


42 





T 

T 

0.59 

-0.14 



F 

2.52 1.45 


43 





T 

T 

0.59 

-0.43 



F 

2.S0 1.51 


44 






T 

C 0.78 

-0.17 



F 

2.17 0.86 

Gln

45 



B 



T 

0.68 

0.13 



F 

1.09 0.75 


46 



B 

B 



1.03 

0.19 



F 

0.41 0.65 

iZ 

47 



B 

B 



1.38 

0.19 



F 

0.28 1.32 

Phe 

48 



B 

B 



0.52 

-0.24 

* 


F 

0.60 1.27 

Thr 

49 



B 

B 



0.56 

0.00 

* 


F 

-0.15 0.59 

Lys 

50 



B 

B 



-0.30 

0.09 

* 


F 

0.00 1.02 


51



B 

B 



. -1.19 

0.04 

* 


F 

-0.15 0.88 

vZ 

52 



B 

B 



-0.38 

-0.06 




0.30 0.43 

Thr 

53 



B 

B 



-0.02 

-0.54 

* 



0.60 0.37 

Val 

54 



B 

B 



0.29 

-0.11 

* 



0.30 0.22 

Ile

55 




B 



-0.61 

-0.11 

* 



0.30 0.51 

Glu 

56 

A 



B 



-1.20 

-0.11 



F 

0.45 0.26 

Gly 








-0.66 

-0.10 

* 


F 

0.65 0.36 

Glu 

58 




B 



-1.23 

-0.26 



F 

0.45 0.74 






B 



-0.68 

-0.26 




0.30 0.30 


60 




B 



-0.46 

0.13 


* 


-0.30 0.40 

Thr 

61 




B 



-0.46 

0.27 




-0.30 0.12 

Ile

62 




B 



-0.97 

0.67 


* 


-0.60 0.29 

Ser 

63 




B 



-0.97 

0.67 

* 



-0.60 0.21 

Cys 

64 



B 

B 



-0.07 

0.57 

* 

* 


-0.60 0.24 

Gln

65 



B 

B 



0.22 

0.09 


* 


0.04 0.68 

Val 

66 



B 

B 



0.53 

-0.21 

* 

* 


0.98 0.68 


67 




B 

T 


1.42 

-0.60 

* 

* 

F 

2.32 2.12 

Lys 

68 




B 

T 


1.42 

-1.17 

* 


F 

2.66 2.05 

Ser 

69 





T 

T 

1.23 

-1.19 

* 


F 

3.40 3.69 

Asp 

70 





T 

T 

0.34 

-1.19 

* 


F 

3.06 1.70 

Asp 

71 



B 



T 

1.20 

-0.90 

* 


F 

2.17 0.60 

Ser 

72 



B 



T 

0.39 

-0.50 

* 



1.38 0.77 

Val 

73 



B 

B 



-0.47 

-0.20 

* 



0.64 0.38 

Ile

74 



B 

B 



-0.17 

0.49 

*



-0.60 0.19 

Gln

75 



B 

B 



-0.38 

0.89 

* 



-0.60 0.23 

Leu 

76 



B 

B 



-0.38 

0.93 

* 



-0.60 0.47 

Leu 

77 



B 

B 



0.03 

0.69 




-0.17 1.08

Asn

78 






T 

C 

0.89 

0.00 

* 


F

1.16 1.22 

Pro 

79 






T 

C 

1.47 

0.00 



F 

1.44 2.56 

Asn 

80





T 

T 


0.58 

-0.20 

* 


F 

2.52 4.49 

Arg 

81 





T 

T 


1.14 

-0.20 



F 

2.80 1.96 

Gln

82 



B

B 




1.26 

0.16 


* 

F 

1.12 1.98 

Thr 

83 



B 

B 




1.37 

0.51 


* 

F 

0.54 1.07 

Ile

84 



B 

B 




1.58 

0.11 


* 


0.41 1.07 

Tyr 

85 



B 

B 




0.88 

0.11 

* 

* 


0.13 1.03 

Phe 

86 



B 

B 




0.8S 

0.50 


* 


-0.60 0.62 

Arg 

87 



B 

B 




0.67 

0.01 

* 



-0.15 1.73 

Asp 

88 



B 

B 




0.17 

-0.24 

* 


F 

0.94 1.70 

Phe 

89 



B 





1.10 

-0.31 



F 

1.48 1.62 

Arg 

90 







c 

1.34 

-1.10 

* 


F 

2.32 1.66 

Pro 

91 







c 

1.74 

-1.10 

* 


F 

2.66 1.66 

Leu 

92 





T 

T 


1.74 

-0.71 

* 

* 

F 

3.40 2.56 

Lys 

93 





T 

T 


1.04 

-1.50 


* 

F 

3.06 2.56 

Asp 

94 





T 

T 


1.74 

-0.71 

* 

* 

F 

2.72 1.44 

Ser 

95 

A 





T 


0.82 

-0.74 

* 

* 

F 

1.98 3.01 

Arg 

96 


A 

B 





0.22 

-0.74 



F 

1.24 1.24 

Phe 

97 


A 

B 





1.03

-0.06 




0.30 0.61 

Gln

98 


A 

B 





0.29 

0.34 




-0.30 0.74 

Leu 

99 


A 

B 





-0.01 

0.74 


4c 


-0.60 0.33 

Leu 

100 


A 

B 





-0.01 

1.13 


* 


-0.60 0.50 

Asn 

101 


A 





c 

-0.42 

0.73 


* 


-0.40 0.39 

Phe 

102 


A 





c 

0.28 

0.71 

* 


F 

-0.25 0.63 

Ser 

103 






T 

c 

-0.53 

0.03 



F 

0.60 1.33 

Ser 

104 

A 





T 


0.32 

0.03 

* 

* 

F 

0.25 0.68 

Ser 

105 

A 





T 


0.28 

-0.37 


* 

F 

1.00 1.58 

Glu 

106 

A 





T 


-0.02 

-0.51 


* 

F 

1.15 0.87 

Leu 

107 

A 



B 




-0.13 

-0.51 


* 

F 

0.75 0.87 

Lys 

108 

A 



B 




-0.14 

-0.21 


* 

F 

0.45 0.54 

Val 

109 

A 



B 




0.16 

-0.11 

* 

* 


0.30 0.45 

Ser 

110



B 

B 




-0.40 

0.29 


* 


-0.30 0.87 

Leu 

111



B 

B 




-0.70 

0.24 


* 


-0.30 0.32 

Thr 

112 



B 

B 




-0.78 

0.63 


* 


-0.60 0.58 

Asn 

113 



B 

B 




-1.12 

0.67 




-0.60 0.31 

Val 

114 



B 

B 




-0.27 

0.67 




-0.60 0.50 

Ser 

115 



B 

B 




0.03 

-0.01




0.64 0.58 

Ile

116 



B 

B 




0.50 

-0.50 


* 

F 

1.13 0.62 

Ser 

117 



B 



T 


0.92 

-0.47 

* 

* 

F 

1.87 0.83 

Asp 

118 





T 

T 


0.68 

-1.11 

* 


F 

3.06 1.21 



Glu 

119








0.83 

-0.74 



F 

3.40 2.70 

Gly 

120 





T 



0.47 

-0.64



F 

3.06 1.74 

Arg 

121 




B 

T 



1.36 

-0.46 



F 

1.87 0.56 

Tyr

122 




B 

T 



0.84 

-0.06 




1.38 0.56 

Phe 

123 



B 

B 




0.60 

0.63 




-0.26 0.47 

Cys 

124 



B 

B 




0.29 

0.96 




-0.60 0.37 

Gln

125 



B 

B 




0.63 

1.44 




-0.60 0.34 

Leu 

126 



B 

B 




0.31 

0.69 




-0.60 0.66 

Tyr 

127 





T 



0.34 

0.33 




0.79 1.91 

Thr 

128 





T 



1.04 

0.19 



F 

1.28 1.71 

Asp 

129 






T 

C 

1.71 

0.19 



F 

1.62 3.58 

Pro 

130 






T 

C 

1.41 

-0.50 



F 

2.86 3.96 

Pro 

131 





T 

T 


1.98 

-0.87 



F 

3.40 3.68 

Gln

132 





T 

T 


1.91 

-0.60 



F 

3.06 3.45 

Glu 

133 



B 

B 




1.91 

-0.11 



F 

1.62 3.22 

Ser 

134 



B 

B 




1.02 

-0.06 



F 

1.28 3.01 

Tyr 

135 



B 

B 




0.92 

0.20 



F 

0.34 1.22 

Thr 

136 



B 

B 




0.28 

0.29 



F 

0.00 1.01 

Thr 

137 



B 

B 




-0.53 

0.93 



F 

-0.45 0.56 

Ile

138 



B 

B 




-1.39 

1.23 




-0.60 0.30 

Thr 

139 



B 

B 




-1.30 

1.11 




-0.60 0.15 

Val 

140 



B 

B 




-1.27 

1.06 




-0.60 0.16 

Leu 

141 



B 

B 




-0.84

2.86 2.87

Arg 

362 



T 

T 


0.86 

-1.20 

* 

F 

3.40 2.54 

Ala 

363 



T 

T 


1.36 

-1.26 



2.91 1.95 

Arg 

364 



T 



1.28 

-0.83 



2.37 1.86

Table XI

Res Position I II III IV V VI VII VIII IX X XI XII XIII XIV

Met 


A 







0.14 

0.06 




-0.10 

0.89 

Ser 

2 

A 





T 


-0.28 

0.01 




0.10 0.94 

Ser 

3 

A 





T 


0.16 

0.27 




0.10 0.60 

Ser 

4 

A 





T 


0.51 

-0.16 

* 



0.85 1.22 

Ser 

5 

A 





T 


0.09 

-0.27 

* 


F 

1.00 1.24

Leu 

6 

A 

A 






-0.12 

0.03 

* 


F 

-0.15 

0.76 

Lys 

7 

A 

A 






-0.49 

0.33 

* 


F 

-0.15 

0.47 

His 

8 

A 

A 






-0.79 

0.51 

* 



-0.60 

0.19 

Leu 

9 

A 

A 






-1.08 

0.74 

* 



-0.60 

0.23 

Leu 

10 

A 

A 






-1.59 

0.56 

* 

* 


-0.60 

O.U 

Cys 

11 

A 

A 






-1.08

1.24 

* 

* 


-0.60 

0.07 

Met 

12 

A 

A 






-1.41 

1.13 




-0.60 

0.11 

Ala 

13 

A 

A 






-2.08 

1.36




-0.60 

0.14 

Leu 

14 

A 

A 






-1.57 

1.46 

* 



-0.60 

0.23 

Ser 

15 

A 



B 




-1.06 

1.27 

* 



-0.60 

0.31 

Trp 

16 

A 



B 




-1.09 

1.04 

* 



-0.60 

0.41 

Phe 

17 

A 



B 




-1.38 

1.33 

* 



-0.60 

0.43 

Ser 

18 




B 



c 

-1.09 

1.33 

* 



-0.40 

0.23 

Ser 

19 




B 



c 

-0.62 

1.33 

* 



-0.40 

0.29 

Phe 

20 




B 



c 

-0.32 

0.84 




-0.40 

0.33 

Ile

21 




B 



c 

-0.34

0.06 




-0.10 

0.43 

Ser 

22 




B 



c 

0.06 

0.16

* 

* 

F 

0.05 0.46 

Gly 

23 







c 

-0.34 

0.16 


* 

F 

0.25 0.71 

Glu 

24 







c 

-0.34 

0.16 



F 

0.25 0.88 

Thr 

25 







c 

-0.46 

-0.14 


* 

F 

0.85 0.88 

Ser 

26 







c 

-0.38 

0.16 


* 

F 

0.25 0.73 

Phe 

27 

A 







-0.08 

0.41 


* 


-0.40 

0.35 

Ser 

28 

A 







-0.03 

0.81 

* 

* 


-0.40 

0.39 

Leu 

29 







c 

-0.73 

0.71 

* 



-0.20 

0.39 

Leu 

30 







c 

-1.12 

1.11 




-0.20 

0.39 

Asn 

31 





T 

T 


-1.63 

1.11 




0.20 0.25 

Ser 

32 





T 

T 


-1.14 

1.41 

* 



0.20 0.25 

Phe 

33 





T 

T 


-1.09 

1.16 




0.20 0.47 

Phe 

34 



B 



T 


-0.49 

1.23 




-0.20 

0.46 

Leu 

35 







c 

0.02 

1.26 




-0.20 

0.53 

Pro 

36 





T 



-0.28 

1.26 




0.00 0.82 



Tyr 

37 




Pro 

38 



T 

Ser 

39 



T 

Ser 

40 



T 

Arg 

41 


B 

T 

Cys 

42 


B 

T 

Cys 

43 


B 

T 

Cys 

44 


B 

T 

Phe 

45 


B 

T 

Ser 

46 


B 

T 

Val 

47 


B 

T 

Gln

48 


B 

T 

Cys 

49 


B 

T 

Ser 

50 


B 

T 

Ile

51 


B 

T 

Leu 

52 


B 


Asp 

53 




Pro 

54 



T 

Phe 

55 



T 

Ser 

56 



T 

Cys 

57 



T 

Asn 

58 



T 

Ser 

59 



T 

Met 

60 



T 

Arg 

61 




Phe 

62 




Pro 

63 



T 

Trp 

64 



T 

Glu 

65 

A 



Asn 

66 



T 

Ter 

67 



T 





0.86 


* 

P 

0.67 1.27 

T 


0.17

0.07 



P 

1.14 3.02 

T 


0.20 

-0.04


* 

P 

1.91 1.05 

T 


0.34 

0.10 



P 

1.33 0.36 



-0.14 





1.70 0.12 



-0.20 

0.27 




ft 7R ft O.R 



-0.84 

0.27 




ft fit n fiR 



-0.54 

0.53 




ft \a ft m 



-0.91 

0.93 




n m ft i fi 



-1.32 

0.93 




n on n in 



-1.54 

0.74 




n in n K 



-1.69 

0.86 







-1.02 

0.76 




n on n n 



-0.53 

0.37 




n i n n OS 
U. 1U U.ZS 



-0.93 

0.16 




A 1 ft ft o^ 



-0.38 

0.54 




a on ft rfA 

T 

C 

-1.04 

0.36




ft in a /in 

T 


-0.38 

0.54 




0.33 U-jSU 

T 


-0.38 

0.26 




n en n CO 

T 


-0.09 

-0.04 




1.10 0.48 

T 


0.S3 

0.57 




A OA n in 

T 


0.13 

0.14 




0.50 0.69 

T 


0.13 

0.14 

* 

* 


0.50 0.44 

T 


0.54 

0.19 

* 

* 


0.86 1.28 


c 

0.84 

0.53 

* 

* 


0.22 0.84 


c 

1.51 

0.13 

* 

* 


0.88 1.08 



1.12 

0.14 

* 



1.29 1.76 



1.03 

-0.04 


* 


2.10 1.15 



1.24 

0.39 


* 


0.89 1.70 



0.74 

0.03 


* 


1.08 1.40 



1.06 

0.03 




0.87 1.70

Table XII

Res Position I II III IV V VI VII VIII IX X XI XII XIII XIV

Met

1



B 





0.97 

-0.71 

* 



1.64 1.91 

Ser 

2 



B 



T 


0.76 

-0.76 




2.07 2.00 

Arg 

3 



B 



T 


0.33 

-0.57 




2.30 1.55 

Arg 

4 



B 



T 


-0.09 

-0.31 




1.77 1.29 

Ser 

5 



B 



T 


-0.29 

-0.24 




1.39 0.79 

Met 

6 


A 

B 





0.02 

-0.13 




0.76 0.41 

Leu 

7 


A 

B 





-0.27 

0.79 




-0.37 0.22 

Leu 

8 


A 

B 





-1.19 

1.29 




-0.60 0.17 

Ala 

9 


A 

B 





-1.51 

1.59 




-0.60 0.14 

Trp 

10 


A 

B 





-1.51 

1.40 




-0.60 0.26 

Ala 

11 


A 

B 





-1.72 

1.10 




-0.60 0.42 

Leu 

12 



B 



T 


-1.72 

1.10 

* 



-0.20 0.34 

Pro 

13 



B 



T 


-0.80 

1.29 

* 

* 


-0.20 0.27 

Ser 

14 



B 



T 


-1.02 

0.37 

* 

* 


0.10 0.52 

Leu 

15 



B 



T 


-1.0S 

0.56 

* 

* 


-0.20 0.52 

Leu 

16 


A 

B 





-1.08 

0.30 

* 

* 


-0.30 0.33 

Arg 

17 


A 

B 





-0.86 

0.37 

* 

* 


-0.30 0.25 

Leu 

18 


A 

B 





-0.64 

0.49 


* 


-0.60 0.31

Gly

19 


A 





C 

-0.34 

0.20 


* 


-0.10 0.65 

Ala 

20 


A 





C 

0.16 

-0.49 


* 


0.50 0.57 

Ala 

21 


A 





c 

0.97 

0.00 




0.65 1.00 

Gln

22 


A 

B 





0.S6 

-0.69 


* 

F 

1.21 1.75 

Glu 

23 


A 

B 





1.46 

-1.11 



F 

1.52 2.90 

Thr 

24 


A 



T 



1.21 

-1.19 



F 

2.23 4.44 

Glu 

25 


A 



T 



1.13 

-1.19 



F 

2.54 2.59 

Asp 

26 





T 

T 


1.06 

-1.01 



F 

3.10 0.80 

Pro 

27 





T 

T 


0.76 

-0.44 



F 

2.49 0.30 

Ala 

28 





T 

T 


0.54 

-0.54 




2.33 0.23 

Cys 

29 





T 

T 


-0.03 

-0.11 




1.72 0.21 

Cys 

30 



B 

B 




-0.89 

0.57 




-0.29 0.10 

Ser 

31 



B 

B 




-1.10 

0.79 




-0.60 0.07 

Pro 

32 



B 

B 




-0.78 

0.71 




-0.60 0.20 

Ile

33 



B 

B 




-0.19 

0.14 




0.00 0.75 

Val 

34 



B 



T 


0.48 

-0.03 


* 

F 

1.45 0.90 

Pro 

35 



B 



T 


0.86

-0.41 



F 

1.90 1.01 

Arg 

36 





T 

T 


1.20 

0.07


* 

F 

2.00 1.51

0.82

-0.61 



F 

3.00 4.07 

Glu 

38





T 



0.90 

-0.76 



F 

2.50 2.66 

Trp 

39 


A 



T 



1.17 

-0.50 



F 

2.20 1.12 

Lys 

40 


A 





C 

1.08 

0.00




1.10 0.70 

Ala 

41 


A 





c 

0.97 

-0.01 




0.80 0.54 

Leu 

42 


A 





c 

0.30 

-0.01 




0.50 0.90 

Ala 

43 

A 

A 






-0.29 

-0.36 




0.30 0.24 

Ser 

44 

A 

A 






0.00 

0.14 




-0.30 0.24

Glu 

45 

A 

A 






-0.08 

0.04 




-0.30 0.50 

Cys 

46 

A 

A 






-0.30 

-0.14 




0.30 0.68 

Ala 

47 

A 

A 






0.21 

0.04 




-0.30 0.42 

Gln

48 


A 

B 





-0.01 

0.04 




-0.30 0.32 

His 

49 


A 

B 





0.08 

0.73 




-0.60 0.50 

Leu 

50 


A 

B 





-0.73 

0.59 




-0.60 0.76 

Ser 

51 


A 

B 





0.04 

0.77 




-0.60 0.36 

Leu 

52 



B 





0.39 

0.37 




-0.10 0.52 

Pro 

53 



B 

B 




-0.47 

0.63 




-0.60 0.99 

Leu 

54 



B 

B 




-1.29 

0.59 




-0.60 0.55 

Arg 

55 



B 

B 




-1.33 

0.84 




-0.60 0.49 

Tyr 

56 



B 

B 




-1.33 

0.80 




-0.60 0.24 

Val 

57 



B 

B 




-0.56 

0.76 




-0.60 0.39 

Val 

58 



B 

B 




-0.66 

0.57 




-0.60 0.27 

Val 

59 



B 

B 




-0.43 

1.06 




-0.60 0.25 

Ser 

60 



B 





-0.89 

0.80 




-0.40 0.34 

His 

61 



B 





-0.94 

0.59 




-0.40 0.45 

Thr 

62 



B 





-0.39 

0.33 




-0.10 0.81 

Ala 

63 





T 



-0.20 

0.07 



F 

0.45 0.81 

Gly 

64 





T 

T 


0.66 

0.26 



F 

0.65 0.32 

Ser 

65 





T 

T 


0.64 

0.16 



F 

0.65 0.35 

Ser 

66 





T 

T 


0.47 

0.16 



F 

0.65 0.51 

Cys 

67 





T 

T 


0.19 

0.09 



F 

0.65 0.79 

Asn 

68 





T 



0.4S 

0.16 



F 

0.45 0.60 . 

Thr 

69 







c 

0.16 

0.16 



F 

0.25 0.60 

Pro 

70 





T 

T 


0.46 

0.34 



F 

0.65 0.60 

Ala 

71 





T 

T 


0.76 

0.17 



F 

0.65 0.64 

Ser 

72 



B 



T 


1.42 

0.17 



F 

0.25 0.77 

Cys 

73 



B 



T 


0.83 

0.09 

* 

* 

F 

0.25 0.86 

Gln

74 


A 

B 





1.26 

0.16 


* 

F 

-0.15 0.86 

Gln

75 


A 

B 





1.47 

-0.34 

* 

* 

F 

0.60 1.26 

Gln

76 


A 

B 





1.20

-0.33 

* 

* 

F 

0.60 3J9 

Ala 

77 


A 

B 





1.50 

-0.26 

* 


F 

0.60 1.62

Arg

78 

A 

B 





2.13 

-0.26 



7 

0.60 1.62 

Asn 

79 

A 

B 





1.89 

-0.16 




0.45 1.28 

Val 

80 

A 

B 





1.86 

0.20 




-0.15 1.98 

Gln

81 

A 

B 





1.26 

0.20 




-0.15 1.37 

His 

82 

A 

B 





1.89 

0.81 

* 

* 


-0.60 0.85 

Tyr 

83 


B 





1.47 

0.41 

* 



-0.25 2.28 

His 

84 


B 





0.66 

0.26 




0.05 1.90 

Met 

85 


B 

B 




1.17 

0.54 




-0.45 1.15 

Lys 

86 


B 

B 




0.88 

0.47 




-0.60 0.73 

Thr 

87 



B 

T 



0.24 

0.63 




-0.20 0.56 

Leu 

88 



B 

T 



0.49 

0.70 




-0.20 0.30 

Gly 

89 



B 

T 



-0.33 

0.09 




0.10 0.25 

Trp 

90 


B 

B 




-0.08 

0.73 




-0.60 0.13 

Cys 

91 


B 

B 




-0.37 

0.67 




-0.60 0.16 

Asp 

92 


B 



T 


-0.06 

0.74 


* 


-0.20 0.25 

Val 

93 


B 



T 


0.06 

0.71 




-0.20 0.38 

Gly 

94 


B 



T 


-0.41 

0.59 


* 


-0.20 0.61 

Tyr 

95 


B 



T 


-1.01 

0.70 


* 


-0.20 0.30 

Asn 

96 


B 

B 




-0.69 

1.39 


* 


-0.60 0.29 

Phe 
Leu 

97 
98 


B 
B 

B 

B




-0.69 
0.17 

1.17 
0.74 


* 
* 


-0.60 0.29 
-0.60 0.32 

Ile

99 


B 

B 




0.17 

-0.01 




0.30 0.33 

Gly 

100 


B 



T 


-0.40 

0.01 




0.10 0.38 

Glu

101 


B 



T 


-1.26 

-0.09 



F 

0.85 0.38 

Asp 

102 




T 

T 


-0.80 

-0.13 



F 

1.25 0.40 

Gly 

103 





T 

C 

0.01 

-0.06 



F 

1.05 0.63 

Leu 

104 


B 





0.56 

-0.49 

* 

* 


0.50 0.63 

Val 

105 


B 





1.01 

-0.06 

* 



0.78 0.37 

Tyr 

106 


B 





0.67 

-0.06 




1.06 0.74 

Glu

107 


B 





0.38 

-0.06 



F 

1.49 0.89 

Gly 

108 




T 

T 


0.72 

0.17 



F 

1.92 1.25 

Arg 

109 




T 

T 


0.83 

-0.07 


* 

F 

2.80 1.29 . 

Gly 

110 




T 

T 


1.38 

-0.04 



F 

2.37 0.64

Trp 

111 




T 

T 


1.28 

0.44 




1.04 0.94 

Asn 

112 






C 

0.69 

0.44 




0.36 0.47 

Phe 

113 


B 





1.00 

0.94 




-0.12 0.4S 

Thr 

114 






c 

0.59 

1.01 




-0.20 0.63 

Gly 

115 






c 

0.59 

0.49 


* 


-0.20 0.52 

Ala 

116 






c 

0.S4 

0.51 


* 


-0.20 0.60 

His 

117 





T 

c 

0.03 

0.23 




0.30 0.56 

Ser 

118 





T 

c 

0.44 

0.43 




0.00 0.47

Gly

119 





T 

C 

0.76 

0.91 




0.00 0.49 

His 

120 





T 

C 

0.S9 

0.81 




0.00 0.58 

Leu 

121 




T 



0.S8 

0.74 




0.00 0.67 

Trp 

122 






c 

0.61 

0.97 




-0.20 0.67 

Asn 

123 






c 

0.02 

0.93 




-0.20 0.66 

Pro 

124 


B 

B 




0.02 

1.11 


* 


-0.60 0.56 

Met 

125 



B 

T 



-0.83 

0.86 


* 


-0.20 0.52 

Ser 

126 


B 

B 




-0.32 

0.63 


* 


-0.60 0.23 

Ile

127 


B 

B 




-0.73 

0.61 


* 


-0.60 0.20 

Gly 

128 


B 

B 




-1.33 

0.97 


* 


-0.60 0.17 

Ile

129 


B 

B 




-1.47 

0.97 




-0.60 0.13 

Ser 

130 


B 

B 




-0.87 

1.01 


* 


-0.60 0.18 

Phe 

131 


B 

B 




-0.81 

0.73 


* 


-0.60 0.29 

Met 

132 


B 



T 


-0.52 

1.06 




-0.20 0.66 

Gly 

133 




T 

T 


-0.18 

0.99 

* 

* 


0.20 0.48 

Asn 

134 




T 

T 


0.82 

0.60 

* 

* 


0.20 0.93 

Tyr 

135 




T 

T 


0.27 

-0.19 

* 



1.25 1.85 

Met 

136 




T 



0.76 

-0.16 

* 

* 


1.31 1.39 

Asp 

137 




T 



1.04 

-0.16 

* 

* 


1.57 1.33 

Arg 

138 


B 





1.18 

-0.07 

* 


F 

1.58 1.23

Val 

139 


B 



T 


1.18 

-0.40 

* 


F 

2.04 1.92 

Pro 

140 


B 



T 


0.83 

-0.61 



F 

2.60 1.99 

Thr 

141 





T 


0.54 

-0.11 



F 

2.24 1.03 

Pro 

142 


B 



T 


0.66 

0.57 


* 

F 

0.73 0.97 

Gln

143 

A 

B 





-0.04 

-0.07 


* 

F 

1.12 1.23 

Ala 

144 

A 

B 





0.22 

0.00 

* 



0.56 0.86 

Ile

145 

A 

B 





0.43 

0.01 

* 



-0.30 0.56

Table XIII summarizes the information corresponding to each "Gene No." described
above. The nucleotide sequence identified as "NT SEQ ID NO:X" was assembled from 
partially homologous ("overlapping") sequences obtained from the "cDNA clone ID" 
identified in Table XIII and, in some cases, from additional related DNA clones. The 
overlapping sequences were assembled into a single contiguous sequence of high 
redundancy (usually three to five overlapping sequences at each nucleotide position), 
resulting in a final sequence identified as SEQ ID NO:X. 
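
As an illustration of the redundancy figure quoted above, the number of overlapping sequences covering each nucleotide position of a contig can be tallied from the alignment spans. The following is only a minimal sketch: the function name `coverage_depth` and the example coordinates are hypothetical and are not taken from the sequence data of this application.

```python
def coverage_depth(contig_length, reads):
    """Return the number of reads covering each 1-based contig position.

    `reads` is an iterable of (start, end) pairs, 1-based and inclusive,
    giving where each overlapping sequence aligns on the contig.
    """
    # Difference array: +1 where a read starts, -1 just past where it ends.
    delta = [0] * (contig_length + 2)
    for start, end in reads:
        delta[start] += 1
        delta[end + 1] -= 1

    depth, running = [], 0
    for pos in range(1, contig_length + 1):
        running += delta[pos]
        depth.append(running)
    return depth


# Hypothetical example: four overlapping sequences on a 10-nucleotide contig.
example_reads = [(1, 6), (3, 10), (4, 8), (5, 10)]
print(coverage_depth(10, example_reads))
```

A contig of "high redundancy" in the sense used above would show a depth of roughly three to five at most positions.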

The cDNA Clone ID was deposited on the date and given the corresponding 
deposit number listed in "ATCC Deposit No:Z and Date." Some of the deposits contain 
multiple different clones corresponding to the same gene. "Vector" refers to the type of 
vector contained in the cDNA Clone ID. 

"Total NT Seq." refers to the total number of nucleotides in the contig identified by 
"Gene No." The deposited clone may contain all or most of these sequences, reflected by 
the nucleotide position indicated as "5' NT of Clone Seq." and the "3' NT of Clone Seq." 
of SEQ ID NO:X. The nucleotide position of SEQ ID NO:X of the putative start codon 
(methionine) is identified as "5' NT of Start Codon." Similarly, the nucleotide position of 
SEQ ID NO:X of the predicted signal sequence is identified as "5' NT of First AA of 
Signal Pep." 

The translated amino acid sequence, beginning with the methionine, is identified as 
"AA SEQ ID NO:Y," although other reading frames can also be easily translated using 
known molecular biology techniques. The polypeptides produced by these alternative 
open reading frames are specifically contemplated by the present invention. 

The first and last amino acid positions of SEQ ID NO:Y of the predicted signal 
peptide are identified as "First AA of Sig Pep" and "Last AA of Sig Pep." The predicted 
first amino acid position of SEQ ID NO:Y of the secreted portion is identified as 
"Predicted First AA of Secreted Portion." Finally, the amino acid position of SEQ ID 
NO:Y of the last amino acid in the open reading frame is identified as "Last AA of ORF." 
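
The column definitions above can be read as a single record per gene. The following is a minimal sketch of such a record, assuming Python field names invented here to mirror the quoted column headings; the example values are hypothetical and are not taken from Table XIII.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TableXIIIEntry:
    """One row of Table XIII; field names are illustrative, not from the source."""
    gene_no: int
    cdna_clone_id: str
    atcc_deposit_no_and_date: str
    vector: str
    nt_seq_id_no: int                              # "NT SEQ ID NO:X"
    total_nt_seq: int                              # nucleotides in the contig
    five_prime_nt_of_clone_seq: int
    three_prime_nt_of_clone_seq: int
    five_prime_nt_of_start_codon: int
    five_prime_nt_of_first_aa_of_signal_pep: int
    aa_seq_id_no: int                              # "AA SEQ ID NO:Y"
    first_aa_of_sig_pep: int
    last_aa_of_sig_pep: int
    predicted_first_aa_of_secreted_portion: int
    last_aa_of_orf: int

    def signal_peptide_length(self) -> int:
        return self.last_aa_of_sig_pep - self.first_aa_of_sig_pep + 1

    def secreted_portion_length(self) -> int:
        return self.last_aa_of_orf - self.predicted_first_aa_of_secreted_portion + 1


# Hypothetical values, for illustration only.
example = TableXIIIEntry(
    gene_no=1, cdna_clone_id="EXAMPLE01", atcc_deposit_no_and_date="(placeholder)",
    vector="(placeholder)", nt_seq_id_no=11, total_nt_seq=1200,
    five_prime_nt_of_clone_seq=1, three_prime_nt_of_clone_seq=1200,
    five_prime_nt_of_start_codon=101, five_prime_nt_of_first_aa_of_signal_pep=101,
    aa_seq_id_no=12, first_aa_of_sig_pep=1, last_aa_of_sig_pep=20,
    predicted_first_aa_of_secreted_portion=21, last_aa_of_orf=250,
)
```

With these placeholder values, the signal peptide spans 20 residues and the predicted secreted portion spans 230 residues.
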

SEQ ID NO:X (where X may be any of the polynucleotide sequences disclosed in 
the sequence listing) and the translated SEQ ID NO:Y (where Y may be any of the 
polypeptide sequences disclosed in the sequence listing) are sufficiently accurate and 
otherwise suitable for a variety of uses well known in the art and described further below. 
For instance, SEQ ID NO:X is useful for designing nucleic acid hybridization probes that 
will detect nucleic acid sequences contained in SEQ ID NO:X or the cDNA contained in 
the deposited clone. These probes will also hybridize to nucleic acid molecules in 
biological samples, thereby enabling a variety of forensic and diagnostic methods of the 
invention. Similarly, polypeptides identified from SEQ ID NO:Y may be used, for 
example, to generate antibodies which bind specifically to proteins containing the 
polypeptides and the secreted proteins encoded by the cDNA clones identified in Table 
XIII. 

Nevertheless, DNA sequences generated by sequencing reactions can contain 
sequencing errors. The errors exist as misidentified nucleotides, or as insertions or 
deletions of nucleotides in the generated DNA sequence. The erroneously inserted or 
deleted nucleotides cause frame shifts in the reading frames of the predicted amino acid 
sequence. In these cases, the predicted amino acid sequence diverges from the actual 
amino acid sequence, even though the generated DNA sequence may be greater than 
99.9% identical to the actual DNA sequence (for example, one base insertion or deletion in 
an open reading frame of over 1000 bases). 
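
The effect described above can be illustrated with a short sketch. It assumes the standard genetic code, a hypothetical 1002-base open reading frame (not a sequence from the sequence listing), and a single simulated insertion; the helper names are invented for this illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(dna: str) -> str:
    """Translate reading frame 1; a trailing partial codon is ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def shared_prefix_length(a: str, b: str) -> int:
    """Number of leading positions at which the two strings agree."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


# Hypothetical ORF: start codon, 83 repeats of Ala-Glu-Lys-Leu, stop codon.
actual = "ATG" + "GCTGAAAAACTG" * 83 + "TAA"
# Simulated sequencing error: one spurious base inserted after base 300.
sequenced = actual[:300] + "A" + actual[300:]

# 1002 matched bases out of 1003 aligned columns.
nucleotide_identity = len(actual) / len(sequenced)
# Residues that still agree before the frame shift takes effect.
agreeing_residues = shared_prefix_length(translate(actual), translate(sequenced))
```

In this sketch the two DNA sequences are more than 99.9% identical, yet the predicted protein agrees with the actual protein only for the first 100 residues and then runs into a premature stop codon.
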

Accordingly, for those applications requiring precision in the nucleotide sequence 
or the amino acid sequence, the present invention provides not only the generated 
nucleotide sequence identified as SEQ ID NO:X and the predicted translated amino acid 
sequence identified as SEQ ID NO:Y, but also a sample of plasmid DNA containing a 
human cDNA of the invention deposited with the ATCC, as set forth in Table XIII. The 
nucleotide sequence of each deposited clone can readily be determined by sequencing the 
deposited clone in accordance with known methods. The predicted amino acid sequence 
can then be verified from such deposits. Moreover, the amino acid sequence of the protein 
encoded by a particular clone can also be directly determined by peptide sequencing or by 
expressing the protein in a suitable host cell containing the deposited human cDNA, 
collecting the protein, and determining its sequence. 

The present invention also relates to the genes corresponding to SEQ ID NO:X, 
SEQ ID NO:Y, or the deposited clone. The corresponding gene can be isolated in 
accordance with known methods using the sequence information disclosed herein. Such 
methods include preparing probes or primers from the disclosed sequence and identifying 
or amplifying the corresponding gene from appropriate sources of genomic material. 

Also provided in the present invention are allelic variants, orthologs, and/or species 
homologs. Procedures known in the art can be used to obtain full-length genes, allelic 
variants, splice variants, full-length coding portions, orthologs, and/or species homologs of 
genes corresponding to SEQ ID NO:X, SEQ ID NO:Y, or a deposited clone, using 
information from the sequences disclosed herein or the clones deposited with the ATCC. 
For example, allelic variants and/or species homologs may be isolated and identified by 
making suitable probes or primers from the sequences provided herein and screening a 
suitable nucleic acid source for allelic variants and/or the desired homologue. 

The polypeptides of the invention can be prepared in any suitable manner. Such 
polypeptides include isolated naturally occurring polypeptides, recombinantly produced 
polypeptides, synthetically produced polypeptides, or polypeptides produced by a 
combination of these methods. Means for preparing such polypeptides are well understood 
in the art. 

The polypeptides may be in the form of the secreted protein, including the mature 
form, or may be a part of a larger protein, such as a fusion protein (see below). It is often 
advantageous to include an additional amino acid sequence which contains secretory or 
leader sequences, pro-sequences, sequences which aid in purification, such as multiple 
histidine residues, or an additional sequence for stability during recombinant production. 

The polypeptides of the present invention are preferably provided in an isolated 
form, and preferably are substantially purified. A recombinantly produced version of a 
polypeptide, including the secreted polypeptide, can be substantially purified using 
techniques described herein or otherwise known in the art, such as, for example, by the 
one-step method described in Smith and Johnson, Gene 67:31-40 (1988). Polypeptides of 
the invention also can be purified from natural, synthetic or recombinant sources using 
techniques described herein or otherwise known in the art, such as, for example, antibodies 
of the invention raised against the secreted protein. 

The present invention provides a polynucleotide comprising, or alternatively 
consisting of, the nucleic acid sequence of SEQ ID NO:X, and/or a cDNA contained in 
ATCC deposit Z. The present invention also provides a polypeptide comprising, or 
alternatively consisting of, the polypeptide sequence of SEQ ID NO:Y and/or a 
polypeptide encoded by the cDNA contained in ATCC deposit Z. Polynucleotides 
encoding a polypeptide comprising, or alternatively consisting of, the polypeptide sequence 
of SEQ ID NO:Y and/or a polypeptide sequence encoded by the cDNA contained in 
ATCC deposit Z are also encompassed by the invention. 

Signal Sequences 

The present invention also encompasses mature forms of the polypeptide having 
the polypeptide sequence of SEQ ID NO:Y and/or the polypeptide sequence encoded by 
the cDNA in a deposited clone. Polynucleotides encoding the mature forms (such as, for 
example, the polynucleotide sequence in SEQ ID NO:X and/or the polynucleotide 
sequence contained in the cDNA of a deposited clone) are also encompassed by the 
invention. According to the signal hypothesis, proteins secreted by mammalian cells have 
a signal or secretory leader sequence which is cleaved from the mature protein once export 
of the growing protein chain across the rough endoplasmic reticulum has been initiated. 
Most mammalian cells and even insect cells cleave secreted proteins with the same 
specificity. However, in some cases, cleavage of a secreted protein is not entirely uniform, 
which results in two or more mature species of the protein. Further, it has long been 
known that cleavage specificity of a secreted protein is ultimately determined by the 
primary structure of the complete protein, that is, it is inherent in the amino acid sequence 
of the polypeptide. 

Methods for predicting whether a protein has a signal sequence, as well as the 
cleavage point for that sequence, are available. For instance, the method of McGeoch, 
Virus Res. 3:271-286 (1985), uses the information from a short N-terminal charged region 
and a subsequent uncharged region of the complete (uncleaved) protein. The method of 
von Heijne, Nucleic Acids Res. 14:4683-4690 (1986) uses the information from the 
residues surrounding the cleavage site, typically residues -13 to +2, where +1 indicates the 
amino terminus of the secreted protein. The accuracy of predicting the cleavage points of 
known mammalian secretory proteins for each of these methods is in the range of 75-80%. 
(von Heijne, supra.) However, the two methods do not always produce the same predicted 
cleavage point(s) for a given protein. 
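
The -13 to +2 numbering can be made concrete with a small sketch. It only extracts the window of residues around a proposed cleavage site; it does not implement the scoring of either published method. The function name and the precursor sequence are hypothetical.

```python
def cleavage_window(precursor: str, first_mature_index: int,
                    upstream: int = 13, downstream: int = 2) -> str:
    """Return residues -13..-1 and +1..+2 around a proposed cleavage site.

    first_mature_index is the 0-based index of residue +1, i.e. the amino
    terminus of the secreted protein. There is no position 0.
    """
    start = first_mature_index - upstream
    end = first_mature_index + downstream
    if start < 0 or end > len(precursor):
        raise ValueError("window extends beyond the precursor sequence")
    return precursor[start:end]


# Hypothetical precursor; residue +1 is assumed to be the Q at index 15.
precursor = "MKTLLALLVLAGSAAQDKPEV"
window = cleavage_window(precursor, first_mature_index=15)
positions = [p for p in range(-13, 3) if p != 0]   # -13..-1, +1, +2
labelled = list(zip(positions, window))
```

Here `labelled` pairs each residue with its position relative to the cleavage site, so the last entry of the signal peptide is position -1 and the first residue of the secreted protein is position +1.
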

In the present case, the deduced amino acid sequence of the secreted polypeptide 
was analyzed by a computer program called SignalP (Henrik Nielsen et al., Protein 
Engineering 10:1-6 (1997)), which predicts the cellular location of a protein based on the 
amino acid sequence. As part of this computational prediction of localization, the methods 
of McGeoch and von Heijne are incorporated. The analysis of the amino acid sequences of 
the secreted proteins described herein by this program provided the results shown in Table 
XIII. 

As one of ordinary skill would appreciate, however, cleavage sites sometimes vary 
from organism to organism and cannot be predicted with absolute certainty. Accordingly, 
the present invention provides secreted polypeptides having a sequence shown in SEQ ID 
NO:Y which have an N-terminus beginning within 5 residues (i.e., + or - 5 residues) of the 
predicted cleavage point. Similarly, it is also recognized that in some cases, cleavage of 
the signal sequence from a secreted protein is not entirely uniform, resulting in more than 
one secreted species. These polypeptides, and the polynucleotides encoding such 
polypeptides, are contemplated by the present invention. 
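
As an illustration of the "+ or - 5 residues" range, the sketch below enumerates the candidate secreted forms whose N-terminus lies within five residues of a predicted cleavage point. The function name and the precursor sequence are hypothetical and serve only to show the arithmetic.

```python
def candidate_mature_forms(precursor: str, predicted_first_mature_index: int,
                           tolerance: int = 5) -> dict:
    """Map each offset in -tolerance..+tolerance to the resulting mature sequence.

    predicted_first_mature_index is the 0-based index of the predicted first
    residue of the secreted portion. Offsets falling outside the precursor
    are skipped.
    """
    forms = {}
    for offset in range(-tolerance, tolerance + 1):
        start = predicted_first_mature_index + offset
        if 0 <= start < len(precursor):
            forms[offset] = precursor[start:]
    return forms


# Hypothetical precursor with a predicted first mature residue at index 15.
precursor = "MKTLLALLVLAGSAAQDKPEVTWNRS"
forms = candidate_mature_forms(precursor, predicted_first_mature_index=15)
```

Offset 0 corresponds to the predicted cleavage point itself; negative offsets retain part of the predicted signal sequence and positive offsets remove additional residues.
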

Moreover, the signal sequence identified by the above analysis may not necessarily 
predict the naturally occurring signal sequence. For example, the naturally occurring 
signal sequence may be further upstream from the predicted signal sequence. However, it 
is likely that the predicted signal sequence will be capable of directing the secreted protein 
to the ER. Nonetheless, the present invention provides the mature protein produced by 
expression of the polynucleotide sequence of SEQ ID NO:X and/or the polynucleotide 
sequence contained in the cDNA of a deposited clone, in a mammalian cell (e.g., COS 
cells, as described below). These polypeptides, and the polynucleotides encoding such 
polypeptides, are contemplated by the present invention. 

Polynucleotide and Polypeptide Variants 

The present invention is directed to variants of the polynucleotide sequence 
disclosed in SEQ ID NO:X, the complementary strand thereto, and/or the cDNA sequence 
contained in a deposited clone. 

The present invention also encompasses variants of the polypeptide sequence 
disclosed in SEQ ID NO:Y and/or encoded by a deposited clone. 

"Variant" refers to a polynucleotide or polypeptide differing from the 
polynucleotide or polypeptide of the present invention, but retaining essential properties 
thereof. Generally, variants are overall closely similar, and, in many regions, identical to 
the polynucleotide or polypeptide of the present invention. 

The present invention is also directed to nucleic acid molecules which comprise, or 
alternatively consist of, a nucleotide sequence which is at least 80%, 85%, 90%, 95%, 
96%, 97%, 98% or 99% identical to, for example, the nucleotide coding sequence in SEQ 
ID NO:X or the complementary strand thereto, the nucleotide coding sequence contained 
in a deposited cDNA clone or the complementary strand thereto, a nucleotide sequence 
encoding the polypeptide of SEQ ID NO:Y, a nucleotide sequence encoding the 
polypeptide encoded by the cDNA contained in a deposited clone, and/or polynucleotide 
fragments of any of these nucleic acid molecules (e.g., those fragments described herein). 
Polynucleotides which hybridize to these nucleic acid molecules under stringent 
hybridization conditions or lower stringency conditions are also encompassed by the 
invention, as are polypeptides encoded by these polynucleotides. 

The present invention is also directed to polypeptides which comprise, or 
alternatively consist of, an amino acid sequence which is at least 80%, 85%, 90%, 95%, 
96%, 97%, 98%, or 99% identical to, for example, the polypeptide sequence shown in SEQ 
ID NO:Y, the polypeptide sequence encoded by the cDNA contained in a deposited clone, 
and/or polypeptide fragments of any of these polypeptides (e.g., those fragments described 
herein). 

By a nucleic acid having a nucleotide sequence at least, for example, 95% 
"identical" to a reference nucleotide sequence of the present invention, it is intended that 
the nucleotide sequence of the nucleic acid is identical to the reference sequence except 
that the nucleotide sequence may include up to five point mutations per each 100 
nucleotides of the reference nucleotide sequence encoding the polypeptide. In other 
words, to obtain a nucleic acid having a nucleotide sequence at least 95% identical to a 
reference nucleotide sequence, up to 5% of the nucleotides in the reference sequence may 
be deleted or substituted with another nucleotide, or a number of nucleotides up to 5% of 
the total nucleotides in the reference sequence may be inserted into the reference sequence. 
The query sequence may be an entire sequence shown in Table XIII, the ORF (open 
reading frame), or any fragment specified as described herein. 
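
The relationship between a percent identity threshold and the number of permitted alterations is simple arithmetic, sketched below. The function name is an assumption made for this illustration, and the calculation uses whole-number percentages such as those recited above.

```python
def max_alterations(reference_length: int, min_identity_percent: int) -> int:
    """Largest number of altered positions compatible with the identity threshold.

    For example, 95% identity allows up to 5 alterations per 100 positions of
    the reference sequence. Integer arithmetic avoids rounding surprises.
    """
    if not 0 <= min_identity_percent <= 100:
        raise ValueError("identity must be between 0 and 100 percent")
    return reference_length * (100 - min_identity_percent) // 100


RECITED_THRESHOLDS = (80, 85, 90, 95, 96, 97, 98, 99)
# Permitted alterations for a hypothetical 1000-nucleotide reference sequence.
allowed_for_1000_nt = {t: max_alterations(1000, t) for t in RECITED_THRESHOLDS}
```

For a 1000-nucleotide reference, the 95% threshold therefore corresponds to at most 50 altered positions and the 99% threshold to at most 10.
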

As a practical matter, whether any particular nucleic acid molecule or polypeptide 
is at least 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% identical to a nucleotide 
sequence of the present invention can be determined conventionally using known 
computer programs. A preferred method for determining the best overall match between a 
query sequence (a sequence of the present invention) and a subject sequence, also referred 
to as a global sequence alignment, can be determined using the FASTDB computer 
program based on the algorithm of Brutlag et al. (Comp. App. Biosci. 6:237-245(1990)). 
In a sequence alignment the query and subject sequences are both DNA sequences. An 
RNA sequence can be compared by converting U's to T's. The result of said global 
sequence alignment is in percent identity. Preferred parameters used in a FASTDB 
alignment of DNA sequences to calculate percent identity are: Matrix=Unitary, k-tuple=4, 
Mismatch Penalty=1, Joining Penalty=30, Randomization Group Length=0, Cutoff 
Score=1, Gap Penalty=5, Gap Size Penalty=0.05, Window Size=500 or the length of the 
subject nucleotide sequence, whichever is shorter. 

If the subject sequence is shorter than the query sequence because of 5' or 3' 
deletions, not because of internal deletions, a manual correction must be made to the 
results. This is because the FASTDB program does not account for 5' and 3' truncations 
of the subject sequence when calculating percent identity. For subject sequences truncated 
at the 5' or 3' ends, relative to the query sequence, the percent identity is corrected by 
calculating the number of bases of the query sequence that are 5' and 3' of the subject 
sequence, which are not matched/aligned, as a percent of the total bases of the query 
sequence. Whether a nucleotide is matched/aligned is determined by results of the 
FASTDB sequence alignment. This percentage is then subtracted from the percent 
identity, calculated by the above FASTDB program using the specified parameters, to 
arrive at a final percent identity score. This corrected score is what is used for the 
purposes of the present invention. Only bases outside the 5' and 3' bases of the subject 
sequence, as displayed by the FASTDB alignment, which are not matched/aligned with the 
query sequence, are calculated for the purposes of manually adjusting the percent identity 
score. 

For example, a 90 base subject sequence is aligned to a 100 base query sequence to 
determine percent identity. The deletions occur at the 5' end of the subject sequence and 
therefore, the FASTDB alignment does not show a match/alignment of the first 10 bases 
at the 5' end. The 10 unpaired bases represent 10% of the sequence (number of bases at the 5' 
and 3' ends not matched/total number of bases in the query sequence) so 10% is subtracted 
from the percent identity score calculated by the FASTDB program. If the remaining 90 
bases were perfectly matched the final percent identity would be 90%. In another 
example, a 90 base subject sequence is compared with a 100 base query sequence. This 
time the deletions are internal deletions so that there are no bases on the 5' or 3' of the 
subject sequence which are not matched/aligned with the query. In this case the percent 
identity calculated by FASTDB is not manually corrected. Once again, only bases 5' and 
3' of the subject sequence which are not matched/aligned with the query sequence are 
manually corrected for. No other manual corrections are to be made for the purposes of the 
present invention. 
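
The manual correction described above reduces to one subtraction, sketched below. The function name is hypothetical; the percent identity passed in is assumed to be the value reported by the alignment program, and the counts of unmatched terminal query bases are assumed to be read from the alignment.

```python
def corrected_percent_identity(reported_percent_identity: float,
                               unmatched_5prime: int,
                               unmatched_3prime: int,
                               query_length: int) -> float:
    """Subtract the share of query bases lying 5' and 3' of the subject.

    Only query bases outside the ends of the subject sequence that are not
    matched/aligned are counted; internal deletions contribute nothing here.
    """
    if query_length <= 0:
        raise ValueError("query_length must be positive")
    penalty = 100.0 * (unmatched_5prime + unmatched_3prime) / query_length
    return reported_percent_identity - penalty


# First example from the text: 90-base subject, 100-base query, 10 bases
# missing at the 5' end, remaining 90 bases perfectly matched.
truncated = corrected_percent_identity(100.0, 10, 0, 100)

# Second example: the deletions are internal, so no terminal bases are
# unmatched and the reported value (here a placeholder) is left unchanged.
internal = corrected_percent_identity(88.0, 0, 0, 100)
```

The first call yields a final percent identity of 90%, matching the worked example; the second returns the reported value unchanged. The same subtraction applies to amino acid alignments with residues counted in place of bases.
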

By a polypeptide having an amino acid sequence at least, for example, 95% 
"identical" to a query amino acid sequence of the present invention, it is intended that the 
amino acid sequence of the subject polypeptide is identical to the query sequence except 
that the subject polypeptide sequence may include up to five amino acid alterations per 
each 100 amino acids of the query amino acid sequence. In other words, to obtain a 
polypeptide having an amino acid sequence at least 95% identical to a query amino acid 
sequence, up to 5% of the amino acid residues in the subject sequence may be inserted, 
deleted, (indels) or substituted with another amino acid. These alterations of the reference 
sequence may occur at the amino or carboxy terminal positions of the reference amino acid 
sequence or anywhere between those terminal positions, interspersed either individually 
among residues in the reference sequence or in one or more contiguous groups within the 
reference sequence. 

As a practical matter, whether any particular polypeptide is at least 80%, 85%, 
90%, 95%, 96%, 97%, 98% or 99% identical to, for instance, an amino acid sequence 
shown in Table XIII (SEQ ID NO:Y) or to the amino acid sequence encoded by cDNA 
contained in a deposited clone can be determined conventionally using known computer 
programs. A preferred method for determining the best overall match between a query 
sequence (a sequence of the present invention) and a subject sequence, also referred to as a 
global sequence alignment, can be determined using the FASTDB computer program 
based on the algorithm of Brutlag et al. (Comp. App. Biosci. 6:237-245(1990)). In a 
sequence alignment the query and subject sequences are either both nucleotide sequences 
or both amino acid sequences. The result of said global sequence alignment is in percent 
identity. Preferred parameters used in a FASTDB amino acid alignment are: 
Matrix=PAM 0, k-tuple=2, Mismatch Penalty=1, Joining Penalty=20, Randomization 
Group Length=0, Cutoff Score=1, Window Size=sequence length, Gap Penalty=5, Gap 
Size Penalty=0.05, Window Size=500 or the length of the subject amino acid sequence, 
whichever is shorter. 

If the subject sequence is shorter than the query sequence due to N- or C-terminal 
deletions, not because of internal deletions, a manual correction must be made to the 
results. This is because the FASTDB program does not account for N- and C-terminal 
truncations of the subject sequence when calculating global percent identity. For subject 
sequences truncated at the N- and C-termini, relative to the query sequence, the percent 
identity is corrected by calculating the number of residues of the query sequence that are 
N- and C-terminal of the subject sequence, which are not matched/aligned with a 
corresponding subject residue, as a percent of the total residues of the query sequence. 
Whether a residue is matched/aligned is determined by results of the FASTDB sequence 
alignment. This percentage is then subtracted from the percent identity, calculated by the 
above FASTDB program using the specified parameters, to arrive at a final percent 
identity score. This final percent identity score is what is used for the purposes of the 
present invention. Only residues to the N- and C-termini of the subject sequence, which 
are not matched/aligned with the query sequence, are considered for the purposes of 
manually adjusting the percent identity score. That is, only query residue positions outside 
the farthest N- and C-terminal residues of the subject sequence. 

For example, a 90 amino acid residue subject sequence is aligned with a 100 
residue query sequence to determine percent identity. The deletion occurs at the N- 
terminus of the subject sequence and therefore, the FASTDB alignment does not show a 
matching/alignment of the first 10 residues at the N-terminus. The 10 unpaired residues 
represent 10% of the sequence (number of residues at the N- and C- termini not 
matched/total number of residues in the query sequence) so 10% is subtracted from the 
percent identity score calculated by the FASTDB program. If the remaining 90 residues 
were perfectly matched the final percent identity would be 90%. In another example, a 90 
residue subject sequence is compared with a 100 residue query sequence. This time the 
deletions are internal deletions so there are no residues at the N- or C-termini of the subject 
sequence which are not matched/aligned with the query. In this case the percent identity 
calculated by FASTDB is not manually corrected. Once again, only residue positions 
outside the N- and C-terminal ends of the subject sequence, as displayed in the FASTDB 
alignment, which are not matched/aligned with the query sequence are manually corrected 
for. No other manual corrections are to be made for the purposes of the present invention. 

The variants may contain alterations in the coding regions, non-coding regions, or 
both. Especially preferred are polynucleotide variants containing alterations which 
produce silent substitutions, additions, or deletions, but do not alter the properties or 
activities of the encoded polypeptide. Nucleotide variants produced by silent substitutions 
due to the degeneracy of the genetic code are preferred. Moreover, variants in which 5-10, 
1-5, or 1-2 amino acids are substituted, deleted, or added in any combination are also 
preferred. Polynucleotide variants can be produced for a variety of reasons, e.g., to 
optimize codon expression for a particular host (change codons in the human mRNA to 
those preferred by a bacterial host such as E. coli). 
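
A silent substitution changes the nucleotide sequence without changing the encoded polypeptide. The sketch below checks this property for a pair of coding sequences, assuming the standard genetic code; the sequences and the function names are hypothetical, and no particular host codon preference table is implied.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(dna: str) -> str:
    """Translate reading frame 1; a trailing partial codon is ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def is_silent_variant(reference: str, variant: str) -> bool:
    """True if the variant differs in nucleotides but encodes the same protein."""
    return (
        reference != variant
        and len(reference) == len(variant)
        and translate(reference) == translate(variant)
    )


reference = "ATGCTGAAA"        # Met-Leu-Lys
synonymous = "ATGTTAAAG"       # Met-Leu-Lys using different Leu and Lys codons
nonsynonymous = "ATGCCGAAA"    # Met-Pro-Lys
```

In this sketch `synonymous` is a silent variant of `reference`, whereas `nonsynonymous` is not, because its second codon encodes proline instead of leucine.
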

Naturally occurring variants are called "allelic variants," and refer to one of several 
alternate forms of a gene occupying a given locus on a chromosome of an organism. 
(Genes II, Lewin, B., ed., John Wiley & Sons, New York (1985).) These allelic variants 
can vary at either the polynucleotide and/or polypeptide level and are included in the 
present invention. Alternatively, non-naturally occurring variants may be produced by 
mutagenesis techniques or by direct synthesis. 

Using known methods of protein engineering and recombinant DNA technology, 
variants may be generated to improve or alter the characteristics of the polypeptides of the 
present invention. For instance, one or more amino acids can be deleted from the N- 
terminus or C-terminus of the secreted protein without substantial loss of biological 
function. The authors of Ron et al., J. Biol. Chem. 268:2984-2988 (1993), reported 
variant KGF proteins having heparin binding activity even after deleting 3, 8, or 27 amino- 
terminal amino acid residues. Similarly, Interferon gamma exhibited up to ten times 
higher activity after deleting 8-10 amino acid residues from the carboxy terminus of this 
protein. (Dobeli et al., J. Biotechnology 7:199-216 (1988).) 

Moreover, ample evidence demonstrates that variants often retain a biological 
activity similar to that of the naturally occurring protein. For example, Gayle and 
coworkers (J. Biol. Chem. 268:22105-22111 (1993)) conducted extensive mutational 
analysis of human cytokine IL-1a. They used random mutagenesis to generate over 3,500 
individual IL-1a mutants that averaged 2.5 amino acid changes per variant over the entire 
length of the molecule. Multiple mutations were examined at every possible amino acid 
position. The investigators found that "[m]ost of the molecule could be altered with little 
effect on either [binding or biological activity]." (See, Abstract.) In fact, only 23 unique 
amino acid sequences, out of more than 3,500 nucleotide sequences examined, produced a 
protein that significantly differed in activity from wild-type. 

Furthermore, even if deleting one or more amino acids from the N-terminus or C- 
terminus of a polypeptide results in modification or loss of one or more biological 
functions, other biological activities may still be retained. For example, the ability of a 
deletion variant to induce and/or to bind antibodies which recognize the secreted form will 
likely be retained when less than the majority of the residues of the secreted form are 
removed from the N-terminus or C-terminus. Whether a particular polypeptide lacking N- 
or C-terminal residues of a protein retains such immunogenic activities can readily be 
determined by routine methods described herein and otherwise known in the art. 

Thus, the invention further includes polypeptide variants which show substantial 
biological activity. Such variants include deletions, insertions, inversions, repeats, and 
substitutions selected according to general rules known in the art so as to have little effect on 
activity. For example, guidance concerning how to make phenotypically silent amino acid 
substitutions is provided in Bowie et al., Science 247:1306-1310 (1990), wherein the 
authors indicate that there are two main strategies for studying the tolerance of an amino 
acid sequence to change. 

The first strategy exploits the tolerance of amino acid substitutions by natural 
selection during the process of evolution. By comparing amino acid sequences in different 
species, conserved amino acids can be identified. These conserved amino acids are likely 
important for protein function. In contrast, the tolerance of substitutions at an amino 
acid position by natural selection indicates that the position is not critical for 
protein function. Thus, positions tolerating amino acid substitution could be modified 
while still maintaining biological activity of the protein. 
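
The first strategy can be sketched as a column-by-column comparison of aligned sequences. The example assumes the sequences are already aligned to equal length, treats `-` as a gap character, and uses hypothetical sequences; the function name is invented for this illustration.

```python
def conserved_positions(aligned_sequences: list) -> list:
    """Return 0-based alignment columns where every sequence has the same residue.

    Columns containing a gap character ('-') are never reported as conserved.
    """
    if not aligned_sequences:
        return []
    length = len(aligned_sequences[0])
    if any(len(seq) != length for seq in aligned_sequences):
        raise ValueError("aligned sequences must have equal length")
    conserved = []
    for column in range(length):
        residues = {seq[column] for seq in aligned_sequences}
        if len(residues) == 1 and "-" not in residues:
            conserved.append(column)
    return conserved


# Hypothetical aligned sequences from three species.
alignment = ["MKTAYIAK", "MKSAYLAK", "MRTAYIAK"]
conserved = conserved_positions(alignment)
variable = [i for i in range(len(alignment[0])) if i not in conserved]
```

Columns listed in `variable` are those where substitutions have been tolerated across the compared species and are therefore candidates for modification.
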

The second strategy uses genetic engineering to introduce amino acid changes at 
specific positions of a cloned gene to identify regions critical for protein function. For 
example, site directed mutagenesis or alanine-scanning mutagenesis (introduction of single 
alanine mutations at every residue in the molecule) can be used. (Cunningham and Wells, 
Science 244:1081-1085 (1989).) The resulting mutant molecules can then be tested for 
biological activity. 
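
Alanine-scanning mutagenesis can be sketched as the generation of one single-substitution mutant per residue. The function name and the input sequence are hypothetical; as a simplifying assumption, positions that are already alanine are skipped because substituting them would produce no change.

```python
def alanine_scan(protein: str) -> list:
    """Return (1-based position, original residue, mutant sequence) tuples.

    Each mutant differs from the input at exactly one position, which is
    replaced by alanine ('A'). Existing alanines are skipped.
    """
    mutants = []
    for index, residue in enumerate(protein):
        if residue == "A":
            continue
        mutant = protein[:index] + "A" + protein[index + 1:]
        mutants.append((index + 1, residue, mutant))
    return mutants


# Hypothetical four-residue peptide in one-letter code.
scan = alanine_scan("MKAL")
```

Each resulting mutant would then be expressed and tested for biological activity to identify positions where the substitution is not tolerated.
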

As the authors state, these two strategies have revealed that proteins are 
surprisingly tolerant of amino acid substitutions. The authors further indicate which amino 
acid changes are likely to be permissive at certain amino acid positions in the protein. For 
example, most buried (within the tertiary structure of the protein) amino acid residues 
require nonpolar side chains, whereas few features of surface side chains are generally 
conserved. Moreover, tolerated conservative amino acid substitutions involve replacement 
of the aliphatic or hydrophobic amino acids Ala, Val, Leu and Ile; replacement of the 
hydroxyl residues Ser and Thr; replacement of the acidic residues Asp and Glu; 
replacement of the amide residues Asn and Gln; replacement of the basic residues Lys, 
Arg, and His; replacement of the aromatic residues Phe, Tyr, and Trp; and replacement of 
the small-sized amino acids Ala, Ser, Thr, Met, and Gly. 
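
The groups of tolerated conservative substitutions listed above can be encoded directly as sets. The sketch below treats a substitution as conservative when both residues fall in at least one common group; the group labels and the function name are chosen here for illustration.

```python
# Groups taken from the list above, using three-letter residue codes.
CONSERVATIVE_GROUPS = {
    "aliphatic_or_hydrophobic": {"Ala", "Val", "Leu", "Ile"},
    "hydroxyl": {"Ser", "Thr"},
    "acidic": {"Asp", "Glu"},
    "amide": {"Asn", "Gln"},
    "basic": {"Lys", "Arg", "His"},
    "aromatic": {"Phe", "Tyr", "Trp"},
    "small": {"Ala", "Ser", "Thr", "Met", "Gly"},
}


def is_conservative(original: str, replacement: str) -> bool:
    """True if the two different residues share at least one listed group."""
    if original == replacement:
        return False
    return any(
        original in members and replacement in members
        for members in CONSERVATIVE_GROUPS.values()
    )
```

For instance, `is_conservative("Leu", "Ile")` and `is_conservative("Ala", "Gly")` are both true under these groups, while `is_conservative("Lys", "Asp")` is false.
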

Besides conservative amino acid substitution, variants of the present invention 
include (i) substitutions with one or more of the non-conserved amino acid residues, where 
the substituted amino acid residues may or may not be one encoded by the genetic code, or 
(ii) substitution with one or more of amino acid residues having a substituent group, or (iii) 
fusion of the mature polypeptide with another compound, such as a compound to increase 
the stability and/or solubility of the polypeptide (for example, polyethylene glycol), or (iv) 
fusion of the polypeptide with additional amino acids, such as, for example, an IgG Fc 
fusion region peptide, or leader or secretory sequence, or a sequence facilitating 
purification. Such variant polypeptides are deemed to be within the scope of those skilled 
in the art from the teachings herein. 

For example, polypeptide variants containing amino acid substitutions of charged 
amino acids with other charged or neutral amino acids may produce proteins with 
improved characteristics, such as less aggregation. Aggregation of pharmaceutical 
formulations both reduces activity and increases clearance due to the aggregate's 
immunogenic activity. (Pinckard et al., Clin. Exp. Immunol. 2:331-340 (1967); Robbins et 
al., Diabetes 36: 838-845 (1987); Cleland et al., Crit. Rev. Therapeutic Drug Carrier 
Systems 10:307-377 (1993).) 

A further embodiment of the invention relates to a polypeptide which comprises the 
amino acid sequence of the present invention having an amino acid sequence which 
contains at least one amino acid substitution, but not more than 50 amino acid 
substitutions, even more preferably, not more than 40 amino acid substitutions, still more 
preferably, not more than 30 amino acid substitutions, and still even more preferably, not 
more than 20 amino acid substitutions. Of course, in order of ever-increasing preference, it 
is highly preferable for a peptide or polypeptide to have an amino acid sequence which 
comprises the amino acid sequence of the present invention, which contains at least one, 
but not more than 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1 amino acid substitutions. In specific 
embodiments, the number of additions, substitutions, and/or deletions in the amino acid 
sequence of the present invention or fragments thereof (e.g., the mature form and/or other 
fragments described herein), is 1-5, 5-10, 5-25, 5-50, 10-50 or 50-150; conservative amino 
acid substitutions are preferable. 

Polynucleotide and Polypeptide Fragments 

The present invention is also directed to polynucleotide fragments of the 
polynucleotides of the invention. 

In the present invention, a "polynucleotide fragment" refers to a short 
polynucleotide having a nucleic acid sequence which: is a portion of that contained in a 
deposited clone, or encoding the polypeptide encoded by the cDNA in a deposited clone; is 
a portion of that shown in SEQ ID NO:X or the complementary strand thereto, or is a 
portion of a polynucleotide sequence encoding the polypeptide of SEQ ID NO:Y. The 
nucleotide fragments of the invention are preferably at least about 15 nt, and more 
preferably at least about 20 nt, still more preferably at least about 30 nt, and even more 
preferably, at least about 40 nt, at least about 50 nt, at least about 75 nt, or at least about 
150 nt in length. A fragment "at least 20 nt in length," for example, is intended to include 
20 or more contiguous bases from the cDNA sequence contained in a deposited clone or 
the nucleotide sequence shown in SEQ ID NO:X. In this context "about" includes the 
particularly recited value, a value larger or smaller by several (5, 4, 3, 2, or 1) nucleotides, 
at either terminus or at both termini. These nucleotide fragments have uses that include, 
but are not limited to, as diagnostic probes and primers as discussed herein. Of course, 
larger fragments (e.g., 50, 150, 500, 600, 2000 nucleotides) are preferred. 

Moreover, representative examples of polynucleotide fragments of the invention, 
include, for example, fragments comprising, or alternatively consisting of, a sequence from 
about nucleotide number 1-50, 51-100, 101-150, 151-200, 201-250, 251-300, 301-350, 
351-400, 401-450, 451-500, 501-550, 551-600, 601-650, 651-700, 701-750, 751-800, 801-850, 851- 
900, 901-950, 951-1000, 1001-1050, 1051-1100, 1101-1150, 1151-1200, 1201-1250, 
1251-1300, 1301-1350, 1351-1400, 1401-1450, 1451-1500, 1501-1550, 1551-1600, 1601- 
1650, 1651-1700, 1701-1750, 1751-1800, 1801-1850, 1851-1900, 1901-1950, 1951-2000, 
or 2001 to the end of SEQ ID NO:X, or the complementary strand thereto, or the cDNA 
contained in a deposited clone. In this context "about" includes the particularly recited 
ranges, and ranges larger or smaller by several (5, 4, 3, 2, or 1) nucleotides, at either 
terminus or at both termini. Preferably, these fragments encode a polypeptide which has 
biological activity. More preferably, these polynucleotides can be used as probes or 
primers as discussed herein. Polynucleotides which hybridize to these nucleic acid 
molecules under stringent hybridization conditions or lower stringency conditions are also 
encompassed by the invention, as are polypeptides encoded by these polynucleotides. 
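
The representative 50-nucleotide windows listed above (1-50, 51-100, and so on through 1951-2000, followed by 2001 to the end) can be enumerated programmatically. The following is a minimal Python sketch under stated assumptions: positions are 1-based and inclusive, as in the text, and the function names, default parameters, and example sequence are hypothetical rather than taken from the disclosure.

```python
def fragment_ranges(seq_length, window=50, last_fixed_end=2000):
    """Return 1-based inclusive (start, end) windows covering a sequence.

    Windows are `window` nucleotides wide up to `last_fixed_end`; any
    remaining sequence is returned as one final open-ended fragment.
    """
    ranges = []
    start = 1
    while start <= min(seq_length, last_fixed_end):
        end = min(start + window - 1, seq_length)
        ranges.append((start, end))
        start += window
    if seq_length > last_fixed_end:
        # "2001 to the end" of the sequence
        ranges.append((last_fixed_end + 1, seq_length))
    return ranges


def extract_fragment(sequence, start, end):
    """Extract a fragment using 1-based inclusive coordinates."""
    return sequence[start - 1:end]


# Hypothetical 120-nt sequence: yields three windows.
print(fragment_ranges(120))
print(extract_fragment("ACGTACGTAC", 3, 6))
```

The "about" tolerance described in the text (several nucleotides at either terminus) could be modeled by shifting `start` and `end` by up to 5 positions; that step is omitted here for brevity.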

In the present invention, a "polypeptide fragment" refers to an amino acid sequence 
which is a portion of that contained in SEQ ID NO:Y or encoded by the cDNA contained 
in a deposited clone. Protein (polypeptide) fragments may be "free-standing," or 
comprised within a larger polypeptide of which the fragment forms a part or region, most 
preferably as a single continuous region. Representative examples of polypeptide 
fragments of the invention, include, for example, fragments comprising, or alternatively 
consisting of, from about amino acid number 1-20, 21-40, 41-60, 61-80, 81-100, 101-120, 
121-140, 141-160, or 161 to the end of the coding region. Moreover, polypeptide 
fragments can be about 20, 30, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, or 150 
amino acids in length. In this context "about" includes the particularly recited ranges or 
values, and ranges or values larger or smaller by several (5, 4, 3, 2, or 1) amino acids, at 
either extreme or at both extremes. Polynucleotides encoding these polypeptides are also 
encompassed by the invention. 

Preferred polypeptide fragments include the secreted protein as well as the mature 
form. Further preferred polypeptide fragments include the secreted protein or the mature 
form having a continuous series of deleted residues from the amino or the carboxy 
terminus, or both. For example, any number of amino acids, ranging from 1-60, can be 
deleted from the amino terminus of either the secreted polypeptide or the mature form. 
Similarly, any number of amino acids, ranging from 1-30, can be deleted from the carboxy 
terminus of the secreted protein or mature form. Furthermore, any combination of the 
above amino and carboxy terminus deletions is preferred. Similarly, polynucleotides 
encoding these polypeptide fragments are also preferred. 
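
The terminal deletion fragments described above (1-60 residues removed from the amino terminus, 1-30 from the carboxy terminus, or any combination) can be enumerated as follows. This is a minimal Python sketch for illustration only; the function name, the `min_length` guard, and the example sequence are assumptions that do not appear in the disclosure.

```python
def terminal_deletions(protein, max_n=60, max_c=30, min_length=1):
    """Yield (n_deleted, c_deleted, fragment) for every combination of
    amino-terminal and carboxy-terminal deletions within the given limits.

    The undeleted sequence (0, 0) is skipped, and combinations that would
    leave fewer than `min_length` residues are ignored.
    """
    for n_del in range(0, max_n + 1):
        for c_del in range(0, max_c + 1):
            if n_del == 0 and c_del == 0:
                continue
            remaining = len(protein) - n_del - c_del
            if remaining < min_length:
                continue
            yield n_del, c_del, protein[n_del:len(protein) - c_del]


# Hypothetical 8-residue sequence with small limits to keep the output short.
for n_del, c_del, fragment in terminal_deletions("MKTAYIAK", max_n=2, max_c=1):
    print(n_del, c_del, fragment)
```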

Also preferred are polypeptide and polynucleotide fragments characterized by 
structural or functional domains, such as fragments that comprise alpha-helix and alpha- 
helix forming regions, beta-sheet and beta-sheet-forming regions, turn and turn-forming 
regions, coil and coil-forming regions, hydrophilic regions, hydrophobic regions, alpha 
amphipathic regions, beta amphipathic regions, flexible regions, surface-forming regions, 
substrate binding region, and high antigenic index regions. Polypeptide fragments of SEQ 
ID NO:Y falling within conserved domains are specifically contemplated by the present 
invention. Moreover, polynucleotides encoding these domains are also contemplated. 

Other preferred polypeptide fragments are biologically active fragments. 
Biologically active fragments are those exhibiting activity similar, but not necessarily 
identical, to an activity of the polypeptide of the present invention. The biological activity 
of the fragments may include an improved desired activity, or a decreased undesirable 
activity. Polynucleotides encoding these polypeptide fragments are also encompassed by 
the invention. 

Preferably, the polynucleotide fragments of the invention encode a polypeptide 
which demonstrates a functional activity. By a polypeptide demonstrating a "functional 
activity" is meant a polypeptide capable of displaying one or more known functional 
activities associated with a full-length (complete) polypeptide of the invention. Such 
functional activities include, but are not limited to, biological activity, antigenicity [ability 
to bind (or compete with a polypeptide of the invention for binding) to an antibody to the 
polypeptide of the invention], immunogenicity (ability to generate antibody which binds to 
a polypeptide of the invention), ability to form multimers with polypeptides of the 
invention, and ability to bind to a receptor or ligand for a polypeptide of the invention. 

The functional activity of polypeptides of the invention, and fragments, variants, 
derivatives, and analogs thereof, can be assayed by various methods. 

For example, in one embodiment where one is assaying for the ability to bind or 
compete with full-length polypeptide of the invention for binding to an antibody of the 
polypeptide of the invention, various immunoassays known in the art can be used, 
including but not limited to, competitive and non-competitive assay systems using 
techniques such as radioimmunoassays, ELISA (enzyme linked immunosorbent assay), 
"sandwich" immunoassays, immunoradiometric assays, gel diffusion precipitation 
reactions, immunodiffusion assays, in situ immunoassays (using colloidal gold, enzyme or 
radioisotope labels, for example), western blots, precipitation reactions, agglutination 
assays (e.g., gel agglutination assays, hemagglutination assays), complement fixation 
assays, immunofluorescence assays, protein A assays, and immunoelectrophoresis assays, 
etc. In one embodiment, antibody binding is detected by detecting a label on the primary 
antibody. In another embodiment, the primary antibody is detected by detecting binding of 
a secondary antibody or reagent to the primary antibody. In a further embodiment, the 
secondary antibody is labeled. Many means are known in the art for detecting binding in 
an immunoassay and are within the scope of the present invention. 

In another embodiment, where a ligand for a polypeptide of the invention is 
identified, or the ability of a polypeptide fragment, variant or derivative of the invention to 
multimerize is being evaluated, binding can be assayed, e.g., by means well-known in the 
art, such as, for example, reducing and non-reducing gel chromatography, protein affinity 
chromatography, and affinity blotting. See generally, Phizicky, E., et al., 1995, Microbiol. 
Rev. 59:94-123. In another embodiment, physiological correlates of binding of a 
polypeptide of the invention to its substrates (signal transduction) can be assayed. 

In addition, assays described herein (see Examples) and otherwise known in the art 
may routinely be applied to measure the ability of polypeptides of the invention and 
fragments, variants, derivatives and analogs thereof to elicit biological activity 
related to that of the polypeptide of the invention (either in vitro or in vivo). Other 
methods will be known to the skilled artisan and are within the scope of the invention. 

Epitopes & Antibodies 

The present invention is also directed to polypeptide fragments comprising, or 
alternatively consisting of, an epitope of the polypeptide sequence shown in SEQ ID 
NO:Y, or the polypeptide sequence encoded by the cDNA contained in a deposited clone. 
Polynucleotides encoding these epitopes (such as, for example, the sequence disclosed in 
SEQ ID NO:X) are also encompassed by the invention, as are the nucleotide sequences of 
the complementary strand of the polynucleotides encoding these epitopes, and 
polynucleotides which hybridize to the complementary strand under stringent hybridization 
conditions or lower stringency conditions. 

In the present invention, "epitopes" refer to polypeptide fragments having antigenic 
or immunogenic activity in an animal, especially in a human. A preferred embodiment of 
the present invention relates to a polypeptide fragment comprising an epitope, as well as 
the polynucleotide encoding this fragment. A region of a protein molecule to which an 
antibody can bind is defined as an "antigenic epitope." In contrast, an "immunogenic 
epitope" is defined as a part of a protein that elicits an antibody response. (See, for 
instance, Geysen et al., Proc. Natl. Acad. Sci. USA 81:3998-4002 (1983).) 

Fragments which function as epitopes may be produced by any conventional 
means. (See, e.g., Houghten, R. A., Proc. Natl. Acad. Sci. USA 82:5131-5135 (1985) 
further described in U.S. Patent No. 4,631,211.) 

In the present invention, antigenic epitopes preferably contain a sequence of at least 
4, at least 5, at least 6, at least 7, more preferably at least 8, at least 9, at least 10, at least 
15, at least 20, at least 25, and most preferably between about 15 to about 30 amino acids. 
Preferred polypeptides comprising immunogenic or antigenic epitopes are at least 10, 15, 
20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, or 100 amino acid residues in 
length. Antigenic epitopes are useful, for example, to raise antibodies, including 
monoclonal antibodies, that specifically bind the epitope. (See, for instance, Wilson et al., 
Cell 37:767-778 (1984); Sutcliffe et al., Science 219:660-666 (1983).) 
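
To illustrate the epitope lengths recited above, the following minimal Python sketch enumerates every contiguous peptide of a given length from a protein sequence, as one might do when selecting candidate peptides for synthesis and antibody screening. It is an illustrative assumption, not a method taken from the disclosure: the function name and example sequence are hypothetical, and the sketch makes no prediction about which peptides are actually antigenic.

```python
def candidate_peptides(protein, length):
    """Return (start_position, peptide) pairs for every contiguous window
    of `length` residues. Start positions are 1-based."""
    if length < 1:
        raise ValueError("length must be at least 1")
    return [
        (i + 1, protein[i:i + length])
        for i in range(len(protein) - length + 1)
    ]


# Hypothetical 10-residue sequence scanned with 8-residue windows.
for start, peptide in candidate_peptides("MKTAYIAKQR", 8):
    print(start, peptide)
```

Calling the function with lengths from 15 to 30 would cover the range described above as most preferred; a sequence shorter than the requested length simply yields no candidates.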

Similarly, immunogenic epitopes can be used, for example, to induce antibodies 
according to methods well known in the art. (See, for instance, Sutcliffe et al., supra; 
Wilson et al., supra; Chow et al., Proc. Natl. Acad. Sci. USA 82:910-914; and Bittle et al., 
J. Gen. Virol. 66:2347-2354 (1985).) A preferred immunogenic epitope includes the 
secreted protein. The immunogenic epitopes may be presented together with a carrier 
protein, such as an albumin, to an animal system (such as rabbit or mouse) or, if it is long 
enough (at least about 25 amino acids), without a carrier. However, immunogenic epitopes 
comprising as few as 8 to 10 amino acids have been shown to be sufficient to raise 
antibodies capable of binding to, at the very least, linear epitopes in a denatured 
polypeptide (e.g., in Western blotting). 

Epitope-bearing polypeptides of the present invention may be used to induce 
antibodies according to methods well known in the art including, but not limited to, in vivo 
immunization, in vitro immunization, and phage display methods. See, e.g., Sutcliffe et 
al., supra; Wilson et al., supra, and Bittle et al., J. Gen. Virol., 66:2347-2354 (1985). If in 
vivo immunization is used, animals may be immunized with free peptide; however, 
anti-peptide antibody titer may be boosted by coupling of the peptide to a macromolecular 
carrier, such as keyhole limpet hemocyanin (KLH) or tetanus toxoid. For instance, 
peptides containing cysteine residues may be coupled to a carrier using a linker such as 
m-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS), while other peptides may be 
coupled to carriers using a more general linking agent such as glutaraldehyde. Animals 
such as rabbits, rats and mice are immunized with either free or carrier-coupled peptides, 
for instance, by intraperitoneal and/or intradermal injection of emulsions containing about 
100 µg of peptide or carrier protein and Freund's adjuvant. Several booster injections 
may be needed, for instance, at intervals of about two weeks, to provide a useful titer of 
anti-peptide antibody which can be detected, for example, by ELISA assay using free 
peptide adsorbed to a solid surface. The titer of anti-peptide antibodies in serum from an 
immunized animal may be increased by selection of anti-peptide antibodies, for instance, 
by adsorption to the peptide on a solid support and elution of the selected antibodies 
according to methods well known in the art. 

As one of skill in the art will appreciate, and as discussed above, the polypeptides of 
the present invention comprising an immunogenic or antigenic epitope can be fused to 
heterologous polypeptide sequences. For example, the polypeptides of the present 
invention may be fused with the constant domain of immunoglobulins (IgA, IgE, IgG, 
IgM), or portions thereof (CH1, CH2, CH3, any combination thereof including both entire 
domains and portions thereof) resulting in chimeric polypeptides. These fusion proteins 
facilitate purification, and show an increased half-life in vivo. This has been shown, e.g., 
for chimeric proteins consisting of the first two domains of the human CD4-polypeptide 
and various domains of the constant regions of the heavy or light chains of mammalian 
immunoglobulins. See, e.g., EPA 0,394,827; Traunecker et al., Nature, 331:84-86 (1988). 
Fusion proteins that have a disulfide-linked dimeric structure due to the IgG portion can 
also be more efficient in binding and neutralizing other molecules than monomeric 
polypeptides or fragments thereof alone. See, e.g., Fountoulakis et al., J. Biochem., 
270:3958-3964 (1995). Nucleic acids encoding the above epitopes can also be recombined 
with a gene of interest as an epitope tag to aid in detection and purification of the 
expressed polypeptide. 

Additional fusion proteins of the invention may be generated through the 
techniques of gene-shuffling, motif-shuffling, exon-shuffling, and/or codon-shuffling 
(collectively referred to as "DNA shuffling"). DNA shuffling may be employed to 
modulate the activities of polypeptides corresponding to SEQ ID NO:Y thereby effectively 
generating agonists and antagonists of the polypeptides. See, generally, U.S. Patent Nos. 
5,605,793, 5,811,238, 5,830,721, 5,834,252, and 5,837,458, and Patten, P.A., et al., Curr. 
Opinion Biotechnol. 8:724-33 (1997); Harayama, S., Trends Biotechnol. 16(2):76-82 
(1998); Hansson, L.O., et al., J. Mol. Biol. 287:265-76 (1999); and Lorenzo, M. M. and 
Blasco, R., Biotechniques 24(2):308-13 (1998) (each of these patents and publications is 
hereby incorporated by reference). In one embodiment, alteration of polynucleotides 
corresponding to SEQ ID NO:X and corresponding polypeptides may be achieved by DNA 
shuffling. DNA shuffling involves the assembly of two or more DNA segments into a 
desired molecule corresponding to SEQ ID NO:X polynucleotides of the invention by 
homologous, or site-specific, recombination. In another embodiment, polynucleotides 
corresponding to SEQ ID NO:X and corresponding polypeptides may be altered by being 
subjected to random mutagenesis by error-prone PCR, random nucleotide insertion or 
other methods prior to recombination. In another embodiment, one or more components, 
motifs, sections, parts, domains, fragments, etc., of coding polynucleotide corresponding to 
SEQ ID NO:X, or the polypeptide encoded thereby may be recombined with one or more 
components, motifs, sections, parts, domains, fragments, etc. of one or more heterologous 
molecules. 

Antibodies 

The present invention further relates to antibodies and T-cell antigen receptors 
(TCR) which specifically bind the polypeptides of the present invention. The antibodies of 
the present invention include IgG (including IgG1, IgG2, IgG3, and IgG4), IgA (including 
IgA1 and IgA2), IgD, IgE, or IgM, and IgY. As used herein, the term "antibody" (Ab) is 
meant to include whole antibodies, including single-chain whole antibodies, and antigen- 
binding fragments thereof. Most preferably the antibodies are human antigen binding 
antibody fragments of the present invention and include, but are not limited to, Fab, Fab' 
and F(ab')2, Fd, single-chain Fvs (scFv), single-chain antibodies, disulfide-linked Fvs 
(sdFv) and fragments comprising either a VL or VH domain. The antibodies may be from 
any animal origin including birds and mammals. Preferably, the antibodies are human, 
murine, rabbit, goat, guinea pig, camel, horse, or chicken. 

Antigen-binding antibody fragments, including single-chain antibodies, may 
comprise the variable region(s) alone or in combination with all or part of the 
following: hinge region, CH1, CH2, and CH3 domains. Also included in the invention are 
any combinations of variable region(s) and hinge region, CH1, CH2, and CH3 domains. 
The present invention further includes monoclonal, polyclonal, chimeric, humanized, and 
human monoclonal and human polyclonal antibodies which specifically bind the 
polypeptides of the present invention. The present invention further includes antibodies 
which are anti-idiotypic to the antibodies of the present invention. 

The antibodies of the present invention may be monospecific, bispecific, trispecific 
or of greater multispecificity. Multispecific antibodies may be specific for different 
epitopes of a polypeptide of the present invention or may be specific for both a polypeptide 
of the present invention as well as for heterologous compositions, such as a heterologous 
polypeptide or solid support material. See, e.g., WO 93/17715; WO 92/08802; WO 
91/00360; WO 92/05793; Tutt, et al., J. Immunol. 147:60-69 (1991); US Patents 
5,573,920, 4,474,893, 5,601,819, 4,714,681, 4,925,648; Kostelny et al., J. Immunol. 
148:1547-1553 (1992). 

Antibodies of the present invention may be described or specified in terms of the 
epitope(s) or portion(s) of a polypeptide of the present invention which are recognized or 
specifically bound by the antibody. The epitope(s) or polypeptide portion(s) may be 
specified as described herein, e.g., by N-terminal and C-terminal positions, by size in 
contiguous amino acid residues, or listed in the Tables and Figures. Antibodies which 
specifically bind any epitope or polypeptide of the present invention may also be excluded. 
Therefore, the present invention includes antibodies that specifically bind polypeptides of 
the present invention, and allows for the exclusion of the same. 

Antibodies of the present invention may also be described or specified in terms of 
their cross-reactivity. Antibodies that do not bind any other analog, ortholog, or homolog 
of the polypeptides of the present invention are included. Antibodies that do not bind 
polypeptides with less than 95%, less than 90%, less than 85%, less than 80%, less than 
75%, less than 70%, less than 65%, less than 60%, less than 55%, and less than 50% 
identity (as calculated using methods known in the art and described herein) to a 
polypeptide of the present invention are also included in the present invention. Further 
included in the present invention are antibodies which only bind polypeptides encoded by 
polynucleotides which hybridize to a polynucleotide of the present invention under 
stringent hybridization conditions (as described herein). Antibodies of the present 
invention may also be described or specified in terms of their binding affinity. Preferred 
binding affinities include those with a dissociation constant or Kd less than 5x10^-6 M, 
10^-6 M, 5x10^-7 M, 10^-7 M, 5x10^-8 M, 10^-8 M, 5x10^-9 M, 10^-9 M, 5x10^-10 M, 10^-10 M, 
5x10^-11 M, 10^-11 M, 5x10^-12 M, 10^-12 M, 5x10^-13 M, 10^-13 M, 5x10^-14 M, 10^-14 M, 
5x10^-15 M, and 10^-15 M. 
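
The preferred dissociation constant thresholds form a regular series running from 5x10^-6 M down to 10^-15 M, with two values per decade. The following minimal Python sketch, given only as an illustration, generates that series and reports the tightest threshold that a measured Kd falls below. The function names and the example Kd value are hypothetical and are not drawn from the disclosure.

```python
def kd_thresholds():
    """Return the series 5e-6, 1e-6, 5e-7, 1e-7, ..., 5e-15, 1e-15 (in molar)."""
    thresholds = []
    for exponent in range(6, 16):
        thresholds.append(5 * 10.0 ** -exponent)
        thresholds.append(10.0 ** -exponent)
    return thresholds


def tightest_threshold_met(kd):
    """Return the smallest threshold that `kd` is strictly less than,
    or None if `kd` is not below even the loosest threshold."""
    met = [t for t in kd_thresholds() if kd < t]
    return min(met) if met else None


# Hypothetical antibody with a measured Kd of 3 nM.
print(len(kd_thresholds()))
print(tightest_threshold_met(3e-9))
```

A smaller Kd indicates tighter binding, so an antibody satisfying a given threshold also satisfies every larger one in the series.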

Antibodies of the present invention have uses that include, but are not limited to, 
methods known in the art to purify, detect, and target the polypeptides of the present 
invention including both in vitro and in vivo diagnostic and therapeutic methods. For 
example, the antibodies have use in immunoassays for qualitatively and quantitatively 
measuring levels of the polypeptides of the present invention in biological samples. See, 
e.g., Harlow et al., ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor 
Laboratory Press, 2nd ed. 1988) (incorporated by reference in the entirety). 

The antibodies of the present invention may be used either alone or in combination 
with other compositions. The antibodies may further be recombinantly fused to a 
heterologous polypeptide at the N- or C-terminus or chemically conjugated (including 
covalent and non-covalent conjugations) to polypeptides or other compositions. For 
example, antibodies of the present invention may be recombinantly fused or conjugated to 
molecules useful as labels in detection assays and effector molecules such as heterologous 
polypeptides, drugs, or toxins. See, e.g., WO 92/08495; WO 91/14438; WO 89/12624; US 
Patent 5,314,995; and EP 0 396 387. 

The antibodies of the present invention may be prepared by any suitable method known in 
the art. For example, a polypeptide of the present invention or an antigenic fragment 
thereof can be administered to an animal in order to induce the production of sera 
containing polyclonal antibodies. The term "monoclonal antibody" is not limited to 
antibodies produced through hybridoma technology. The term "monoclonal antibody" 
refers to an antibody that is derived from a single clone, including any eukaryotic, 
prokaryotic, or phage clone, and not the method by which it is produced. Monoclonal 
antibodies can be prepared using a wide variety of techniques known in the art including 
the use of hybridoma, recombinant, and phage display technology. 

Hybridoma techniques include those known in the art and taught in Harlow et al., 
ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor Laboratory Press, 
2nd ed. 1988); Hammerling, et al., in: MONOCLONAL ANTIBODIES AND T-CELL 
HYBRIDOMAS 563-681 (Elsevier, N.Y., 1981) (said references incorporated by 
reference in their entireties). Fab and F(ab')2 fragments may be produced by proteolytic 
cleavage, using enzymes such as papain (to produce Fab fragments) or pepsin (to produce 
F(ab')2 fragments). 

Alternatively, antibodies of the present invention can be produced through the 
application of recombinant DNA and phage display technology or through synthetic 
chemistry using methods known in the art. For example, the antibodies of the present 
invention can be prepared using various phage display methods known in the art. In phage 
display methods, functional antibody domains are displayed on the surface of a phage 
particle which carries polynucleotide sequences encoding them. Phage with a desired 
binding property are selected from a repertoire or combinatorial antibody library (e.g. 
human or murine) by selecting directly with antigen, typically antigen bound or captured to 
a solid surface or bead. Phage used in these methods are typically filamentous phage 
including fd and M13 with Fab, Fv or disulfide stabilized Fv antibody domains 
recombinantly fused to either the phage gene III or gene VIII protein. Examples of phage 
display methods that can be used to make the antibodies of the present invention include 
those disclosed in Brinkman et al., J. Immunol. Methods 182:41-50 (1995); Ames et al., J. 
Immunol. Methods 184:177-186 (1995); Kettleborough et al., Eur. J. Immunol. 24:952-958 
(1994); Persic et al., Gene 187:9-18 (1997); Burton et al., Advances in Immunology 
57:191-280 (1994); PCT/GB91/01134; WO 90/02809; WO 91/10737; WO 92/01047; WO 
92/18619; WO 93/11236; WO 95/15982; WO 95/20401; and US Patents 5,698,426, 
5,223,409, 5,403,484, 5,580,717, 5,427,908, 5,750,753, 5,821,047, 5,571,698, 5,427,908, 
5,516,637, 5,780,225, 5,658,727 and 5,733,743 (said references incorporated by reference 
in their entireties). 

As described in the above references, after phage selection, the antibody coding 
regions from the phage can be isolated and used to generate whole antibodies, including 
human antibodies, or any other desired antigen binding fragment, and expressed in any 
desired host including mammalian cells, insect cells, plant cells, yeast, and bacteria. For 
example, techniques to recombinantly produce Fab, Fab' and F(ab')2 fragments can also be 
employed using methods known in the art such as those disclosed in WO 92/22324; 
Mullinax et al., BioTechniques 12(6):864-869 (1992); Sawai et al., AJRI 34:26-34 
(1995); and Better et al., Science 240:1041-1043 (1988) (said references incorporated by 
reference in their entireties). 

Examples of techniques which can be used to produce single-chain Fvs and 
antibodies include those described in U.S. Patents 4,946,778 and 5,258,498; Huston et al., 
Methods in Enzymology 203:46-88 (1991); Shu, L. et al., PNAS 90:7995-7999 (1993); 
and Skerra et al., Science 240:1038-1040 (1988). For some uses, including in vivo use of 
antibodies in humans and in vitro detection assays, it may be preferable to use chimeric, 
humanized, or human antibodies. Methods for producing chimeric antibodies are known 
in the art. See e.g., Morrison, Science 229:1202 (1985); Oi et al., BioTechniques 4:214 
(1986); Gillies et al., (1989) J. Immunol. Methods 125:191-202; and US Patent 5,807,715. 
Antibodies can be humanized using a variety of techniques including CDR-grafting (EP 0 
239 400; WO 91/09967; US Patent 5,530,101; and 5,585,089), veneering or resurfacing 
(EP 0 592 106; EP 0 519 596; Padlan E.A., Molecular Immunology 28(4/5):489-498 
(1991); Studnicka et al., Protein Engineering 7(6):805-814 (1994); Roguska et al., PNAS 
91:969-973 (1994)), and chain shuffling (US Patent 5,565,332). Human antibodies can be 
made by a variety of methods known in the art including phage display methods described 
above. See also, US Patents 4,444,887, 4,716,111, 5,545,806, and 5,814,318; and WO 
98/46645, WO 98/50433, WO 98/24893, WO 98/16654, WO 96/34096, WO 96/33735, 
and WO 91/10741 (said references incorporated by reference in their entireties). 

Further included in the present invention are antibodies recombinantly fused or 
chemically conjugated (including both covalent and non-covalent conjugations) to a 
polypeptide of the present invention. The antibodies may be specific for antigens other 
than polypeptides of the present invention. For example, antibodies may be used to target 
the polypeptides of the present invention to particular cell types, either in vitro or in vivo, 
by fusing or conjugating the polypeptides of the present invention to antibodies specific for 
particular cell surface receptors. Antibodies fused or conjugated to the polypeptides of the 
present invention may also be used in in vitro immunoassays and purification methods 
using methods known in the art. See e.g., Harlow et al., supra and WO 93/21232; EP 0 439 
095; Naramura et al., Immunol. Lett. 39:91-99 (1994); US Patent 5,474,981; Gillies et al., 
PNAS 89:1428-1432 (1992); Fell et al., J. Immunol. 146:2446-2452 (1991) (said 
references incorporated by reference in their entireties). 

The present invention further includes compositions comprising the polypeptides of 
the present invention fused or conjugated to antibody domains other than the variable 
regions. For example, the polypeptides of the present invention may be fused or 
conjugated to an antibody Fc region, or portion thereof. The antibody portion fused to a 
polypeptide of the present invention may comprise the hinge region, CH1 domain, CH2 
domain, and CH3 domain or any combination of whole domains or portions thereof. The 
polypeptides of the present invention may be fused or conjugated to the above antibody 
portions to increase the in vivo half life of the polypeptides or for use in immunoassays 
using methods known in the art. The polypeptides may also be fused or conjugated to the 
above antibody portions to form multimers. For example, Fc portions fused to the 
polypeptides of the present invention can form dimers through disulfide bonding between 
the Fc portions. Higher multimeric forms can be made by fusing the polypeptides to 
portions of IgA and IgM. Methods for fusing or conjugating the polypeptides of the present 
invention to antibody portions are known in the art. See e.g., US Patents 5,336,603, 
5,622,929, 5,359,046, 5,349,053, 5,447,851, 5,112,946; EP 0 307 434, EP 0 367 166; WO 
96/04388, WO 91/06570; Ashkenazi et al., PNAS 88:10535-10539 (1991); Zheng et al., J. 
Immunol. 154:5590-5600 (1995); and Vil et al., PNAS 89:11337-11341 (1992) (said 
references incorporated by reference in their entireties). 

The invention further relates to antibodies which act as agonists or antagonists of 
the polypeptides of the present invention. For example, the present invention includes 
antibodies which disrupt the receptor/ligand interactions with the polypeptides of the 
invention either partially or fully. Included are both receptor-specific antibodies and 
ligand-specific antibodies. Included are receptor-specific antibodies which do not prevent 
ligand binding but prevent receptor activation. Receptor activation (i.e., signaling) may be 
determined by techniques described herein or otherwise known in the art. Also included 
are receptor-specific antibodies which both prevent ligand binding and receptor activation. 
Likewise, included are neutralizing antibodies which bind the ligand and prevent binding 
of the ligand to the receptor, as well as antibodies which bind the ligand, thereby 
preventing receptor activation, but do not prevent the ligand from binding the receptor. 
Further included are antibodies which activate the receptor. These antibodies may act as 
agonists for either all or less than all of the biological activities affected by ligand- 
mediated receptor activation. The antibodies may be specified as agonists or antagonists 
for biological activities comprising specific activities disclosed herein. The above 
antibody agonists can be made using methods known in the art. See e.g., WO 96/40281; 
US Patent 5,811,097; Deng et al., Blood 92(6):1981-1988 (1998); Chen, et al., Cancer Res. 
58(16):3668-3678 (1998); Harrop et al., J. Immunol. 161(4):1786-1794 (1998); Zhu et al., 
Cancer Res. 58(15):3209-3214 (1998); Yoon, et al., J. Immunol. 160(7):3170-3179 (1998); 
Prat et al., J. Cell. Sci. 111(Pt2):237-247 (1998); Pitard et al., J. Immunol. Methods 
205(2):177-190 (1997); Liautard et al., Cytokine 9(4):233-241 (1997); Carlson et al., J. 
Biol. Chem. 272(17):11295-11301 (1997); Taryman et al., Neuron 14(4):755-762 (1995); 
Muller et al., Structure 6(9):1153-1167 (1998); Bartunek et al., Cytokine 8(1):14-20 (1996) 
(said references incorporated by reference in their entireties). 

As discussed above, antibodies to the polypeptides of the invention can, in turn, be 
utilized to generate anti-idiotype antibodies that "mimic" polypeptides of the invention 
using techniques well known to those skilled in the art. (See, e.g., Greenspan & Bona, 
FASEB J. 7(5):437-444 (1989); and Nissinoff, J. Immunol. 147(8):2429-2438 (1991)).

For example, antibodies which bind to and competitively inhibit polypeptide
multimerization and/or binding of a polypeptide of the invention to ligand can be used to
generate anti-idiotypes that "mimic" the polypeptide multimerization and/or binding
domain and, as a consequence, bind to and neutralize polypeptide and/or its ligand. Such
neutralizing anti-idiotypes or Fab fragments of such anti-idiotypes can be used in
therapeutic regimens to neutralize polypeptide ligand. For example, such anti-idiotypic
antibodies can be used to bind a polypeptide of the invention and/or to bind its 
ligands/receptors, and thereby block its biological activity. 

The invention further relates to a diagnostic kit for use in screening serum 
containing antibodies specific against proliferative and/or cancerous polynucleotides and 
polypeptides. Such a kit may include a substantially isolated polypeptide antigen 
comprising an epitope which is specifically immunoreactive with at least one anti- 
polypeptide antigen antibody. Such a kit also includes means for detecting the binding of 
said antibody to the antigen. In specific embodiments, the kit may include a recombinantly 
produced or chemically synthesized polypeptide antigen. The polypeptide antigen of the 
kit may also be attached to a solid support. 

In a more specific embodiment, the detecting means of the above-described kit
includes a solid support to which said polypeptide antigen is attached. Such a kit may also 
include a non-attached reporter-labelled anti-human antibody. In this embodiment, binding 
of the antibody to the polypeptide antigen can be detected by binding of the said reporter- 
labelled antibody. 

The invention further includes a method of detecting proliferative and/or cancerous 
disorders and conditions in a test subject. This detection method includes reacting serum 
from a test subject (e.g. one in which proliferative and/or cancerous cells or tissues may be 
present) with a substantially isolated polypeptide and/or polynucleotide antigen, and 
examining the antigen for the presence of bound antibody. In a specific embodiment, the 
method includes a polypeptide antigen attached to a solid support, and the serum is reacted 
with the support. Subsequently, the support is reacted with a reporter-labelled anti-human
antibody. The solid support is then examined for the presence of reporter-labelled 
antibody. 

Additionally, the invention includes a proliferative condition vaccine composition. 
The composition includes a substantially isolated polypeptide and/or polynucleotide
antigen, where the antigen includes an epitope which is specifically immunoreactive with
at least one antibody specific for the epitope. The peptide and/or polynucleotide antigen may
be produced according to methods known in the art, including recombinant expression or
chemical synthesis. The peptide antigen is preferably present in a pharmacologically
effective dose in a pharmaceutically acceptable carrier.

Further, the invention includes a monoclonal antibody that is specifically 
immunoreactive with polypeptide and/or polynucleotide epitopes. The invention includes a 
substantially isolated preparation of polyclonal antibodies specifically immunoreactive 
with polynucleotides and/or polypeptides of the present invention. In a more specific 
embodiment, such polyclonal antibodies are prepared by affinity chromatography, in
addition to other methods known in the art.

In another embodiment, the invention includes a method for producing antibodies to
polypeptide and/or polynucleotide antigens. The method includes administering to a test 
subject a substantially isolated polypeptide and/or polynucleotide antigen, where the 
antigen includes an epitope which is specifically immunoreactive with at least one anti-
polypeptide and/or polynucleotide antibody. The antigen is administered in an amount 
sufficient to produce an immune response in the subject. 

In an additional embodiment, the invention includes a diagnostic kit for use in 
screening serum containing antigens of the polypeptide of the invention. The diagnostic kit 
includes a substantially isolated antibody specifically immunoreactive with polypeptide or
polynucleotide antigens, and means for detecting the binding of the polynucleotide or 
polypeptide antigen to the antibody. In one embodiment, the antibody is attached to a solid 
support. In a specific embodiment, the antibody may be a monoclonal antibody. The 
detecting means of the kit may include a second, labelled monoclonal antibody. 
Alternatively, or in addition, the detecting means may include a labelled, competing
antigen. 

In one diagnostic configuration, test serum is reacted with a solid phase reagent 
having a surface-bound antigen obtained by the methods of the present invention. After
binding of antigen-specific antibody to the reagent and removing unbound serum
components by washing, the reagent is reacted with reporter-labelled anti-human antibody 
to bind reporter to the reagent in proportion to the amount of bound anti-antigen antibody 
on the solid support. The reagent is again washed to remove unbound labelled antibody, 
and the amount of reporter associated with the reagent is determined. Typically, the 
reporter is an enzyme which is detected by incubating the solid phase in the presence of a 
suitable fluorometric or colorimetric substrate (Sigma, St. Louis, MO). 
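
By way of illustration only, the quantitation step of the above assay can be sketched in Python. The sketch assumes that the reporter signal is read as an optical density, that a blank well is subtracted, and that a serum is scored positive when its signal exceeds the mean plus three standard deviations of negative-control sera. The cutoff rule, the function name score_sera, and all readings shown are hypothetical and are not taken from the present disclosure.

```python
from statistics import mean, stdev


def score_sera(od_by_sample, negative_controls, blank_od, n_sd=3.0):
    """Blank-correct optical densities and flag samples above a cutoff.

    The cutoff (mean + n_sd * SD of the negative controls) is an assumed
    convention used for illustration, not a rule stated in the text.
    """
    corrected_neg = [od - blank_od for od in negative_controls]
    cutoff = mean(corrected_neg) + n_sd * stdev(corrected_neg)
    results = {}
    for name, od in od_by_sample.items():
        signal = od - blank_od  # reporter signal attributable to bound antibody
        results[name] = (round(signal, 3), signal > cutoff)
    return cutoff, results


# Hypothetical plate readings (arbitrary absorbance units).
cutoff, results = score_sera(
    {"serum_A": 1.25, "serum_B": 0.12},
    negative_controls=[0.10, 0.12, 0.11, 0.09],
    blank_od=0.05,
)
for name, (signal, positive) in results.items():
    print(name, signal, "positive" if positive else "negative")
```

Because the amount of bound reporter is proportional to the amount of bound anti-antigen antibody, the blank-corrected signal can be compared directly across wells processed on the same plate.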

The solid surface reagent in the above assay is prepared by known techniques for 
attaching protein material to solid support material, such as polymeric beads, dipsticks,
96-well plates, or filter material. These attachment methods generally include non-specific
adsorption of the protein to the support or covalent attachment of the protein, typically 
through a free amine group, to a chemically reactive group on the solid support, such as an 
activated carboxyl, hydroxyl, or aldehyde group. Alternatively, streptavidin coated plates 
can be used in conjunction with biotinylated antigen(s). 

Thus, the invention provides an assay system or kit for carrying out this diagnostic
method. The kit generally includes a support with surface-bound recombinant antigens,
and a reporter-labelled anti-human antibody for detecting surface-bound anti-antigen
antibody. 

Fusion Proteins 

Any polypeptide of the present invention can be used to generate fusion proteins. 
For example, the polypeptide of the present invention, when fused to a second protein, can 
be used as an antigenic tag. Antibodies raised against the polypeptide of the present 
invention can be used to indirectly detect the second protein by binding to the polypeptide. 
Moreover, because secreted proteins target cellular locations based on trafficking signals, 
the polypeptides of the present invention can be used as targeting molecules once fused to 
other proteins.

Examples of domains that can be fused to polypeptides of the present invention
include not only heterologous signal sequences, but also other heterologous functional 
regions. The fusion does not necessarily need to be direct, but may occur through linker 
sequences. 

Moreover, fusion proteins may also be engineered to improve characteristics of the 
polypeptide of the present invention. For instance, a region of additional amino acids, 
particularly charged amino acids, may be added to the N-terminus of the polypeptide to
improve stability and persistence during purification from the host cell or subsequent 
handling and storage. Also, peptide moieties may be added to the polypeptide to facilitate 
purification. Such regions may be removed prior to final preparation of the polypeptide. 
The addition of peptide moieties to facilitate handling of polypeptides is a familiar and
routine technique in the art.

Moreover, polypeptides of the present invention, including fragments, and 
specifically epitopes, can be combined with parts of the constant domain of 
immunoglobulins (IgA, IgE, IgG, IgM) or portions thereof (CH1, CH2, CH3, and any
combination thereof, including both entire domains and portions thereof), resulting in 
chimeric polypeptides. These fusion proteins facilitate purification and show an increased 
half-life in vivo. One reported example describes chimeric proteins consisting of the first 
two domains of the human CD4-polypeptide and various domains of the constant regions
of the heavy or light chains of mammalian immunoglobulins. (EP A 394,827; Traunecker
et al., Nature 331:84-86 (1988).) Fusion proteins having disulfide-linked dimeric
structures (due to the IgG) can also be more efficient in binding and neutralizing other
molecules than the monomeric secreted protein or protein fragment alone. (Fountoulakis
et al., J. Biochem. 270:3958-3964 (1995).) 

Similarly, EP-A-0 464 533 (Canadian counterpart 2045869) discloses fusion
proteins comprising various portions of constant region of immunoglobulin molecules 
together with another human protein or part thereof. In many cases, the Fc part in a fusion 
protein is beneficial in therapy and diagnosis, and thus can result in, for example, improved
pharmacokinetic properties. (EP-A 0232 262.) Alternatively, deleting the Fc part after the
fusion protein has been expressed, detected, and purified, would be desired. For example, 
the Fc portion may hinder therapy and diagnosis if the fusion protein is used as an antigen 
for immunizations. In drug discovery, for example, human proteins, such as hIL-5, have 
been fused with Fc portions for the purpose of high-throughput screening assays to identify
antagonists of hIL-5. (See, D. Bennett et al., J. Molecular Recognition 8:52-58 (1995); K.
Johanson et al., J. Biol. Chem. 270:9459-9471 (1995).) 

Moreover, the polypeptides of the present invention can be fused to marker 
sequences, such as a peptide which facilitates purification of the fused polypeptide. In 
preferred embodiments, the marker amino acid sequence is a hexa-histidine peptide, such 
as the tag provided in a pQE vector (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 
91311), among others, many of which are commercially available. As described in Gentz 
et al., Proc. Natl. Acad. Sci. USA 86:821-824 (1989), for instance, hexa-histidine provides 
for convenient purification of the fusion protein. Another peptide tag useful for 
purification, the "HA" tag, corresponds to an epitope derived from the influenza 
hemagglutinin protein. (Wilson et al., Cell 37:767 (1984).) 
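
As a non-limiting illustration, the assembly of such a tagged fusion at the amino acid sequence level can be sketched in Python. The hexa-histidine tag and the commonly used nine-residue HA epitope (YPYDVPDYA) are written in one-letter code. The function name fuse, the Gly-Ser linker, and the example polypeptide sequence are hypothetical and serve only to demonstrate the concatenation. Handling of the initiator methionine for N-terminal tags is deliberately left out of the sketch.

```python
# Tag sequences in one-letter amino acid code.
HIS6_TAG = "HHHHHH"    # hexa-histidine purification tag
HA_TAG = "YPYDVPDYA"   # epitope derived from influenza hemagglutinin


def fuse(polypeptide, tag, linker="", tag_at_c_terminus=True):
    """Return the sequence of a polypeptide joined to a tag via an optional linker."""
    if tag_at_c_terminus:
        return polypeptide + linker + tag
    return tag + linker + polypeptide


# Hypothetical 12-residue polypeptide used only for demonstration.
example = "MKTLLLAVAVLS"
his_fusion = fuse(example, HIS6_TAG, linker="GS")
ha_fusion = fuse(example, HA_TAG, tag_at_c_terminus=False)
print(his_fusion)
print(ha_fusion)
```

In practice the fusion would be encoded at the polynucleotide level, for example by cloning into a vector that supplies the tag in frame. The sketch only shows the resulting order of the polypeptide, linker, and tag.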

Thus, any of these above fusions can be engineered using the polynucleotides or 
the polypeptides of the present invention. 


Vectors, Host Cells, and Protein Production

The present invention also relates to vectors containing the polynucleotide of the 
present invention, host cells, and the production of polypeptides by recombinant 
techniques. The vector may be, for example, a phage, plasmid, viral, or retroviral vector. 
Retroviral vectors may be replication competent or replication defective. In the latter case, 
viral propagation generally will occur only in complementing host cells.

The polynucleotides may be joined to a vector containing a selectable marker for
propagation in a host. Generally, a plasmid vector is introduced in a precipitate, such as a 
calcium phosphate precipitate, or in a complex with a charged lipid. If the vector is a 
virus, it may be packaged in vitro using an appropriate packaging cell line and then 
transduced into host cells. 

The polynucleotide insert should be operatively linked to an appropriate promoter, 
such as the phage lambda PL promoter, the E. coli lac, trp, phoA and tac promoters, the 
SV40 early and late promoters and promoters of retroviral LTRs, to name a few. Other 
suitable promoters will be known to the skilled artisan. The expression constructs will 
further contain sites for transcription initiation, termination, and, in the transcribed region, 
a ribosome binding site for translation. The coding portion of the transcripts expressed by 
the constructs will preferably include a translation initiating codon at the beginning and a 
termination codon (UAA, UGA or UAG) appropriately positioned at the end of the 
polypeptide to be translated. 
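
For illustration only, the requirement that the coding portion begin with a translation initiating codon and end with an appropriately positioned termination codon can be checked with a short Python sketch. The function name check_coding_region and the example sequences are hypothetical. The sketch assumes that the initiating codon is ATG (AUG) and examines only the coding region itself, not the promoter or ribosome binding site.

```python
STOP_CODONS = {"TAA", "TGA", "TAG"}  # DNA equivalents of UAA, UGA and UAG


def check_coding_region(sequence):
    """Return a list of problems found in a coding region (empty if none).

    Accepts DNA or RNA; RNA is converted to DNA letters before checking.
    """
    seq = sequence.upper().replace("U", "T")
    problems = []
    if len(seq) % 3 != 0:
        problems.append("length is not a multiple of three")
    if not seq.startswith("ATG"):
        problems.append("no ATG initiation codon at the start")
    # Split into complete codons, ignoring any trailing partial codon.
    codons = [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]
    if not codons or codons[-1] not in STOP_CODONS:
        problems.append("no termination codon at the end")
    if any(codon in STOP_CODONS for codon in codons[:-1]):
        problems.append("premature in-frame termination codon")
    return problems


print(check_coding_region("ATGGCTTAA"))     # well-formed example
print(check_coding_region("ATGTAAGCTTAA"))  # contains an early stop codon
```

An empty list indicates that the sequence starts with the initiating codon, is in frame, and carries a single termination codon at its end.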

As indicated, the expression vectors will preferably include at least one selectable 
marker. Such markers include dihydrofolate reductase, G418 or neomycin resistance for 
eukaryotic cell culture and tetracycline, kanamycin or ampicillin resistance genes for 
culturing in E. coli and other bacteria. Representative examples of appropriate hosts 
include, but are not limited to, bacterial cells, such as E. coli, Streptomyces and Salmonella 
typhimurium cells; fungal cells, such as yeast cells; insect cells such as Drosophila S2 and 
Spodoptera Sf9 cells; animal cells such as CHO, COS, 293, and Bowes melanoma cells; 
and plant cells. Appropriate culture media and conditions for the above-described host
cells are known in the art. 

Vectors preferred for use in bacteria include pQE70, pQE60 and pQE-9,
available from QIAGEN, Inc.; pBluescript vectors, Phagescript vectors, pNH8A, pNH16a,
pNH18A, pNH46A, available from Stratagene Cloning Systems, Inc.; and ptrc99a,
pKK223-3, pKK233-3, pDR540, pRIT5 available from Pharmacia Biotech, Inc. Among
preferred eukaryotic vectors are pWLNEO, pSV2CAT, pOG44, pXT1 and pSG available
from Stratagene; and pSVK3, pBPV, pMSG and pSVL available from Pharmacia. Other
suitable vectors will be readily apparent to the skilled artisan. 

Introduction of the construct into the host cell can be effected by calcium phosphate 
transfection, DEAE-dextran mediated transfection, cationic lipid-mediated transfection, 
electroporation, transduction, infection, or other methods. Such methods are described in 
many standard laboratory manuals, such as Davis et al., Basic Methods In Molecular 
Biology (1986). It is specifically contemplated that the polypeptides of the present 
invention may in fact be expressed by a host cell lacking a recombinant vector. 

A polypeptide of this invention can be recovered and purified from recombinant 
cell cultures by well-known methods including ammonium sulfate or ethanol precipitation, 
acid extraction, anion or cation exchange chromatography, phosphocellulose 
chromatography, hydrophobic interaction chromatography, affinity chromatography, 
hydroxylapatite chromatography and lectin chromatography. Most preferably, high
performance liquid chromatography ("HPLC") is employed for purification. 

Polypeptides of the present invention, and preferably the secreted form, can also be 
recovered from: products purified from natural sources, including bodily fluids, tissues 
and cells, whether directly isolated or cultured; products of chemical synthetic procedures; 
and products produced by recombinant techniques from a prokaryotic or eukaryotic host, 
including, for example, bacterial, yeast, higher plant, insect, and mammalian cells. 
Depending upon the host employed in a recombinant production procedure, the 
polypeptides of the present invention may be glycosylated or may be non-glycosylated. In 
addition, polypeptides of the invention may also include an initial modified methionine 
residue, in some cases as a result of host-mediated processes. Thus, it is well known in the 
art that the N-terminal methionine encoded by the translation initiation codon generally is 
removed with high efficiency from any protein after translation in all eukaryotic cells. 
While the N-terminal methionine on most proteins also is efficiently removed in most 
prokaryotes, for some proteins, this prokaryotic removal process is inefficient, depending 
on the nature of the amino acid to which the N-terminal methionine is covalently linked.

In addition to encompassing host cells containing the vector constructs discussed
herein, the invention also encompasses primary, secondary, and immortalized host cells of 
vertebrate origin, particularly mammalian origin, that have been engineered to delete or 
replace endogenous genetic material (e.g., coding sequence), and/or to include genetic 
material (e.g., heterologous polynucleotide sequences) that is operably associated with the 
polynucleotides of the invention, and which activates, alters, and/or amplifies endogenous 
polynucleotides. For example, techniques known in the art may be used to operably 
associate heterologous control regions (e.g., promoter and/or enhancer) and endogenous 
polynucleotide sequences via homologous recombination (see, e.g., U.S. Patent No. 
5,641,670, issued June 24, 1997; International Publication No. WO 96/29411, published 
September 26, 1996; International Publication No. WO 94/12650, published August 4, 
1994; Koller et al., Proc. Natl. Acad. Sci. USA 86:8932-8935 (1989); and Zijlstra et al.,
Nature 342:435-438 (1989), the disclosures of each of which are incorporated by reference 
in their entireties). 

In addition, polypeptides of the invention can be chemically synthesized using 
techniques known in the art (e.g., see Creighton, 1983, Proteins: Structures and Molecular 
Principles, W.H. Freeman & Co., N.Y., and Hunkapiller et al., Nature, 310:105-111 
(1984)). For example, a polypeptide corresponding to a fragment of a polypeptide 
sequence of the invention can be synthesized by use of a peptide synthesizer. Furthermore, 
if desired, nonclassical amino acids or chemical amino acid analogs can be introduced as a 
substitution or addition into the polypeptide sequence. Non-classical amino acids include, 
but are not limited to, the D-isomers of the common amino acids, 2,4-diaminobutyric
acid, α-amino isobutyric acid, 4-aminobutyric acid, Abu, 2-amino butyric acid, γ-Abu,
ε-Ahx, 6-amino hexanoic acid, Aib, 2-amino isobutyric acid, 3-amino propionic acid,
ornithine, norleucine, norvaline, hydroxyproline, sarcosine, citrulline, homocitrulline,
cysteic acid, t-butylglycine, t-butyl alanine, phenylglycine, cyclohexylalanine, β-alanine,
fluoro-amino acids, designer amino acids such as β-methyl amino acids, Cα-methyl amino
acids, Nα-methyl amino acids, and amino acid analogs in general. Furthermore, the amino
acid can be D (dextrorotary) or L (levorotary).

The invention encompasses polypeptides which are differentially modified during 
or after translation, e.g., by glycosylation, acetylation, phosphorylation, amidation, 
derivatization by known protecting/blocking groups, proteolytic cleavage, linkage to an 
antibody molecule or other cellular ligand, etc. Any of numerous chemical modifications 
may be carried out by known techniques, including, but not limited to, specific chemical
cleavage by cyanogen bromide, trypsin, chymotrypsin, papain, V8 protease, NaBH4;
acetylation, formylation, oxidation, reduction; metabolic synthesis in the presence of 
tunicamycin; etc. 
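
By way of example, the fragments expected from two of the cleavage agents named above can be predicted with a short Python sketch. The sketch assumes the commonly cited, simplified specificities: cyanogen bromide cleaving on the C-terminal side of methionine, and trypsin cleaving on the C-terminal side of lysine or arginine except when the next residue is proline. The function name cleave and the example sequence are hypothetical, and real digests may deviate from these simplified rules.

```python
def cleave(sequence, cut_after, blocked_by=""):
    """Split a one-letter amino acid sequence after each residue in cut_after.

    A site is skipped when the following residue is listed in blocked_by.
    No cut is made after the final residue.
    """
    fragments, start = [], 0
    for i, residue in enumerate(sequence):
        next_residue = sequence[i + 1] if i + 1 < len(sequence) else ""
        if residue in cut_after and next_residue and next_residue not in blocked_by:
            fragments.append(sequence[start:i + 1])
            start = i + 1
    fragments.append(sequence[start:])
    return fragments


# Hypothetical sequence used only for demonstration.
peptide = "MAKRPLMGK"
cnbr_fragments = cleave(peptide, cut_after="M")
trypsin_fragments = cleave(peptide, cut_after="KR", blocked_by="P")
print(cnbr_fragments)
print(trypsin_fragments)
```

Joining the fragments in order reproduces the input sequence, which provides a simple check that no residues are lost by the predicted cleavage.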

Additional post-translational modifications encompassed by the invention include, 
for example, N-linked or O-linked carbohydrate chains, processing of N-terminal or
C-terminal ends, attachment of chemical moieties to the amino acid backbone, chemical
modifications of N-linked or O-linked carbohydrate chains, and addition or deletion of an 
N-terminal methionine residue as a result of procaryotic host cell expression. The 
polypeptides may also be modified with a detectable label, such as an enzymatic, 
fluorescent, isotopic or affinity label to allow for detection and isolation of the protein. 

Also provided by the invention are chemically modified derivatives of the 
polypeptides of the invention which may provide additional advantages such as increased 
solubility, stability and circulating time of the polypeptide, or decreased immunogenicity 
(see U.S. Patent No. 4,179,337). The chemical moieties for derivatization may be selected
from water soluble polymers such as polyethylene glycol, ethylene glycol/propylene glycol 
copolymers, carboxymethylcellulose, dextran, polyvinyl alcohol and the like. The 
polypeptides may be modified at random positions within the molecule, or at 
predetermined positions within the molecule and may include one, two, three or more 
attached chemical moieties. 

The polymer may be of any molecular weight, and may be branched or unbranched. 
For polyethylene glycol, the preferred molecular weight is between about 1 kDa and about 
100 kDa (the term "about" indicating that in preparations of polyethylene glycol, some
molecules will weigh more, some less, than the stated molecular weight) for ease in
handling and manufacturing. Other sizes may be used, depending on the desired
therapeutic profile (e.g., the duration of sustained release desired, the effects, if any, on
biological activity, the ease in handling, the degree or lack of antigenicity and other known
effects of the polyethylene glycol on a therapeutic protein or analog).

The polyethylene glycol molecules (or other chemical moieties) should be attached 
to the protein with consideration of effects on functional or antigenic domains of the 
protein. There are a number of attachment methods available to those skilled in the art, 
e.g., EP 0 401 384, herein incorporated by reference (coupling PEG to G-CSF), see also 
Malik et al., Exp. Hematol. 20:1028-1035 (1992) (reporting pegylation of GM-CSF using
tresyl chloride). For example, polyethylene glycol may be covalently bound through
amino acid residues via a reactive group, such as a free amino or carboxyl group.
Reactive groups are those to which an activated polyethylene glycol molecule may be 
bound. The amino acid residues having a free amino group may include lysine residues 
and the N-terminal amino acid residues; those having a free carboxyl group may include 
aspartic acid residues, glutamic acid residues, and the C-terminal amino acid residue.
Sulfhydryl groups may also be used as a reactive group for attaching the polyethylene 
glycol molecules. Preferred for therapeutic purposes is attachment at an amino group, such 
as attachment at the N-terminus or lysine group. 
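
For illustration only, the candidate attachment sites described above can be enumerated from a polypeptide sequence with a short Python sketch. The function name pegylation_sites and the example sequence are hypothetical. The sketch lists every lysine, aspartic acid, glutamic acid, and cysteine residue together with the two termini, and does not consider whether a given group is actually free or accessible (for example, a cysteine engaged in a disulfide bond).

```python
def pegylation_sites(sequence):
    """Group candidate attachment sites by reactive group.

    Positions are 1-based; residues are given in one-letter code.
    """
    sites = {"amino": ["N-terminus"], "carboxyl": [], "sulfhydryl": []}
    for position, residue in enumerate(sequence.upper(), start=1):
        if residue == "K":            # lysine side-chain amino group
            sites["amino"].append(f"K{position}")
        elif residue in ("D", "E"):   # aspartic or glutamic acid carboxyl group
            sites["carboxyl"].append(f"{residue}{position}")
        elif residue == "C":          # cysteine sulfhydryl group
            sites["sulfhydryl"].append(f"C{position}")
    sites["carboxyl"].append("C-terminus")
    return sites


# Hypothetical sequence used only for demonstration.
sites = pegylation_sites("MKDEC")
for group, positions in sites.items():
    print(group, positions)
```

Such a listing can help in weighing attachment positions against known functional or antigenic domains before a pegylation chemistry is chosen.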

One may specifically desire proteins chemically modified at the N-terminus. Using 
polyethylene glycol as an illustration of the present composition, one may select from a 
variety of polyethylene glycol molecules (by molecular weight, branching, etc.), the 
proportion of polyethylene glycol molecules to protein (polypeptide) molecules in the 
reaction mix, the type of pegylation reaction to be performed, and the method of obtaining 
the selected N-terminally pegylated protein. The method of obtaining the N-terminally
pegylated preparation (i.e., separating this moiety from other monopegylated moieties if
necessary) may be by purification of the N-terminally pegylated material from a
population of pegylated protein molecules. Selective chemical modification of proteins at the
N-terminus may be accomplished by reductive alkylation which exploits
differential reactivity of different types of primary amino groups (lysine versus the 
N-terminal) available for derivatization in a particular protein. Under the appropriate 
reaction conditions, substantially selective derivatization of the protein at the N-terminus 
with a carbonyl group containing polymer is achieved. 

The polypeptides of the invention may be monomers or multimers (i.e., dimers,
trimers, tetramers and higher multimers). Accordingly, the present invention relates to 
monomers and multimers of the polypeptides of the invention, their preparation, and 
compositions (preferably, Therapeutics) containing them. In specific embodiments, the 
polypeptides of the invention are monomers, dimers, trimers or tetramers. In additional 
embodiments, the multimers of the invention are at least dimers, at least trimers, or at least 
tetramers. 

Multimers encompassed by the invention may be homomers or heteromers. As 
used herein, the term homomer refers to a multimer containing only polypeptides
corresponding to the amino acid sequence of SEQ ID NO: Y or encoded by the cDNA 
contained in a deposited clone (including fragments, variants, splice variants, and fusion 
proteins, corresponding to these polypeptides as described herein). These homomers may 
contain polypeptides having identical or different amino acid sequences. In a specific 
embodiment, a homomer of the invention is a multimer containing only polypeptides 
having an identical amino acid sequence. In another specific embodiment, a homomer of 
the invention is a multimer containing polypeptides having different amino acid sequences. 
In specific embodiments, the multimer of the invention is a homodimer (e.g., containing 
polypeptides having identical or different amino acid sequences) or a homotrimer (e.g., 
containing polypeptides having identical and/or different amino acid sequences). In 
additional embodiments, the homomeric multimer of the invention is at least a homodimer, 
at least a homotrimer, or at least a homotetramer. 

As used herein, the term heteromer refers to a multimer containing one or more 
heterologous polypeptides (i.e., polypeptides of different proteins) in addition to the
polypeptides of the invention. In a specific embodiment, the multimer of the invention is a
heterodimer, a heterotrimer, or a heterotetramer. In additional embodiments, the 
heteromeric multimer of the invention is at least a heterodimer, at least a heterotrimer, or at 
least a heterotetramer. 
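
As a non-limiting illustration, the homomer and heteromer definitions given above can be expressed as a small Python sketch. The class name Subunit, the function name describe_multimer, and the example sequences are hypothetical. Consistent with the definitions, a multimer whose subunits are all polypeptides of the invention is treated as a homomer even if their sequences differ, and the presence of any heterologous polypeptide makes it a heteromer.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Subunit:
    sequence: str
    of_invention: bool  # False marks a heterologous polypeptide


ORDER_NAMES = {2: "dimer", 3: "trimer", 4: "tetramer"}


def describe_multimer(subunits):
    """Name an assembly according to the homomer/heteromer definitions."""
    if not any(s.of_invention for s in subunits):
        raise ValueError("at least one polypeptide of the invention is required")
    count = len(subunits)
    if count == 1:
        return "monomer"
    prefix = "homo" if all(s.of_invention for s in subunits) else "hetero"
    return prefix + ORDER_NAMES.get(count, "multimer")


# Hypothetical subunits used only for demonstration.
full_length = Subunit("MKTLLLAVAVLS", of_invention=True)
fragment = Subunit("LLLAVAVLS", of_invention=True)
other_protein = Subunit("MGSSHHQQ", of_invention=False)

print(describe_multimer([full_length, full_length]))
print(describe_multimer([full_length, fragment, full_length]))
print(describe_multimer([full_length, other_protein]))
```

Assemblies of more than four subunits are reported generically as a homomultimer or heteromultimer in this sketch.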

Multimers of the invention may be the result of hydrophobic, hydrophilic, ionic
and/or covalent associations and/or may be indirectly linked by, for example, liposome
formation. Thus, in one embodiment, multimers of the invention, such as, for example, 
homodimers or homotrimers, are formed when polypeptides of the invention contact one 
another in solution. In another embodiment, heteromultimers of the invention, such as, for
example, heterotrimers or heterotetramers, are formed when polypeptides of the invention
contact antibodies to the polypeptides of the invention (including antibodies to the 
heterologous polypeptide sequence in a fusion protein of the invention) in solution. In 
other embodiments, multimers of the invention are formed by covalent associations with 
and/or between the polypeptides of the invention. Such covalent associations may involve
one or more amino acid residues contained in the polypeptide sequence (e.g., that recited
in the sequence listing, or contained in the polypeptide encoded by a deposited clone). In 
one instance, the covalent associations are cross-linking between cysteine residues located 
within the polypeptide sequences which interact in the native (i.e., naturally occurring) 
polypeptide. In another instance, the covalent associations are the consequence of
chemical or recombinant manipulation. Alternatively, such covalent associations may
involve one or more amino acid residues contained in the heterologous polypeptide 
sequence in a fusion protein of the invention. 

In one example, covalent associations are between the heterologous sequence 
contained in a fusion protein of the invention (see, e.g., US Patent Number 5,478,925). In
a specific example, the covalent associations are between the heterologous sequence
contained in an Fc fusion protein of the invention (as described herein). In another specific
example, covalent associations of fusion proteins of the invention are between 
heterologous polypeptide sequence from another protein that is capable of forming
covalently associated multimers, such as, for example, osteoprotegerin (see, e.g.,
International Publication No. WO 98/49305, the contents of which are herein incorporated
by reference in their entirety). In another embodiment, two or more polypeptides of the
invention are joined through peptide linkers. Examples include those peptide linkers
described in U.S. Pat. No. 5,073,627 (hereby incorporated by reference). Proteins
comprising multiple polypeptides of the invention separated by peptide linkers may be
produced using conventional recombinant DNA technology. 

Another method for preparing multimer polypeptides of the invention involves use 
of polypeptides of the invention fused to a leucine zipper or isoleucine zipper polypeptide
sequence. Leucine zipper and isoleucine zipper domains are polypeptides that promote
multimerization of the proteins in which they are found. Leucine zippers were originally
identified in several DNA-binding proteins (Landschulz et al., Science 240:1759 (1988)),
and have since been found in a variety of different proteins. Among the known leucine 
zippers are naturally occurring peptides and derivatives thereof that dimerize or trimerize.
Examples of leucine zipper domains suitable for producing soluble multimeric proteins of
the invention are those described in PCT application WO 94/10308, hereby incorporated 
by reference. Recombinant fusion proteins comprising a polypeptide of the invention 
fused to a polypeptide sequence that dimerizes or trimerizes in solution are expressed in 
suitable host cells, and the resulting soluble multimeric fusion protein is recovered from
the culture supernatant using techniques known in the art.

Trimeric polypeptides of the invention may offer the advantage of enhanced 
biological activity. Preferred leucine zipper moieties and isoleucine zipper moieties are those that
preferentially form trimers. One example is a leucine zipper derived from lung surfactant
protein D (SPD), as described in Hoppe et al. (FEBS Letters 344:191 (1994)) and in U.S.
patent application Ser. No. 08/446,922, hereby incorporated by reference. Other peptides
derived from naturally occurring trimeric proteins may be employed in preparing trimeric 
polypeptides of the invention.

In another example, proteins of the invention are associated by interactions
between Flag® polypeptide sequence contained in fusion proteins of the invention
containing Flag® polypeptide sequence. In a further embodiment, proteins of
the invention are associated by interactions between heterologous polypeptide sequence
contained in Flag® fusion proteins of the invention and anti-Flag® antibody.

The multimers of the invention may be generated using chemical techniques known 
in the art. For example, polypeptides desired to be contained in the multimers of the 
invention may be chemically cross-linked using linker molecules and linker molecule 
length optimization techniques known in the art (see, e.g., US Patent Number 5,478,925, 
which is herein incorporated by reference in its entirety). Additionally, multimers of the 
invention may be generated using techniques known in the art to form one or more inter- 
molecule cross-links between the cysteine residues located within the sequence of the 
polypeptides desired to be contained in the multimer (see, e.g., US Patent Number 
5,478,925, which is herein incorporated by reference in its entirety). Further, polypeptides 
of the invention may be routinely modified by the addition of cysteine or biotin to the 
C-terminus or N-terminus of the polypeptide and techniques known in the art may be applied 
to generate multimers containing one or more of these modified polypeptides (see, e.g., US 
Patent Number 5,478,925, which is herein incorporated by reference in its entirety). 
Additionally, techniques known in the art may be applied to generate liposomes containing 
the polypeptide components desired to be contained in the multimer of the invention (see, 
e.g., US Patent Number 5,478,925, which is herein incorporated by reference in its 
entirety). 

Alternatively, multimers of the invention may be generated using genetic 
engineering techniques known in the art. In one embodiment, polypeptides contained in 
multimers of the invention are produced recombinantly using fusion protein technology 
described herein or otherwise known in the art (see, e.g., US Patent Number 5,478,925, 
which is herein incorporated by reference in its entirety). In a specific embodiment, 
polynucleotides coding for a homodimer of the invention are generated by ligating a 
polynucleotide sequence encoding a polypeptide of the invention to a sequence encoding a 
linker polypeptide and then further to a synthetic polynucleotide encoding the translated 
product of the polypeptide in the reverse orientation from the original C-terminus to the N- 
terminus (lacking the leader sequence) (see, e.g., US Patent Number 5,478,925, which is 
herein incorporated by reference in its entirety). In another embodiment, recombinant 
techniques described herein or otherwise known in the art are applied to generate 
recombinant polypeptides of the invention which contain a transmembrane domain (or 
hydrophobic or signal peptide) and which can be incorporated by membrane reconstitution 
techniques into liposomes (see, e.g., US Patent Number 5,478,925, which is herein 
incorporated by reference in its entirety). 

Uses of the Polynucleotides 

Each of the polynucleotides identified herein can be used in numerous ways as 
reagents. The following description should be considered exemplary and utilizes known 
techniques. 

The polynucleotides of the present invention are useful for chromosome 
identification. There exists an ongoing need to identify new chromosome markers, since 
few chromosome marking reagents, based on actual sequence data (repeat 
polymorphisms), are presently available. Each polynucleotide of the present invention can 
be used as a chromosome marker. 

Briefly, sequences can be mapped to chromosomes by preparing PCR primers 
(preferably 15-25 bp) from the sequences shown in SEQ ID NO:X. Primers can be 
selected using computer analysis so that primers do not span more than one predicted exon 
in the genomic DNA. These primers are then used for PCR screening of somatic cell 
hybrids containing individual human chromosomes. Only those hybrids containing the 
human gene corresponding to the SEQ ID NO:X will yield an amplified fragment. 
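
The primer-based mapping step described above can be sketched in a few lines of Python. This is only an illustrative sketch under stated assumptions: the function names, the choice of taking the primers from the two ends of a single predicted exon, and the use of a plain substring search as a stand-in for PCR amplification are all hypothetical and are not taken from the specification.

```python
def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    pairs = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(pairs[base] for base in reversed(seq.upper()))


def primer_pair_within_exon(exon_seq, primer_len=20):
    """Pick a forward and a reverse primer that both lie inside one exon.

    Assumption: the forward primer is the first `primer_len` bases of the
    exon and the reverse primer is the reverse complement of the last
    `primer_len` bases, so neither primer spans an exon boundary.
    """
    if not 15 <= primer_len <= 25:
        raise ValueError("primer length outside the preferred 15-25 bp range")
    if len(exon_seq) < 2 * primer_len:
        raise ValueError("exon too short for a non-overlapping primer pair")
    exon_seq = exon_seq.upper()
    forward = exon_seq[:primer_len]
    reverse = reverse_complement(exon_seq[-primer_len:])
    return forward, reverse


def screen_hybrids(hybrid_panel, forward, reverse):
    """Return the names of hybrids that would be expected to yield a product.

    `hybrid_panel` maps a hybrid name to the (top-strand) human DNA it
    carries. A hit requires the forward primer site followed downstream by
    the site complementary to the reverse primer. Real PCR behavior
    (mismatch tolerance, annealing temperature) is not modeled.
    """
    reverse_site = reverse_complement(reverse)
    hits = []
    for name, dna in hybrid_panel.items():
        start = dna.find(forward)
        if start == -1:
            continue
        if dna.find(reverse_site, start + len(forward)) != -1:
            hits.append(name)
    return hits
```

In this sketch only the hybrid carrying the region from which the primers were derived is reported, which mirrors the statement that only hybrids containing the corresponding human gene yield an amplified fragment.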

Similarly, somatic hybrids provide a rapid method of PCR mapping the 
polynucleotides to particular chromosomes. Three or more clones can be assigned per day 
using a single thermal cycler. Moreover, sublocalization of the polynucleotides can be 
achieved with panels of specific chromosome fragments. Other gene mapping strategies 
that can be used include in situ hybridization, prescreening with labeled flow-sorted 
chromosomes, and preselection by hybridization to construct chromosome-specific cDNA 
libraries. 

Precise chromosomal location of the polynucleotides can also be achieved using 
fluorescence in situ hybridization (FISH) of a metaphase chromosomal spread. This 
technique uses polynucleotides as short as 500 or 600 bases; however, polynucleotides 
2,000-4,000 bp are preferred. For a review of this technique, see Verma et al., "Human 
Chromosomes: a Manual of Basic Techniques," Pergamon Press, New York (1988). 

For chromosome mapping, the polynucleotides can be used individually (to mark a 
single chromosome or a single site on that chromosome) or in panels (for marking multiple 
sites and/or multiple chromosomes). Preferred polynucleotides correspond to the 
noncoding regions of the cDNAs because the coding sequences are more likely conserved 
within gene families, thus increasing the chance of cross hybridization during 
chromosomal mapping. 

Once a polynucleotide has been mapped to a precise chromosomal location, the 
physical position of the polynucleotide can be used in linkage analysis. Linkage analysis 
establishes coinheritance between a chromosomal location and presentation of a particular 
disease. (Disease mapping data are found, for example, in V. McKusick, Mendelian 
Inheritance in Man (available on line through Johns Hopkins University Welch Medical 
Library).) Assuming 1 megabase mapping resolution and one gene per 20 kb, a cDNA 
precisely localized to a chromosomal region associated with the disease could be one of 
50-500 potential causative genes. 
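
The lower figure in that estimate follows directly from the stated assumptions, as the short calculation below shows. The function name is hypothetical. The 10 megabase case is included only as arithmetic under the same gene density; the text does not state which resolution corresponds to the upper figure.

```python
def candidate_genes(region_bp, bp_per_gene=20_000):
    """Number of genes expected in a region at one gene per `bp_per_gene` bases."""
    return region_bp // bp_per_gene


# 1 megabase resolution at one gene per 20 kb
print(candidate_genes(1_000_000))    # 50
# Same density over a 10 megabase region (an assumed, coarser resolution)
print(candidate_genes(10_000_000))   # 500
```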

Thus, once coinheritance is established, differences in the polynucleotide and the 
corresponding gene between affected and unaffected individuals can be examined. First, 
visible structural alterations in the chromosomes, such as deletions or translocations, are 
examined in chromosome spreads or by PCR. If no structural alterations exist, the 
presence of point mutations is ascertained. Mutations observed in some or all affected 
individuals, but not in normal individuals, indicate that the mutation may cause the 
disease. However, complete sequencing of the polypeptide and the corresponding gene 
from several normal individuals is required to distinguish the mutation from a 
polymorphism. If a new polymorphism is identified, this polymorphic polypeptide can be 
used for further linkage analysis. 

Furthermore, increased or decreased expression of the gene in affected individuals 
as compared to unaffected individuals can be assessed using polynucleotides of the present 
invention. Any of these alterations (altered expression, chromosomal rearrangement, or 
mutation) can be used as a diagnostic or prognostic marker. 

Thus, the invention also provides a diagnostic method useful during diagnosis of a 
disorder, involving measuring the expression level of polynucleotides of the present 
invention in cells or body fluid from an individual and comparing the measured gene 
expression level with a standard level of polynucleotide expression level, whereby an 
increase or decrease in the gene expression level compared to the standard is indicative of 
a disorder. 

In still another embodiment, the invention includes a kit for analyzing samples for 
the presence of proliferative and/or cancerous polynucleotides derived from a test subject. 
In a general embodiment, the kit includes at least one polynucleotide probe containing a 
nucleotide sequence that will specifically hybridize with a polynucleotide of the present 
invention and a suitable container. In a specific embodiment, the kit includes two 
polynucleotide probes defining an internal region of the polynucleotide of the present 
invention, where each probe has one strand containing a 31'mer-end internal to the region. 
In a further embodiment, the probes may be useful as primers for polymerase chain 
reaction amplification. 

Where a diagnosis of a disorder has already been made according to conventional 
methods, the present invention is useful as a prognostic indicator, whereby patients 
exhibiting enhanced or depressed expression of the polynucleotide of the present invention will 
experience a worse clinical outcome relative to patients expressing the gene at a level 
nearer the standard level. 

By "measuring the expression level of polynucleotide of the present invention" is 
intended qualitatively or quantitatively measuring or estimating the level of the 
polypeptide of the present invention or the level of the mRNA encoding the polypeptide in 
a first biological sample either directly (e.g., by determining or estimating absolute protein 
level or mRNA level) or relatively (e.g., by comparing to the polypeptide level or mRNA 
level in a second biological sample). Preferably, the polypeptide level or mRNA level in 
the first biological sample is measured or estimated and compared to a standard 
polypeptide level or mRNA level, the standard being taken from a second biological 
sample obtained from an individual not having the disorder or being determined by 
averaging levels from a population of individuals not having a disorder. As will be 
appreciated in the art, once a standard polypeptide level or mRNA level is known, it can be 
used repeatedly as a standard for comparison. 
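
The comparison against a standard level can be illustrated with a minimal sketch. The function names, the use of a simple arithmetic mean for the population standard, and the 25% tolerance band are assumptions chosen for illustration; the specification does not prescribe a particular statistic or threshold.

```python
from statistics import mean


def standard_level(unaffected_levels):
    """Standard level taken as the average over individuals without the disorder."""
    return mean(unaffected_levels)


def compare_to_standard(sample_level, standard, tolerance=0.25):
    """Classify a measured level relative to the standard.

    `tolerance` is a hypothetical fractional band around the standard inside
    which a sample is treated as unchanged.
    """
    if standard <= 0:
        raise ValueError("standard level must be positive")
    ratio = sample_level / standard
    if ratio > 1 + tolerance:
        return "increased"
    if ratio < 1 - tolerance:
        return "decreased"
    return "within standard"


# The standard is computed once and then reused for later samples.
standard = standard_level([10, 12, 8])
for level in (15, 5, 11):
    print(level, compare_to_standard(level, standard))
```

Computing the standard once and passing it to each comparison reflects the point that a known standard level can be used repeatedly.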

By "biological sample" is intended any biological sample obtained from an 
individual, body fluid, cell line, tissue culture, or other source which contains the 
polypeptide of the present invention or mRNA. As indicated, biological samples include 
body fluids (such as semen, lymph, sera, plasma, urine, synovial fluid and spinal fluid) 
which contain the polypeptide of the present invention, and other tissue sources found to 
express the polypeptide of the present invention. Methods for obtaining tissue biopsies 
and body fluids from mammals are well known in the art. Where the biological sample is 
to include mRNA, a tissue biopsy is the preferred source. 

The method(s) provided above may preferably be applied in a diagnostic method 
and/or kits in which polynucleotides and/or polypeptides are attached to a solid support. In 
one exemplary method, the support may be a "gene chip" or a "biological chip" as 
described in US Patents 5,837,832, 5,874,219, and 5,856,174. Further, such a gene chip 
with polynucleotides of the present invention attached may be used to identify 
polymorphisms between the polynucleotide sequences, with polynucleotides isolated from 
a test subject. The knowledge of such polymorphisms (i.e., their location as well as their 
existence) would be beneficial in identifying disease loci for many disorders, including 
cancerous diseases and conditions. Such a method is described in US Patents 5,858,659 
and 5,856,104. The US Patents referenced supra are hereby incorporated by reference in 
their entirety herein. 

The present invention encompasses polynucleotides of the present invention that 
are chemically synthesized, or reproduced as peptide nucleic acids (PNA), or according to 
other methods known in the art. The use of PNAs would serve as the preferred form if the 
polynucleotides are incorporated onto a solid support, or gene chip. For the purposes of the 
present invention, a peptide nucleic acid (PNA) is a polyamide type of DNA analog and 
the monomeric units for adenine, guanine, thymine and cytosine are available 
commercially (Perceptive Biosystems). Certain components of DNA, such as phosphorus, 
phosphorus oxides, or deoxyribose derivatives, are not present in PNAs. As disclosed by P. 
E. Nielsen, M. Egholm, R. H. Berg and O. Buchardt, Science 254, 1497 (1991); and M. 
Egholm, O. Buchardt, L.Christensen, C. Behrens, S. M. Freier, D. A. Driver, R. H. Berg, 
S. K. Kim, B. Norden, and P. E. Nielsen, Nature 365, 666 (1993), PNAs bind specifically 
and tightly to complementary DNA strands and are not degraded by nucleases. In fact, 
PNA binds more strongly to DNA than DNA itself does. This is probably because there is 
no electrostatic repulsion between the two strands, and also the polyamide backbone is 
more flexible. Because of this, PNA/DNA duplexes bind under a wider range of stringency 
conditions than DNA/DNA duplexes, making it easier to perform multiplex hybridization. 
Smaller probes can be used than with DNA due to the strong binding. In addition, it is 
more likely that single base mismatches can be determined with PNA/DNA hybridization 
because a single mismatch in a PNA/DNA 15-mer lowers the melting point (Tm) by 
8°-20° C, vs. 4°-16° C for the DNA/DNA 15-mer duplex. Also, the absence of charge 
groups in PNA means that hybridization can be done at low ionic strengths, reducing 
possible interference by salt during the analysis. 

The present invention is useful for detecting cancer in mammals. In particular the 
invention is useful during diagnosis of pathological cell proliferative neoplasias which 
include, but are not limited to: acute myelogenous leukemias including acute monocytic 
leukemia, acute myeloblastic leukemia, acute promyelocytic leukemia, acute 
myelomonocytic leukemia, acute erythroleukemia, acute megakaryocytic leukemia, and 
acute undifferentiated leukemia, etc.; and chronic myelogenous leukemias including 
chronic myelomonocytic leukemia, chronic granulocytic leukemia, etc. Preferred 
mammals include monkeys, apes, cats, dogs, cows, pigs, horses, rabbits and humans. 
Particularly preferred are humans. 

Pathological cell proliferative disorders are often associated with inappropriate 
activation of proto-oncogenes. (Gelmann, E. P. et al., "The Etiology of Acute Leukemia: 
Molecular Genetics and Viral Oncology," in Neoplastic Diseases of the Blood, Vol 1., 
Wiernik, P. H. et al. eds., 161-182 (1985)). Neoplasias are now believed to result from the 
qualitative alteration of a normal cellular gene product, or from the quantitative 
modification of gene expression by insertion into the chromosome of a viral sequence, by 
chromosomal translocation of a gene to a more actively transcribed region, or by some 
other mechanism. (Gelmann et al., supra) It is likely that mutated or altered expression of 
specific genes is involved in the pathogenesis of some leukemias, among other tissues and 
cell types. (Gelmann et al., supra) Indeed, the human counterparts of the oncogenes 
involved in some animal neoplasias have been amplified or translocated in some cases of 
human leukemia and carcinoma. (Gelmann et al., supra) 

For example, c-myc expression is highly amplified in the non-lymphocytic leukemia cell 
line HL-60. When HL-60 cells are chemically induced to stop proliferation, the level of c- 
myc is found to be downregulated. (International Publication Number WO 91/15580) 
However, it has been shown that exposure of HL-60 cells to a DNA construct that is 
complementary to the 5' end of c-myc or c-myb blocks translation of the corresponding 
mRNAs which downregulates expression of the c-myc or c-myb proteins and causes arrest 
of cell proliferation and differentiation of the treated cells. (International Publication 
Number WO 91/15580; Wickstrom et al., Proc. Natl. Acad. Sci. 85:1028 (1988); Anfossi 
et al., Proc. Natl. Acad. Sci. 86:3379 (1989)). However, the skilled artisan would 
appreciate the present invention's usefulness would not be limited to treatment of 
proliferative disorders of hematopoietic cells and tissues, in light of the numerous cells and 
cell types of varying origins which are known to exhibit proliferative phenotypes. 

In addition to the foregoing, a polynucleotide can be used to control gene 
expression through triple helix formation or antisense DNA or RNA. Antisense techniques 
are discussed, for example, in Okano, J. Neurochem. 56: 560 (1991); 
"Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression," CRC Press, Boca 
Raton, FL (1988). Triple helix formation is discussed in, for instance, Lee et al., Nucleic 
Acids Research 6: 3073 (1979); Cooney et al., Science 241: 456 (1988); and Dervan et al., 
Science 251: 1360 (1991). Both methods rely on binding of the polynucleotide to a 
complementary DNA or RNA. For these techniques, preferred polynucleotides are usually 
oligonucleotides 20 to 40 bases in length and complementary to either the region of the 
gene involved in transcription (triple helix - see Lee et al., Nucl. Acids Res. 6:3073 (1979); 
Cooney et al., Science 241:456 (1988); and Dervan et al., Science 251:1360 (1991) ) or to 
the mRNA itself (antisense - Okano, J. Neurochem. 56:560 (1991); Oligodeoxy- 
nucleotides as Antisense Inhibitors of Gene Expression, CRC Press, Boca Raton, FL 
(1988).) Triple helix formation optimally results in a shut-off of RNA transcription from 
DNA, while antisense RNA hybridization blocks translation of an mRNA molecule into 
polypeptide. Both techniques are effective in model systems, and the information 
disclosed herein can be used to design antisense or triple helix polynucleotides in an effort 
to treat disease. 
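
As a rough illustration of how a sequence disclosed herein could be turned into an antisense oligonucleotide of the preferred length, the sketch below returns the DNA reverse complement of a chosen stretch of mRNA. The function name, the default of targeting the first 20 bases, and the example sequence are hypothetical; the sketch does not model secondary structure, specificity, or any other design criterion.

```python
# Pairing used to build a DNA oligo against an RNA target.
RNA_TO_DNA_COMPLEMENT = {"A": "T", "C": "G", "G": "C", "U": "A"}


def antisense_oligo(mrna, start=0, length=20):
    """Return a DNA oligo complementary to mrna[start:start + length].

    The oligo is written 5' to 3', i.e. it is the reverse complement of the
    targeted mRNA stretch. Lengths outside 20-40 bases are rejected to match
    the preferred range given in the text.
    """
    if not 20 <= length <= 40:
        raise ValueError("preferred oligonucleotide length is 20 to 40 bases")
    target = mrna[start:start + length].upper()
    if len(target) < length:
        raise ValueError("target region runs past the end of the mRNA")
    return "".join(RNA_TO_DNA_COMPLEMENT[base] for base in reversed(target))


# Hypothetical mRNA fragment, used only to exercise the function.
example_mrna = "AUGGCCAAGCUUGGAUCCGAAAAA"
print(antisense_oligo(example_mrna))  # TCGGATCCAAGCTTGGCCAT
```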

Polynucleotides of the present invention are also useful in gene therapy. One goal 
of gene therapy is to insert a normal gene into an organism having a defective gene, in an 
effort to correct the genetic defect. The polynucleotides disclosed in the present invention 
offer a means of targeting such genetic defects in a highly accurate manner. Another goal 
is to insert a new gene that was not present in the host genome, thereby producing a new 
trait in the host cell. 

The polynucleotides are also useful for identifying individuals from minute 
biological samples. The United States military, for example, is considering the use of 
restriction fragment length polymorphism (RFLP) for identification of its personnel. In 
this technique, an individual's genomic DNA is digested with one or more restriction 
enzymes, and probed on a Southern blot to yield unique bands for identifying personnel. 
This method does not suffer from the current limitations of "Dog Tags" which can be lost, 
switched, or stolen, making positive identification difficult. The polynucleotides of the 
present invention can be used as additional DNA markers for RFLP. 
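
The idea behind RFLP identification can be shown with a small simulation: a single base change that destroys a restriction site changes the set of fragment lengths. This is a simplified sketch. The function name and example sequences are hypothetical, the DNA is treated as linear, and all fragments are reported, whereas a Southern blot shows only the fragments that hybridize to the probe. The EcoRI recognition site GAATTC, cut after the first G, is used as the example enzyme.

```python
def digest(dna, site="GAATTC", cut_offset=1):
    """Return the sorted fragment lengths from cutting linear DNA at each site.

    `cut_offset` is the position within the recognition site where the top
    strand is cut (1 for EcoRI, which cuts G^AATTC).
    """
    cuts = []
    pos = dna.find(site)
    while pos != -1:
        cuts.append(pos + cut_offset)
        pos = dna.find(site, pos + 1)
    edges = [0] + cuts + [len(dna)]
    return sorted(end - start for start, end in zip(edges, edges[1:]))


# Two hypothetical individuals differing by one base in the second site.
person_a = "AAAA" + "GAATTC" + "CCCCCCCC" + "GAATTC" + "TT"
person_b = "AAAA" + "GAATTC" + "CCCCCCCC" + "GAGTTC" + "TT"

print(digest(person_a))  # [5, 7, 14]
print(digest(person_b))  # [5, 21]
```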

The polynucleotides of the present invention can also be used as an alternative to 
RFLP, by determining the actual base-by-base DNA sequence of selected portions of an 
individual's genome. These sequences can be used to prepare PCR primers for amplifying 
and isolating such selected DNA, which can then be sequenced. Using this technique, 
individuals can be identified because each individual will have a unique set of DNA 
sequences. Once a unique ID database is established for an individual, positive 
identification of that individual, living or dead, can be made from extremely small tissue 
samples. 

Forensic biology also benefits from using DNA-based identification techniques as 
disclosed herein. DNA sequences taken from very small biological samples such as 
tissues, e.g., hair or skin, or body fluids, e.g., blood, saliva, semen, synovial fluid, amniotic 
fluid, breast milk, lymph, pulmonary sputum or surfactant, urine, fecal matter, etc., can be 
amplified using PCR. In one prior art technique, gene sequences amplified from 
polymorphic loci, such as DQa class II HLA gene, are used in forensic biology to identify 
individuals. (Erlich, H., PCR Technology, Freeman and Co. (1992).) Once these specific 
polymorphic loci are amplified, they are digested with one or more restriction enzymes, 
yielding an identifying set of bands on a Southern blot probed with DNA corresponding to 
the DQa class II HLA gene. Similarly, polynucleotides of the present invention can be 
used as polymorphic markers for forensic purposes. 

There is also a need for reagents capable of identifying the source of a particular 
tissue. Such need arises, for example, in forensics when presented with tissue of unknown 
origin. Appropriate reagents can comprise, for example, DNA probes or primers specific to 
particular tissue prepared from the sequences of the present invention. Panels of such 
reagents can identify tissue by species and/or by organ type. In a similar fashion, these 
reagents can be used to screen tissue cultures for contamination. 

At the very least, the polynucleotides of the present invention can be used as 
molecular weight markers on Southern gels, as diagnostic probes for the presence of a 
specific mRNA in a particular cell type, as a probe to "subtract-out" known sequences in 
the process of discovering novel polynucleotides, for selecting and making oligomers for 
attachment to a "gene chip" or other support, to raise anti-DNA antibodies using DNA 
immunization techniques, and as an antigen to elicit an immune response. 

Uses of the Polypeptides 

Each of the polypeptides identified herein can be used in numerous ways. The 
following description should be considered exemplary and utilizes known techniques. 

A polypeptide of the present invention can be used to assay protein levels in a 
biological sample using antibody-based techniques. For example, protein expression in 
tissues can be studied with classical immunohistological methods. (Jalkanen, M., et al., J. 
Cell. Biol. 101:976-985 (1985); Jalkanen, M., et al., J. Cell. Biol. 105:3087-3096 (1987).) 
Other antibody-based methods useful for detecting protein gene expression include 
immunoassays, such as the enzyme linked immunosorbent assay (ELISA) and the 
radioimmunoassay (RIA). Suitable antibody assay labels are known in the art and include 
enzyme labels, such as glucose oxidase, and radioisotopes, such as iodine (125I, 121I), 
carbon (14C), sulfur (35S), tritium (3H), indium (112In), and technetium (99mTc), and 
fluorescent labels, such as fluorescein and rhodamine, and biotin. 

In addition to assaying secreted protein levels in a biological sample, proteins can 
also be detected in vivo by imaging. Antibody labels or markers for in vivo imaging of 
protein include those detectable by X-radiography, NMR or ESR. For X-radiography, 
suitable labels include radioisotopes such as barium or cesium, which emit detectable 
radiation but are not overtly harmful to the subject. Suitable markers for NMR and ESR 
include those with a detectable characteristic spin, such as deuterium, which may be 
incorporated into the antibody by labeling of nutrients for the relevant hybridoma. 

A protein-specific antibody or antibody fragment which has been labeled with an 
appropriate detectable imaging moiety, such as a radioisotope (for example, 131I, 112In, 
99mTc), a radio-opaque substance, or a material detectable by nuclear magnetic resonance, 
is introduced (for example, parenterally, subcutaneously, or intraperitoneally) into the 
mammal. It will be understood in the art that the size of the subject and the imaging 
system used will determine the quantity of imaging moiety needed to produce diagnostic 
images. In the case of a radioisotope moiety, for a human subject, the quantity of 
radioactivity injected will normally range from about 5 to 20 millicuries of 99mTc. The 
labeled antibody or antibody fragment will then preferentially accumulate at the location of 
cells which contain the specific protein. In vivo tumor imaging is described in S.W. 
Burchiel et al., "Immunopharmacokinetics of Radiolabeled Antibodies and Their 
Fragments." (Chapter 13 in Tumor Imaging: The Radiochemical Detection of Cancer, 
S.W. Burchiel and B. A. Rhodes, eds., Masson Publishing Inc. (1982).) 

Thus, the invention provides a diagnostic method of a disorder, which involves (a) 
assaying the expression of a polypeptide of the present invention in cells or body fluid of 
an individual; and (b) comparing the level of gene expression with a standard gene expression 
level, whereby an increase or decrease in the assayed polypeptide gene expression level 
compared to the standard expression level is indicative of a disorder. With respect to 
cancer, the presence of a relatively high amount of transcript in biopsied tissue from an 
individual may indicate a predisposition for the development of the disease, or may 
provide a means for detecting the disease prior to the appearance of actual clinical 
symptoms. A more definitive diagnosis of this type may allow health professionals to 
employ preventative measures or aggressive treatment earlier thereby preventing the 
development or further progression of the cancer. 

Moreover, polypeptides of the present invention can be used to treat disease. For 
example, patients can be administered a polypeptide of the present invention in an effort 
to replace absent or decreased levels of the polypeptide (e.g., insulin), to supplement 
absent or decreased levels of a different polypeptide (e.g., hemoglobin S for hemoglobin B, 
SOD, catalase, DNA repair proteins), to inhibit the activity of a polypeptide (e.g., an 
oncogene or tumor suppressor), to activate the activity of a polypeptide (e.g., by binding to a 
receptor), to reduce the activity of a membrane bound receptor by competing with it for 
free ligand (e.g., soluble TNF receptors used in reducing inflammation), or to bring about 
a desired response (e.g., blood vessel growth inhibition, enhancement of the immune 
response to proliferative cells or tissues). 

Similarly, antibodies directed to a polypeptide of the present invention can also be 
used to treat disease. For example, administration of an antibody directed to a polypeptide 
of the present invention can bind and reduce overproduction of the polypeptide. Similarly, 
administration of an antibody can activate the polypeptide, such as by binding to a 
polypeptide bound to a membrane (receptor). 

At the very least, the polypeptides of the present invention can be used as 
molecular weight markers on SDS-PAGE gels or on molecular sieve gel filtration columns 
using methods well known to those of skill in the art. Polypeptides can also be used to 
raise antibodies, which in turn are used to measure protein expression from a recombinant 
cell, as a way of assessing transformation of the host cell. Moreover, the polypeptides of 
the present invention can be used to test the following biological activities. 

Gene Therapy Methods 

Another aspect of the present invention is directed to gene therapy methods for treating 
disorders, diseases and conditions. The gene therapy methods relate to the introduction of 
nucleic acid (DNA, RNA and antisense DNA or RNA) sequences into an animal to achieve 
expression of a polypeptide of the present invention. This method requires a 
polynucleotide which codes for a polypeptide of the invention that is operatively linked to a 
promoter and any other genetic elements necessary for the expression of the polypeptide 
by the target tissue. Such gene therapy and delivery techniques are known in the art, see, 
for example, WO90/11092, which is herein incorporated by reference. 

Thus, for example, cells from a patient may be engineered with a polynucleotide 
(DNA or RNA) comprising a promoter operably linked to a polynucleotide of the 
invention ex vivo, with the engineered cells then being provided to a patient to be treated 
with the polypeptide. Such methods are well-known in the art. For example, see 
Belldegrun et al., J. Natl. Cancer Inst., 85:207-216 (1993); Ferrantini et al., Cancer 
Research, 53:1107-1112 (1993); Ferrantini et al., J. Immunology 153: 4604-4615 (1994); 
Kaido, T., et al., Int. J. Cancer 60: 221-229 (1995); Ogura et al., Cancer Research 50: 
5102-5106 (1990); Santodonato, et al., Human Gene Therapy 7:1-10 (1996); Santodonato, 
et al., Gene Therapy 4:1246-1255 (1997); and Zhang, et al., Cancer Gene Therapy 3: 31-38 
(1996), which are herein incorporated by reference. In one embodiment, the cells which 
are engineered are arterial cells. The arterial cells may be reintroduced into the patient 
through direct injection to the artery, the tissues surrounding the artery, or through catheter 
injection. 

As discussed in more detail below, the polynucleotide constructs can be delivered 
by any method that delivers injectable materials to the cells of an animal, such as, injection 
into the interstitial space of tissues (heart, muscle, skin, lung, liver, and the like). The 
polynucleotide constructs may be delivered in a pharmaceutically acceptable liquid or 
aqueous carrier. 

In one embodiment, the polynucleotide of the invention is delivered as a naked 
polynucleotide. The term "naked" polynucleotide, DNA or RNA refers to sequences that 
are free from any delivery vehicle that acts to assist, promote or facilitate entry into the 
cell, including viral sequences, viral particles, liposome formulations, lipofectin or 
precipitating agents and the like. However, the polynucleotides of the invention can also 
be delivered in liposome formulations, lipofectin formulations and the like, which can be 
prepared by methods well known to those skilled in the art. Such methods are described, 
for example, in U.S. Patent Nos. 5,593,972, 5,589,466, and 5,580,859, which are herein 
incorporated by reference. 

The polynucleotide vector constructs of the invention used in the gene therapy 
method are preferably constructs that will not integrate into the host genome nor will they 
contain sequences that allow for replication. Appropriate vectors include pWLNEO, 
pSV2CAT, pOG44, pXT1 and pSG available from Stratagene; pSVK3, pBPV, pMSG and 
pSVL available from Pharmacia; and pEF1/V5, pcDNA3.1, and pRc/CMV2 available from 
Invitrogen. Other suitable vectors will be readily apparent to the skilled artisan. 

Any strong promoter known to those skilled in the art can be used for driving the 
expression of polynucleotide sequence of the invention. Suitable promoters include 
adenoviral promoters, such as the adenoviral major late promoter; or heterologous 
promoters, such as the cytomegalovirus (CMV) promoter; the respiratory syncytial virus 
(RSV) promoter; inducible promoters, such as the MMT promoter, the metallothionein 
promoter; heat shock promoters; the albumin promoter; the ApoAI promoter; human 
globin promoters; viral thymidine kinase promoters, such as the Herpes Simplex thymidine 
kinase promoter; retroviral LTRs; the β-actin promoter; and human growth hormone 
promoters. The promoter also may be the native promoter for the polynucleotides of the 
invention. 

Unlike other gene therapy techniques, one major advantage of introducing naked 
nucleic acid sequences into target cells is the transitory nature of the polynucleotide 
synthesis in the cells. Studies have shown that non-replicating DNA sequences can be 
introduced into cells to provide production of the desired polypeptide for periods of up to 
six months. 

The polynucleotide construct of the invention can be delivered to the interstitial 
space of tissues within an animal, including that of muscle, skin, brain, lung, liver, spleen, 
bone marrow, thymus, heart, lymph, blood, bone, cartilage, pancreas, kidney, gall bladder, 
stomach, intestine, testis, ovary, uterus, rectum, nervous system, eye, gland, and connective 
tissue. Interstitial space of the tissues comprises the intercellular, fluid, mucopolysaccharide 
matrix among the reticular fibers of organ tissues, elastic fibers in the walls of vessels or 
chambers, collagen fibers of fibrous tissues, or that same matrix within connective tissue 
ensheathing muscle cells or in the lacunae of bone. It is similarly the space occupied by the 
plasma of the circulation and the lymph fluid of the lymphatic channels. Delivery to the 
interstitial space of muscle tissue is preferred for the reasons discussed below. They may be 
conveniently delivered by injection into the tissues comprising these cells. They are 
preferably delivered to and expressed in persistent, non-dividing cells which are 
differentiated, although delivery and expression may be achieved in non-differentiated or 
less completely differentiated cells, such as, for example, stem cells of blood or skin 
fibroblasts. In vivo muscle cells are particularly competent in their ability to take up and 
express polynucleotides. 

For the naked nucleic acid sequence injection, an effective dosage amount of DNA 
or RNA will be in the range of from about 0.05 mg/kg body weight to about 50 mg/kg body 
weight. Preferably the dosage will be from about 0.005 mg/kg to about 20 mg/kg and more 
preferably from about 0.05 mg/kg to about 5 mg/kg. Of course, as the artisan of ordinary 
skill will appreciate, this dosage will vary according to the tissue site of injection. The 
appropriate and effective dosage of nucleic acid sequence can readily be determined by 
those of ordinary skill in the art and may depend on the condition being treated and the 
route of administration. 
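
By way of illustration only, the per-kilogram ranges recited above can be converted to a total amount for a given subject as sketched below. The function name `total_dose_mg`, the tier labels, and the example body weight are hypothetical and are not part of the disclosure; the sketch treats the "about" limits as exact numbers and does not account for tissue site or route of administration.

```python
# Ranges quoted in the text, in mg of nucleic acid per kg of body weight.
# The dictionary keys are illustrative labels, not terms from the source.
DOSE_RANGES_MG_PER_KG = {
    "preferred": (0.005, 20.0),
    "more preferred": (0.05, 5.0),
}


def total_dose_mg(body_weight_kg, tier="more preferred"):
    """Return the (low, high) total dose in mg for one subject."""
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    low_per_kg, high_per_kg = DOSE_RANGES_MG_PER_KG[tier]
    return body_weight_kg * low_per_kg, body_weight_kg * high_per_kg


if __name__ == "__main__":
    # Hypothetical 0.25 kg animal, chosen only to show the arithmetic.
    low, high = total_dose_mg(0.25)
    print(f"{low:.4f} mg to {high:.4f} mg")
```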

The preferred route of administration is by the parenteral route of injection into the 
interstitial space of tissues. However, other parenteral routes may also be used, such as 
inhalation of an aerosol formulation particularly for delivery to lungs or bronchial tissues, 
throat or mucous membranes of the nose. In addition, naked DNA constructs can be 
delivered to arteries during angioplasty by the catheter used in the procedure. 

The naked polynucleotides are delivered by any method known in the art, 
including, but not limited to, direct needle injection at the delivery site, intravenous 
injection, topical administration, catheter infusion, and so-called "gene guns". These 
delivery methods are known in the art. 

The constructs may also be delivered with delivery vehicles such as viral 
sequences, viral particles, liposome formulations, lipofectin, precipitating agents, etc. Such 
methods of delivery are known in the art. 

In certain embodiments, the polynucleotide constructs of the invention are 
complexed in a liposome preparation. Liposomal preparations for use in the instant 
invention include cationic (positively charged), anionic (negatively charged) and neutral 
preparations. However, cationic liposomes are particularly preferred because a tight charge 
complex can be formed between the cationic liposome and the polyanionic nucleic acid. 
Cationic liposomes have been shown to mediate intracellular delivery of plasmid DNA 
(Felgner et al., Proc. Natl. Acad. Sci. USA, 84:7413-7416 (1987), which is herein 
incorporated by reference); mRNA (Malone et al., Proc. Natl. Acad. Sci. USA, 
86:6077-6081 (1989), which is herein incorporated by reference); and purified 
transcription factors (Debs et al., J. Biol. Chem., 265:10189-10192 (1990), which is herein 
incorporated by reference), in functional form. 

Cationic liposomes are readily available. For example, 
N-[1-(2,3-dioleyloxy)propyl]-N,N,N-triethylammonium (DOTMA) liposomes are 
particularly useful and are available under the trademark Lipofectin, from GIBCO BRL, 
Grand Island, N.Y. (See, also, Felgner et al., Proc. Natl. Acad. Sci. USA, 84:7413-7416 
(1987), which is herein incorporated by reference). Other commercially available 
liposomes include transfectace (DDAB/DOPE) and DOTAP/DOPE (Boehringer). 

Other cationic liposomes can be prepared from readily available materials using 
techniques well known in the art. See, e.g., PCT Publication NO: WO 90/11092 (which is 
herein incorporated by reference) for a description of the synthesis of DOTAP 
(1,2-bis(oleoyloxy)-3-(trimethylammonio)propane) liposomes. Preparation of DOTMA 
liposomes is explained in the literature, see, e.g., Felgner et al., Proc. Natl. Acad. Sci. 
USA, 84:7413-7417, which is herein incorporated by reference. Similar methods can be 
used to prepare liposomes from other cationic lipid materials. 

Similarly, anionic and neutral liposomes are readily available, such as from Avanti 
Polar Lipids (Birmingham, Ala.), or can be easily prepared using readily available 
materials. Such materials include phosphatidyl choline, cholesterol, phosphatidyl 
ethanolamine, dioleoylphosphatidyl choline (DOPC), dioleoylphosphatidyl glycerol 
(DOPG), dioleoylphosphatidyl ethanolamine (DOPE), among others. These materials can 
also be mixed with the DOTMA and DOTAP starting materials in appropriate ratios. 
Methods for making liposomes using these materials are well known in the art. 

For example, commercially available dioleoylphosphatidyl choline (DOPC), 
dioleoylphosphatidyl glycerol (DOPG), and dioleoylphosphatidyl ethanolamine (DOPE) 
can be used in various combinations to make conventional liposomes, with or without the 
addition of cholesterol. Thus, for example, DOPG/DOPC vesicles can be prepared by 
drying 50 mg each of DOPG and DOPC under a stream of nitrogen gas into a sonication 
vial. The sample is placed under a vacuum pump overnight and is hydrated the following 
day with deionized water. The sample is then sonicated for 2 hours in a capped vial, using 
a Heat Systems model 350 sonicator equipped with an inverted cup (bath type) probe at the 
maximum setting while the bath is circulated at 15°C. Alternatively, negatively charged 
vesicles can be prepared without sonication to produce multilamellar vesicles or by 
extrusion through nucleopore membranes to produce unilamellar vesicles of discrete size. 
Other methods are known and available to those of skill in the art. 

The liposomes can comprise multilamellar vesicles (MLVs), small unilamellar 
vesicles (SUVs), or large unilamellar vesicles (LUVs), with SUVs being preferred. The 
various liposome-nucleic acid complexes are prepared using methods well known in the 
art. See, e.g., Straubinger et al., Methods of Immunology , 101:512-527 (1983), which is 
herein incorporated by reference. For example, MLVs containing nucleic acid can be 
prepared by depositing a thin film of phospholipid on the walls of a glass tube and 
subsequently hydrating with a solution of the material to be encapsulated. SUVs are 
prepared by extended sonication of MLVs to produce a homogeneous population of 
unilamellar liposomes. The material to be entrapped is added to a suspension of preformed 
MLVs and then sonicated. When using liposomes containing cationic lipids, the dried lipid 
film is resuspended in an appropriate solution such as sterile water or an isotonic buffer 
solution such as 10 mM Tris/NaCl, sonicated, and then the preformed liposomes are mixed 
directly with the DNA. The liposome and DNA form a very stable complex due to binding 
of the positively charged liposomes to the anionic DNA. SUVs find use with small nucleic 
acid fragments. LUVs are prepared by a number of methods, well known in the art. 
Commonly used methods include Ca2+-EDTA chelation (Papahadjopoulos et al., Biochim. 
Biophys. Acta, 394:483 (1975); Wilson et al., Cell , 17:77 (1979)); ether injection (Deamer 
et al., Biochim. Biophys. Acta, 443:629 (1976); Ostro et al., Biochem. Biophys. Res. 
Commun., 76:836 (1977); Fraley et al., Proc. Natl. Acad. Sci. USA, 76:3348 (1979)); 
detergent dialysis (Enoch et al., Proc. Natl. Acad. Sci. USA , 76:145 (1979)); and 
reverse-phase evaporation (REV) (Fraley et al., J. Biol. Chem., 255:10431 (1980); Szoka 
et al., Proc. Natl. Acad. Sci. USA , 75:145 (1978); Schaefer-Ridder et al., Science, 215:166 
(1982)), which are herein incorporated by reference. 

Generally, the ratio of DNA to liposomes will be from about 10:1 to about 1:10. 
Preferably, the ratio will be from about 5:1 to about 1:5. More preferably, the ratio will 
be about 3:1 to about 1:3. Still more preferably, the ratio will be about 1:1. 
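
As a purely illustrative aid, the nested ratio ranges recited above can be expressed as a small lookup, sketched below. The function name `classify_ratio` and the tier labels are hypothetical. The sketch assumes the ratio is given as whole-number parts of DNA to parts of liposome (the text does not state here whether the ratio is by mass or by mole) and treats the "about" limits as hard cutoffs, which the text itself does not require.

```python
from fractions import Fraction

# (label, lower bound, upper bound) for DNA:liposome, narrowest range first.
# Bounds come from the ranges in the text; labels are illustrative only.
TIERS = [
    ("still more preferred (about 1:1)", Fraction(1), Fraction(1)),
    ("more preferred (3:1 to 1:3)", Fraction(1, 3), Fraction(3)),
    ("preferred (5:1 to 1:5)", Fraction(1, 5), Fraction(5)),
    ("general (10:1 to 1:10)", Fraction(1, 10), Fraction(10)),
]


def classify_ratio(dna_parts, liposome_parts):
    """Return the narrowest recited range containing the DNA:liposome ratio."""
    if dna_parts <= 0 or liposome_parts <= 0:
        raise ValueError("both parts must be positive integers")
    ratio = Fraction(dna_parts, liposome_parts)  # exact arithmetic, no rounding
    for label, low, high in TIERS:
        if low <= ratio <= high:
            return label
    return "outside the stated ranges"


if __name__ == "__main__":
    for dna, lipo in [(1, 1), (2, 1), (1, 4), (8, 1), (20, 1)]:
        print(f"{dna}:{lipo} -> {classify_ratio(dna, lipo)}")
```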

U.S. Patent NO: 5,676,954 (which is herein incorporated by reference) reports on 
the injection of genetic material, complexed with cationic liposome carriers, into mice. 
U.S. Patent Nos. 4,897,355, 4,946,787, 5,049,386, 5,459,127, 5,589,466, 5,693,622, 
5,580,859, 5,703,055, and international publication NO: WO 94/9469 (which are herein 
incorporated by reference) provide cationic lipids for use in transfecting DNA into cells 
and mammals. U.S. Patent Nos. 5,589,466, 5,693,622, 5,580,859, 5,703,055, and 
international publication NO: WO 94/9469 (which are herein incorporated by reference) 
provide methods for delivering DNA-cationic lipid complexes to mammals. 

In certain embodiments, cells are engineered, ex vivo or in vivo, using a retroviral 
particle containing RNA which comprises a sequence encoding polypeptides of the 
invention. Retroviruses from which the retroviral plasmid vectors may be derived include, 
but are not limited to, Moloney Murine Leukemia Virus, spleen necrosis virus, Rous 
sarcoma Virus, Harvey Sarcoma Virus, avian leukosis virus, gibbon ape leukemia virus, 
human immunodeficiency virus, Myeloproliferative Sarcoma Virus, and mammary tumor 
virus. 

The retroviral plasmid vector is employed to transduce packaging cell lines to form 
producer cell lines. Examples of packaging cells which may be transfected include, but are 
not limited to, the PE501, PA317, ψ-2, ψ-AM, PA12, T19-14X, VT-19-17-H2, ψCRE, 
ψCRIP, GP+E-86, GP+envAm12, and DAN cell lines as described in Miller, Human Gene 
Therapy , 1:5-14 (1990), which is incorporated herein by reference in its entirety. The 
vector may transduce the packaging cells through any means known in the art. Such 
means include, but are not limited to, electroporation, the use of liposomes, and CaPO4 
precipitation. In one alternative, the retroviral plasmid vector may be encapsulated into a 
liposome, or coupled to a lipid, and then administered to a host. 

The producer cell line generates infectious retroviral vector particles which include 
polynucleotide encoding polypeptides of the invention. Such retroviral vector particles 
then may be employed to transduce eukaryotic cells, either in vitro or in vivo. The 
transduced eukaryotic cells will express polypeptides of the invention. 

In certain other embodiments, cells are engineered, ex vivo or in vivo, with 
polynucleotides of the invention contained in an adenovirus vector. Adenovirus can be 
manipulated such that it encodes and expresses polypeptides of the invention, and at the 
same time is inactivated in terms of its ability to replicate in a normal lytic viral life cycle. 
Adenovirus expression is achieved without integration of the viral DNA into the host cell 
chromosome, thereby alleviating concerns about insertional mutagenesis. Furthermore, 
adenoviruses have been used as live enteric vaccines for many years with an excellent 
safety profile (Schwartz et al., Am. Rev. Respir. Dis., 109:233-238 (1974)). Finally, 
adenovirus mediated gene transfer has been demonstrated in a number of instances 
including transfer of alpha-1-antitrypsin and CFTR to the lungs of cotton rats (Rosenfeld et 
al., Science, 252:431-434 (1991); Rosenfeld et al., Cell, 68:143-155 (1992)). Furthermore, 
extensive studies to attempt to establish adenovirus as a causative agent in human cancer 
were uniformly negative (Green et al. Proc. Natl. Acad. Sci. USA , 76:6606 (1979)). 

Suitable adenoviral vectors useful in the present invention are described, for 
example, in Kozarsky and Wilson, Curr. Opin. Genet. Devel., 3:499-503 (1993); Rosenfeld 
et al., Cell, 68:143-155 (1992); Engelhardt et al., Human Genet. Ther., 4:759-769 (1993); 
Yang et al., Nature Genet., 7:362-369 (1994); Wilson et al., Nature, 365:691-692 (1993); 
and U.S. Patent NO: 5,652,224, which are herein incorporated by reference. For example, 
the adenovirus vector Ad2 is useful and can be grown in human 293 cells. These cells 
contain the E1 region of adenovirus and constitutively express E1a and E1b, which 
complement the defective adenoviruses by providing the products of the genes deleted 
from the vector. In addition to Ad2, other varieties of adenovirus (e.g., Ad3, Ad5, and 
Ad7) are also useful in the present invention. 

Preferably, the adenoviruses used in the present invention are replication deficient. 
Replication deficient adenoviruses require the aid of a helper virus and/or packaging cell 
line to form infectious particles. The resulting virus is capable of infecting cells and can 
express a polynucleotide of interest which is operably linked to a promoter, but cannot 
replicate in most cells. Replication deficient adenoviruses may be deleted in one or more 
of all or a portion of the following genes: E1a, E1b, E3, E4, E2a, or L1 through L5. 

In certain other embodiments, the cells are engineered, ex vivo or in vivo, using an 
adeno-associated virus (AAV). AAVs are naturally occurring defective viruses that 
require helper viruses to produce infectious particles (Muzyczka, Curr. Topics in 
Microbiol. Immunol., 158:97 (1992)). AAV is also one of the few viruses that may integrate 
its DNA into non-dividing cells. Vectors containing as little as 300 base pairs of AAV can 
be packaged and can integrate, but space for exogenous DNA is limited to about 4.5 kb. 
Methods for producing and using such AAVs are known in the art. See, for example, U.S. 
Patent Nos. 5,139,941, 5,173,414, 5,354,678, 5,436,146, 5,474,935, 5,478,745, and 
5,589,377. 

For example, an appropriate AAV vector for use in the present invention will 
include all the sequences necessary for DNA replication, encapsidation, and host-cell 
integration. The polynucleotide construct containing polynucleotides of the invention is 
inserted into the AAV vector using standard cloning methods, such as those found in 
Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Press 
(1989). The recombinant AAV vector is then transfected into packaging cells which are 
infected with a helper virus, using any standard technique, including lipofection, 
electroporation, calcium phosphate precipitation, etc. Appropriate helper viruses include 
adenoviruses, cytomegaloviruses, vaccinia viruses, or herpes viruses. Once the packaging 
cells are transfected and infected, they will produce infectious AAV viral particles which 
contain the polynucleotide construct of the invention. These viral particles are then used to 
transduce eukaryotic cells, either ex vivo or in vivo. The transduced cells will contain the 
polynucleotide construct integrated into their genome, and will express the desired gene 
product. 

Another method of gene therapy involves operably associating heterologous control 
regions and endogenous polynucleotide sequences (e.g. encoding the polypeptide sequence 
of interest) via homologous recombination (see, e.g., U.S. Patent NO: 5,641,670, issued 
June 24, 1997; International Publication NO: WO 96/29411, published September 26, 
1996; International Publication NO: WO 94/12650, published August 4, 1994; Koller et 
al., Proc. Natl. Acad. Sci. USA, 86:8932-8935 (1989); and Zijlstra et al., Nature, 
342:435-438 (1989)). This method involves the activation of a gene which is present in the 
target cells, but which is not normally expressed in the cells, or is expressed at a lower 
level than desired. 

Polynucleotide constructs are made, using standard techniques known in the art, 
which contain the promoter with targeting sequences flanking the promoter. Suitable 
promoters are described herein. The targeting sequence is sufficiently complementary to 
an endogenous sequence to permit homologous recombination of the promoter-targeting 
sequence with the endogenous sequence. The targeting sequence will be sufficiently near 
the 5' end of the desired endogenous polynucleotide sequence so the promoter will be 
operably linked to the endogenous sequence upon homologous recombination. 

The promoter and the targeting sequences can be amplified using PCR. Preferably, 
the amplified promoter contains distinct restriction enzyme sites on the 5' and 3' ends. 
Preferably, the 3' end of the first targeting sequence contains the same restriction enzyme 
site as the 5' end of the amplified promoter and the 5' end of the second targeting 
sequence contains the same restriction site as the 3' end of the amplified promoter. The 
amplified promoter and targeting sequences are digested and ligated together. 
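
The end-matching preference described above can be checked on paper before any fragment is amplified. The following is a minimal sketch of such a check, not a cloning protocol; the class `Fragment`, the function `check_construct`, and the enzyme names used in the example are hypothetical choices made only for illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Fragment:
    """A PCR product described only by the restriction sites at its two ends."""
    name: str
    site_5prime: str
    site_3prime: str


def check_construct(targeting_1: Fragment, promoter: Fragment,
                    targeting_2: Fragment) -> List[str]:
    """Return a list of violations of the stated end-matching preferences."""
    problems = []
    # The amplified promoter should carry distinct sites on its 5' and 3' ends.
    if promoter.site_5prime == promoter.site_3prime:
        problems.append("promoter 5' and 3' sites are not distinct")
    # 3' end of the first targeting sequence matches the promoter's 5' end.
    if targeting_1.site_3prime != promoter.site_5prime:
        problems.append("first targeting sequence 3' site does not match promoter 5' site")
    # 5' end of the second targeting sequence matches the promoter's 3' end.
    if targeting_2.site_5prime != promoter.site_3prime:
        problems.append("second targeting sequence 5' site does not match promoter 3' site")
    return problems


if __name__ == "__main__":
    # Enzyme names below are arbitrary examples, not taken from the source.
    t1 = Fragment("targeting sequence 1", "HindIII", "BamHI")
    pr = Fragment("promoter", "BamHI", "XbaI")
    t2 = Fragment("targeting sequence 2", "XbaI", "EcoRI")
    print(check_construct(t1, pr, t2) or "design satisfies the stated preferences")
```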

The promoter-targeting sequence construct is delivered to the cells, either as naked 
polynucleotide, or in conjunction with transfection-facilitating agents, such as liposomes, 
viral sequences, viral particles, whole viruses, lipofection, precipitating agents, etc., 
described in more detail above. The promoter-targeting sequence can be delivered by 
any method, including direct needle injection, intravenous injection, topical administration, 
catheter infusion, particle accelerators, etc. The methods are described in more detail 
below. 

The promoter-targeting sequence construct is taken up by cells. Homologous 
recombination between the construct and the endogenous sequence takes place, such that 
an endogenous sequence is placed under the control of the promoter. The promoter then 
drives the expression of the endogenous sequence. 

The polynucleotides encoding polypeptides of the present invention may be 
administered along with other polynucleotides encoding other angiogenic proteins. 
Angiogenic proteins include, but are not limited to, acidic and basic fibroblast growth 
factors, VEGF-1, VEGF-2 (VEGF-C), VEGF-3 (VEGF-B), epidermal growth factor alpha 
and beta, platelet-derived endothelial cell growth factor, platelet-derived growth factor, 
tumor necrosis factor alpha, hepatocyte growth factor, insulin like growth factor, colony 
stimulating factor, macrophage colony stimulating factor, granulocyte/macrophage colony 
stimulating factor, and nitric oxide synthase. 

Preferably, the polynucleotide encoding a polypeptide of the invention contains a 
secretory signal sequence that facilitates secretion of the protein. Typically, the signal 
sequence is positioned in the coding region of the polynucleotide to be expressed towards 
or at the 5' end of the coding region. The signal sequence may be homologous or 
heterologous to the polynucleotide of interest and may be homologous or heterologous to 
the cells to be transfected. Additionally, the signal sequence may be chemically 
synthesized using methods known in the art. 

Any mode of administration of any of the above-described polynucleotide 
constructs can be used so long as the mode results in the expression of one or more 
molecules in an amount sufficient to provide a therapeutic effect. This includes direct 
needle injection, systemic injection, catheter infusion, biolistic injectors, particle 
accelerators (i.e., "gene guns"), gelfoam sponge depots, other commercially available 
depot materials, osmotic pumps (e.g., Alza minipumps), oral or suppositorial solid (tablet 
or pill) pharmaceutical formulations, and decanting or topical applications during surgery. 
For example, direct injection of naked calcium phosphate-precipitated plasmid into rat 
liver and rat spleen or a protein-coated plasmid into the portal vein has resulted in gene 
expression of the foreign gene in the rat livers (Kaneda et al., Science, 243:375 (1989)). 
A preferred method of local administration is by direct injection. Preferably, a 
recombinant molecule of the present invention complexed with a delivery vehicle is 
administered by direct injection into or locally within the area of arteries. Administration 
of a composition locally within the area of arteries refers to injecting the composition 
centimeters and preferably, millimeters within arteries. 

Another method of local administration is to contact a polynucleotide construct of 
the present invention in or around a surgical wound. For example, a patient can undergo 
surgery and the polynucleotide construct can be coated on the surface of tissue inside the 
wound or the construct can be injected into areas of tissue inside the wound. 

Therapeutic compositions useful in systemic administration include recombinant 
molecules of the present invention complexed to a targeted delivery vehicle of the present 
invention. Suitable delivery vehicles for use with systemic administration comprise 
liposomes comprising ligands for targeting the vehicle to a particular site. 

Preferred methods of systemic administration include intravenous injection, 
aerosol, oral and percutaneous (topical) delivery. Intravenous injections can be performed 
using methods standard in the art. Aerosol delivery can also be performed using methods 
standard in the art (see, for example, Stribling et al., Proc. Natl. Acad. Sci. USA, 
89:11277-11281 (1992), which is incorporated herein by reference). Oral delivery can be 
performed by complexing a polynucleotide construct of the present invention to a carrier 
capable of withstanding degradation by digestive enzymes in the gut of an animal. 
Examples of such carriers include plastic capsules or tablets, such as those known in the 
art. Topical delivery can be performed by mixing a polynucleotide construct of the present 
invention with a lipophilic reagent (e.g., DMSO) that is capable of passing into the skin. 

Determining an effective amount of substance to be delivered can depend upon a 
number of factors including, for example, the chemical structure and biological activity of 
the substance, the age and weight of the animal, the precise condition requiring treatment 
and its severity, and the route of administration. The frequency of treatments depends 
upon a number of factors, such as the amount of polynucleotide constructs administered 
per dose, as well as the health and history of the subject. The precise amount, number of 
doses, and timing of doses will be determined by the attending physician or veterinarian. 
Therapeutic compositions of the present invention can be administered to any animal, 
preferably to mammals and birds. Preferred mammals include humans, dogs, cats, mice, 
rats, rabbits, sheep, cattle, horses and pigs, with humans being particularly preferred. 

Biological Activities 

The polynucleotides or polypeptides, or agonists or antagonists of the present 
invention can be used in assays to test for one or more biological activities. If these 
polynucleotides and polypeptides do exhibit activity in a particular assay, it is likely that 
these molecules may be involved in the diseases associated with the biological activity. 
Thus, the polynucleotides or polypeptides, or agonists or antagonists could be used to treat 
the associated disease. 

Immune Activity 

The polynucleotides or polypeptides, or agonists or antagonists of the present 
invention may be useful in treating deficiencies or disorders of the immune system, by 
activating or inhibiting the proliferation, differentiation, or mobilization (chemotaxis) of 
immune cells. Immune cells develop through a process called hematopoiesis, producing 
myeloid (platelets, red blood cells, neutrophils, and macrophages) and lymphoid (B and T 
lymphocytes) cells from pluripotent stem cells. The etiology of these immune deficiencies 
or disorders may be genetic, somatic, such as cancer or some autoimmune disorders, 
acquired (e.g., by chemotherapy or toxins), or infectious. Moreover, polynucleotides or 
polypeptides, or agonists or antagonists of the present invention can be used as a marker or 
detector of a particular immune system disease or disorder. 

Polynucleotides or polypeptides, or agonists or antagonists of the present 
invention may be useful in treating or detecting deficiencies or disorders of hematopoietic 
cells. Polynucleotides or polypeptides, or agonists or antagonists of the present 
invention could be used to increase differentiation and proliferation of hematopoietic cells, 
including the pluripotent stem cells, in an effort to treat those disorders associated with a 
decrease in certain (or many) types of hematopoietic cells. Examples of immunologic 
deficiency syndromes include, but are not limited to: blood protein disorders (e.g., 
agammaglobulinemia, dysgammaglobulinemia), ataxia telangiectasia, common variable 
immunodeficiency, DiGeorge Syndrome, HIV infection, HTLV-BLV infection, leukocyte 
adhesion deficiency syndrome, lymphopenia, phagocyte bactericidal dysfunction, severe 
combined immunodeficiency (SCIDs), Wiskott-Aldrich Disorder, anemia, 
thrombocytopenia, or hemoglobinuria. 

Moreover, polynucleotides or polypeptides, or agonists or antagonists of the 
present invention could also be used to modulate hemostatic (the stopping of bleeding) or 
thrombolytic activity (clot formation). For example, by increasing hemostatic or 
thrombolytic activity, polynucleotides or polypeptides, or agonists or antagonists of the 
present invention could be used to treat blood coagulation disorders (e.g., afibrinogenemia, 
factor deficiencies), blood platelet disorders (e.g., thrombocytopenia), or wounds resulting 
from trauma, surgery, or other causes. Alternatively, polynucleotides or polypeptides, or 
agonists or antagonists of the present invention that can decrease hemostatic or 
thrombolytic activity could be used to inhibit or dissolve clotting. These molecules could 
be important in the treatment of heart attacks (infarction), strokes, or scarring. 

Polynucleotides or polypeptides, or agonists or antagonists of the present 
invention may also be useful in treating or detecting autoimmune disorders. Many 
autoimmune disorders result from inappropriate recognition of self as foreign material by 
immune cells. This inappropriate recognition results in an immune response leading to the 
destruction of the host tissue. Therefore, the administration of polynucleotides or 
polypeptides, or agonists or antagonists of the present invention that inhibit an immune 
response, particularly the proliferation, differentiation, or chemotaxis of T-cells, may be an 
effective therapy in preventing autoimmune disorders. 

Examples of autoimmune disorders that can be treated or detected by the present 
invention include, but are not limited to: Addison's Disease, hemolytic anemia, 
antiphospholipid syndrome, rheumatoid arthritis, dermatitis, allergic encephalomyelitis, 
glomerulonephritis, Goodpasture's Syndrome, Graves' Disease, Multiple Sclerosis, 
Myasthenia Gravis, Neuritis, Ophthalmia, Bullous Pemphigoid, Pemphigus, 
Polyendocrinopathies, Purpura, Reiter's Disease, Stiff-Man Syndrome, Autoimmune 
Thyroiditis, Systemic Lupus Erythematosus, Autoimmune Pulmonary Inflammation, 
Guillain-Barre Syndrome, insulin dependent diabetes mellitus, and autoimmune 
inflammatory eye disease. 

Similarly, allergic reactions and conditions, such as asthma (particularly allergic 
asthma) or other respiratory problems, may also be treated by polynucleotides or 
polypeptides, or agonists or antagonists of the present invention. Moreover, these 
molecules can be used to treat anaphylaxis, hypersensitivity to an antigenic molecule, or 
blood group incompatibility. 

Polynucleotides or polypeptides, or agonists or antagonists of the present 
invention may also be used to treat and/or prevent organ rejection or graft-versus-host 
disease (GVHD). Organ rejection occurs by host immune cell destruction of the 
transplanted tissue through an immune response. Similarly, an immune response is also 
involved in GVHD, but, in this case, the foreign transplanted immune cells destroy the host 
tissues. The administration of polynucleotides or polypeptides, or agonists or antagonists 
of the present invention that inhibit an immune response, particularly the proliferation, 
differentiation, or chemotaxis of T-cells, may be an effective therapy in preventing organ 
rejection or GVHD. 

Similarly, polynucleotides or polypeptides, or agonists or antagonists of the 
present invention may also be used to modulate inflammation. For example, the 
polypeptide or polynucleotide, or agonist or antagonist, may inhibit the proliferation and 
differentiation of cells involved in an inflammatory response. These molecules can be 
used to treat inflammatory conditions, both chronic and acute conditions, including 
inflammation associated with infection (e.g., septic shock, sepsis, or systemic 
inflammatory response syndrome (SIRS)), ischemia-reperfusion injury, endotoxin 
lethality, arthritis, complement-mediated hyperacute rejection, nephritis, cytokine or 
chemokine induced lung injury, inflammatory bowel disease, Crohn's disease, or resulting 
from over production of cytokines (e.g., TNF or IL-1). 

Hyperproliferative Disorders 

Polynucleotides or polypeptides, or agonists or antagonists of the invention can 
be used to treat or detect hyperproliferative disorders, including neoplasms. 
Polynucleotides or polypeptides, or agonists or antagonists of the present invention may 
inhibit the proliferation of the disorder through direct or indirect interactions. 
Alternatively, polynucleotides or polypeptides, or agonists or antagonists of the present 
invention may stimulate the proliferation of other cells which can inhibit the hyperproliferative disorder. 

For example, by increasing an immune response, particularly increasing antigenic 
qualities of the hyperproliferative disorder or by proliferating, differentiating, or 
mobilizing T-cells, hyperproliferative disorders can be treated. This immune response may 
be increased by either enhancing an existing immune response, or by initiating a new 
immune response. Alternatively, decreasing an immune response may also be a method of 
treating hyperproliferative disorders, such as with a chemotherapeutic agent. 

Examples of hyperproliferative disorders that can be treated or detected by 
polynucleotides or polypeptides, or agonists or antagonists of the present invention 
include, but are not limited to neoplasms located in the: abdomen, bone, breast, digestive 
system, liver, pancreas, peritoneum, endocrine glands (adrenal, parathyroid, pituitary, 
testicles, ovary, thymus, thyroid), eye, head and neck, nervous (central and peripheral), 
lymphatic system, pelvic, skin, soft tissue, spleen, thoracic, and urogenital. 

Similarly, other hyperproliferative disorders can also be treated or detected by 
polynucleotides or polypeptides, or agonists or antagonists of the present invention. 
Examples of such hyperproliferative disorders include, but are not limited to: 
hypergammaglobulinemia, lymphoproliferative disorders, paraproteinemias, purpura, 
sarcoidosis, Sezary Syndrome, Waldenstrom's Macroglobulinemia, Gaucher's Disease, 
histiocytosis, and any other hyperproliferative disease, besides neoplasia, located in an 
organ system listed above. 

One preferred embodiment utilizes polynucleotides of the present invention to 
inhibit aberrant cellular division, by gene therapy using the present invention, and/or 
protein fusions or fragments thereof. 

Thus, the present invention provides a method for treating cell proliferative 
disorders by inserting into an abnormally proliferating cell a polynucleotide of the present 
invention, wherein said polynucleotide represses said expression. 

Another embodiment of the present invention provides a method of treating
cell-proliferative disorders in individuals comprising administration of one or more active gene
copies of the present invention to an abnormally proliferating cell or cells. In a preferred
embodiment, the polynucleotide of the present invention is a DNA construct comprising a
recombinant expression vector effective in expressing a DNA sequence encoding said
polynucleotide. In another preferred embodiment of the present invention, the DNA
construct encoding the polynucleotides of the present invention is inserted into cells to be
treated utilizing a retrovirus, or more preferably an adenoviral vector (See G. J. Nabel, et
al., PNAS 1999 96: 324-326, which is hereby incorporated by reference). In a most
preferred embodiment, the viral vector is defective and will not transform non-proliferating 
cells, only proliferating cells. Moreover, in a preferred embodiment, the polynucleotides 
of the present invention inserted into proliferating cells either alone, or in combination 
with or fused to other polynucleotides, can then be modulated via an external stimulus (i.e. 
magnetic, specific small molecule, chemical, or drug administration, etc.), which acts upon 
the promoter upstream of said polynucleotides to induce expression of the encoded protein 
product. As such the beneficial therapeutic effect of the present invention may be
expressly modulated (i.e. to increase, decrease, or inhibit expression of the present 
invention) based upon said external stimulus. 

Polynucleotides of the present invention may be useful in repressing expression of 
oncogenic genes or antigens. By "repressing expression of the oncogenic genes" is
intended the suppression of the transcription of the gene, the degradation of the gene
transcript (pre-message RNA), the inhibition of splicing, the destruction of the messenger
RNA, the prevention of the post-translational modifications of the protein, the destruction
of the protein, or the inhibition of the normal function of the protein. 

For local administration to abnormally proliferating cells, polynucleotides of the 
present invention may be administered by any method known to those of skill in the art 
including, but not limited to transfection, electroporation, microinjection of cells, or in 
vehicles such as liposomes, lipofectin, or as naked polynucleotides, or any other method 
described throughout the specification. The polynucleotide of the present invention may 
be delivered by known gene delivery systems such as, but not limited to, retroviral vectors 
(Gilboa, J. Virology 44:845 (1982); Hocke, Nature 320:275 (1986); Wilson, et al., Proc. 
Natl. Acad. Sci. U.S.A. 85:3014), vaccinia virus system (Chakrabarty et al., Mol. Cell
Biol. 5:3403 (1985)) or other efficient DNA delivery systems (Yates et al., Nature 313:812
(1985)) known to those skilled in the art. These references are exemplary only and are 
hereby incorporated by reference. In order to specifically deliver or transfect cells which
are abnormally proliferating and spare non-dividing cells, it is preferable to utilize a
retroviral or adenoviral (as described in the art and elsewhere herein) delivery system
known to those of skill in the art. Since host DNA replication is required for retroviral
DNA to integrate, and the retrovirus will be unable to self-replicate due to the lack of the
retrovirus genes needed for its life cycle, utilizing such a retroviral delivery system for
polynucleotides of the present invention will target said gene and constructs to abnormally
proliferating cells and will spare the non-dividing normal cells.

The polynucleotides of the present invention may be delivered directly to cell 
proliferative disorder/disease sites in internal organs, body cavities and the like by use of 
imaging devices used to guide an injecting needle directly to the disease site. The 
polynucleotides of the present invention may also be administered to disease sites at the 
time of surgical intervention. 

By "cell proliferative disease" is meant any human or animal disease or disorder, 
affecting any one or any combination of organs, cavities, or body parts, which is 
characterized by single or multiple local abnormal proliferations of cells, groups of cells,
or tissues, whether benign or malignant. 

Any amount of the polynucleotides of the present invention may be administered as 
long as it has a biologically inhibiting effect on the proliferation of the treated cells. 
Moreover, it is possible to administer more than one of the polynucleotides of the present
invention simultaneously to the same site. By "biologically inhibiting" is meant partial or 
total growth inhibition as well as decreases in the rate of proliferation or growth of the 
cells. The biologically inhibitory dose may be determined by assessing the effects of the 
polynucleotides of the present invention on target malignant or abnormally proliferating 
cell growth in tissue culture, tumor growth in animals and cell cultures, or any other 
method known to one of ordinary skill in the art. 

The present invention is further directed to antibody-based therapies which involve
administering anti-polypeptide and anti-polynucleotide antibodies to a mammalian,
preferably human, patient for treating one or more of the described disorders. Methods for
producing anti-polypeptide and anti-polynucleotide polyclonal and monoclonal
antibodies are described in detail elsewhere herein. Such antibodies may be provided in
pharmaceutically acceptable compositions as known in the art or as described herein.

A summary of the ways in which the antibodies of the present invention may be
used therapeutically includes binding polynucleotides or polypeptides of the present
invention locally or systemically in the body, or direct cytotoxicity of the antibody, e.g.,
as mediated by complement (CDC) or by effector cells (ADCC). Some of these
approaches are described in more detail below. Armed with the teachings provided herein, 
one of ordinary skill in the art will know how to use the antibodies of the present invention 
for diagnostic, monitoring or therapeutic purposes without undue experimentation. 

In particular, the antibodies, fragments and derivatives of the present invention are 
useful for treating a subject having or developing cell proliferative and/or differentiation 
disorders as described herein. Such treatment comprises administering a single or multiple 
doses of the antibody, or a fragment, derivative, or a conjugate thereof. 

The antibodies of this invention may be advantageously utilized in combination
with other monoclonal or chimeric antibodies, or with lymphokines or hematopoietic 
growth factors, for example, which serve to increase the number or activity of effector 
cells which interact with the antibodies. 

It is preferred to use high affinity and/or potent in vivo inhibiting and/or 
neutralizing antibodies against polypeptides or polynucleotides of the present invention, 
fragments or regions thereof, for both immunoassays directed to and therapy of disorders 
related to polynucleotides or polypeptides, including fragments thereof, of the present
invention. Such antibodies, fragments, or regions, will preferably have an affinity for
polynucleotides or polypeptides, including fragments thereof. Preferred binding affinities
include those with a dissociation constant or Kd less than 5x10^-6 M, 10^-6 M, 5x10^-7 M,
10^-7 M, 5x10^-8 M, 10^-8 M, 5x10^-9 M, 10^-9 M, 5x10^-10 M, 10^-10 M, 5x10^-11 M, 10^-11 M,
5x10^-12 M, 10^-12 M, 5x10^-13 M, 10^-13 M, 5x10^-14 M, 10^-14 M, 5x10^-15 M, and 10^-15 M.
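
As a rough illustration of how the preferred Kd ceilings recited above might be applied, the following Python sketch enumerates the twenty values (5x10^-6 M down to 10^-15 M) and reports which of them a measured dissociation constant falls below. The names KD_CEILINGS_M, ceilings_met, and tightest_ceiling_met are hypothetical and are not part of the specification, and the example Kd of 2x10^-9 M is invented purely for illustration.

```python
from decimal import Decimal

# Preferred Kd ceilings from the text, in mol/L, ordered from the loosest
# (5x10^-6 M) to the most stringent (10^-15 M). Decimal avoids float rounding
# when comparing values that sit exactly on a boundary.
KD_CEILINGS_M = [
    Decimal(mantissa).scaleb(-exponent)
    for exponent in range(6, 16)
    for mantissa in (5, 1)
]


def ceilings_met(kd_molar):
    """Return every listed ceiling that the measured Kd (mol/L) is strictly below."""
    kd = Decimal(str(kd_molar))
    return [ceiling for ceiling in KD_CEILINGS_M if kd < ceiling]


def tightest_ceiling_met(kd_molar):
    """Return the most stringent listed ceiling satisfied, or None if none is."""
    met = ceilings_met(kd_molar)
    return min(met) if met else None


if __name__ == "__main__":
    example_kd = "2e-9"  # hypothetical measurement, not taken from the text
    print(tightest_ceiling_met(example_kd))
```

The comparison is strict because the text recites a Kd "less than" each value, so a measurement sitting exactly on a ceiling is credited only with the next looser one.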

Moreover, polypeptides of the present invention are useful in inhibiting the 
angiogenesis of proliferative cells or tissues, either alone, as a protein fusion, or in 
combination with other polypeptides directly or indirectly, as described elsewhere herein. 
In a most preferred embodiment, said anti-angiogenesis effect may be achieved indirectly,
for example, through the inhibition of hematopoietic, tumor-specific cells, such as
tumor-associated macrophages (See Joseph IB, et al. J Natl Cancer Inst, 90(21):1648-53 (1998),
which is hereby incorporated by reference). Antibodies directed to polypeptides or 
polynucleotides of the present invention may also result in inhibition of angiogenesis 
directly, or indirectly (See Witte L, et al., Cancer Metastasis Rev. 17(2):155-61 (1998), 
which is hereby incorporated by reference).

Polypeptides, including protein fusions, of the present invention, or fragments 
thereof may be useful in inhibiting proliferative cells or tissues through the induction of 
apoptosis. Said polypeptides may act either directly, or indirectly to induce apoptosis of 
proliferative cells and tissues, for example in the activation of a death-domain receptor, 
such as tumor necrosis factor (TNF) receptor-1, CD95 (Fas/APO-1), TNF-receptor-related
apoptosis-mediated protein (TRAMP) and TNF-related apoptosis-inducing ligand (TRAIL)
receptor-1 and -2 (See Schulze-Osthoff K, et al., Eur J Biochem 254(3):439-59 (1998),
which is hereby incorporated by reference). Moreover, in another preferred embodiment 
of the present invention, said polypeptides may induce apoptosis through other 
mechanisms, such as in the activation of other proteins which will activate apoptosis, or 
through stimulating the expression of said proteins, either alone or in combination with 
small molecule drugs or adjuvants, such as apoptonin, galectins, thioredoxins,
antiinflammatory proteins (See for example, Mutat Res 400(1-2):447-55 (1998), Med
Hypotheses 50(5):423-33 (1998), Chem Biol Interact. Apr 24;111-112:23-34 (1998), J Mol
Med 76(6):402-12 (1998), Int J Tissue React 20(1):3-15 (1998), which are all hereby
incorporated by reference). 

Polypeptides, including protein fusions to, or fragments thereof, of the present 
invention are useful in inhibiting the metastasis of proliferative cells or tissues. Inhibition 
may occur as a direct result of administering polypeptides, or antibodies directed to said 
polypeptides as described elsewhere herein, or indirectly, such as activating the expression
of proteins known to inhibit metastasis, for example alpha 4 integrins, (See, e.g., Curr Top
Microbiol Immunol 1998;231:125-41, which is hereby incorporated by reference). Such
therapeutic effects of the present invention may be achieved either alone, or in
combination with small molecule drugs or adjuvants. 

In another embodiment, the invention provides a method of delivering 
compositions containing the polypeptides of the invention (e.g., compositions containing 
polypeptides or polypeptide antibodies associated with heterologous polypeptides,
heterologous nucleic acids, toxins, or prodrugs) to targeted cells expressing the polypeptide
of the present invention. Polypeptides or polypeptide antibodies of the invention may be
associated with heterologous polypeptides, heterologous nucleic acids, toxins, or
prodrugs via hydrophobic, hydrophilic, ionic and/or covalent interactions.

Polypeptides, protein fusions to, or fragments thereof, of the present invention are useful in
enhancing the immunogenicity and/or antigenicity of proliferating cells or tissues, either
directly, such as would occur if the polypeptides of the present invention 'vaccinated' the
immune response to respond to proliferative antigens and immunogens, or indirectly, such 
as in activating the expression of proteins known to enhance the immune response (e.g. 
chemokines), to said antigens and immunogens. 

Cardiovascular Disorders 

Polynucleotides or polypeptides, or agonists or antagonists of the invention may be 
used to treat cardiovascular disorders, including peripheral artery disease, such as limb 
ischemia. 

Cardiovascular disorders include cardiovascular abnormalities, such as
arterio-arterial fistula, arteriovenous fistula, cerebral arteriovenous malformations, congenital
heart defects, pulmonary atresia, and Scimitar Syndrome. Congenital heart defects 
include aortic coarctation, cor triatriatum, coronary vessel anomalies, crisscross heart, 
dextrocardia, patent ductus arteriosus, Ebstein's anomaly, Eisenmenger complex, 
hypoplastic left heart syndrome, levocardia, tetralogy of Fallot, transposition of great
vessels, double outlet right ventricle, tricuspid atresia, persistent truncus arteriosus, and
heart septal defects, such as aortopulmonary septal defect, endocardial cushion defects,
Lutembacher's Syndrome, trilogy of Fallot, and ventricular heart septal defects.

Cardiovascular disorders also include heart disease, such as arrhythmias, carcinoid 
heart disease, high cardiac output, low cardiac output, cardiac tamponade, endocarditis 
(including bacterial), heart aneurysm, cardiac arrest, congestive heart failure, congestive 
cardiomyopathy, paroxysmal dyspnea, cardiac edema, heart hypertrophy,
left ventricular hypertrophy, right ventricular hypertrophy, post-infarction
heart rupture, ventricular septal rupture, heart valve diseases, myocardial diseases, 
myocardial ischemia, pericardial effusion, pericarditis (including constrictive and 
tuberculous), pneumopericardium, postpericardiotomy syndrome, pulmonary heart disease, 
rheumatic heart disease, ventricular dysfunction, hyperemia, cardiovascular pregnancy 
complications, Scimitar Syndrome, cardiovascular syphilis, and cardiovascular
tuberculosis. 

Arrhythmias include sinus arrhythmia, atrial fibrillation, atrial flutter, bradycardia, 
extrasystole, Adams-Stokes Syndrome, bundle-branch block, sinoatrial block, long QT
syndrome, parasystole, Lown-Ganong-Levine Syndrome, Mahaim-type pre-excitation 
syndrome, Wolff-Parkinson-White syndrome, sick sinus syndrome, tachycardias, and 
ventricular fibrillation. Tachycardias include paroxysmal tachycardia, supraventricular 
tachycardia, accelerated idioventricular rhythm, atrioventricular nodal reentry tachycardia, 
ectopic atrial tachycardia, ectopic junctional tachycardia, sinoatrial nodal reentry 
tachycardia, sinus tachycardia, Torsades de Pointes, and ventricular tachycardia. 

Heart valve diseases include aortic valve insufficiency, aortic valve stenosis, heart
murmurs, aortic valve prolapse, mitral valve prolapse, tricuspid valve prolapse, mitral
valve insufficiency, mitral valve stenosis, pulmonary atresia, pulmonary valve
insufficiency, pulmonary valve stenosis, tricuspid atresia, tricuspid valve insufficiency, and
tricuspid valve stenosis. 

Myocardial diseases include alcoholic cardiomyopathy, congestive 
cardiomyopathy, hypertrophic cardiomyopathy, aortic subvalvular stenosis, pulmonary 
subvalvular stenosis, restrictive cardiomyopathy, Chagas cardiomyopathy, endocardial 
fibroelastosis, endomyocardial fibrosis, Kearns Syndrome, myocardial reperfusion injury, 
and myocarditis. 

Myocardial ischemias include coronary disease, such as angina pectoris, coronary 
aneurysm, coronary arteriosclerosis, coronary thrombosis, coronary vasospasm, 
myocardial infarction and myocardial stunning. 

Cardiovascular diseases also include vascular diseases such as aneurysms, 
angiodysplasia, angiomatosis, bacillary angiomatosis, Hippel-Lindau Disease,
Klippel-Trenaunay-Weber Syndrome, Sturge-Weber Syndrome, angioneurotic edema, aortic
diseases, Takayasu's Arteritis, aortitis, Leriche's Syndrome, arterial occlusive diseases,
arteritis, enarteritis, polyarteritis nodosa, cerebrovascular disorders, diabetic angiopathies,
diabetic retinopathy, embolisms, thrombosis, erythromelalgia, hemorrhoids, hepatic
veno-occlusive disease, hypertension, hypotension, ischemia, peripheral vascular diseases,
phlebitis, pulmonary veno-occlusive disease, Raynaud's disease, CREST syndrome, retinal
vein occlusion, Scimitar syndrome, superior vena cava syndrome, telangiectasia, ataxia
telangiectasia, hereditary hemorrhagic telangiectasia, varicocele, varicose veins, varicose
ulcer, vasculitis, and venous insufficiency. 

Aneurysms include dissecting aneurysms, false aneurysms, infected aneurysms, 
ruptured aneurysms, aortic aneurysms, cerebral aneurysms, coronary aneurysms, heart 
aneurysms, and iliac aneurysms. 

Arterial occlusive diseases include arteriosclerosis, intermittent claudication, 
carotid stenosis, fibromuscular dysplasias, mesenteric vascular occlusion, Moyamoya 
disease, renal artery obstruction, retinal artery occlusion, and thromboangiitis obliterans. 

Cerebrovascular disorders include carotid artery diseases, cerebral amyloid 
angiopathy, cerebral aneurysm, cerebral anoxia, cerebral arteriosclerosis, cerebral 
arteriovenous malformation, cerebral artery diseases, cerebral embolism and thrombosis, 
carotid artery thrombosis, sinus thrombosis, Wallenberg's syndrome, cerebral hemorrhage, 
epidural hematoma, subdural hematoma, subarachnoid hemorrhage, cerebral infarction,
cerebral ischemia (including transient), subclavian steal syndrome, periventricular 
leukomalacia, vascular headache, cluster headache, migraine, and vertebrobasilar 
insufficiency. 

Embolisms include air embolisms, amniotic fluid embolisms, cholesterol 
embolisms, blue toe syndrome, fat embolisms, pulmonary embolisms, and 
thromboembolisms. Thromboses include coronary thrombosis, hepatic vein thrombosis,
retinal vein occlusion, carotid artery thrombosis, sinus thrombosis, Wallenberg's syndrome, 
and thrombophlebitis. 

Ischemia includes cerebral ischemia, ischemic colitis, compartment syndromes, 
anterior compartment syndrome, myocardial ischemia, reperfusion injuries, and peripheral 
limb ischemia. Vasculitis includes aortitis, arteritis, Behcet's Syndrome, Churg-Strauss 
Syndrome, mucocutaneous lymph node syndrome, thromboangiitis obliterans,
hypersensitivity vasculitis, Schoenlein-Henoch purpura, allergic cutaneous vasculitis, and 
Wegener's granulomatosis. 

Polynucleotides or polypeptides, or agonists or antagonists of the invention, are 
especially effective for the treatment of critical limb ischemia and coronary disease. 

Polypeptides may be administered using any method known in the art, including, 
but not limited to, direct needle injection at the delivery site, intravenous injection, topical 
administration, catheter infusion, biolistic injectors, particle accelerators, gelfoam sponge 
depots, other commercially available depot materials, osmotic pumps, oral or suppositorial 
solid pharmaceutical formulations, decanting or topical applications during surgery, and
aerosol delivery. Such methods are known in the art. Polypeptides of the invention may be
administered as part of a Therapeutic, described in more detail below. Methods of 
delivering polynucleotides of the invention are described in more detail herein. 

Anti-Angiogenesis Activity 

The naturally occurring balance between endogenous stimulators and inhibitors of 
angiogenesis is one in which inhibitory influences predominate. Rastinejad et al., Cell 
56:345-355 (1989). In those rare instances in which neovascularization occurs under 
normal physiological conditions, such as wound healing, organ regeneration, embryonic 
development, and female reproductive processes, angiogenesis is stringently regulated and 
spatially and temporally delimited. Under conditions of pathological angiogenesis such as 
that characterizing solid tumor growth, these regulatory controls fail. Unregulated
angiogenesis becomes pathologic and sustains progression of many neoplastic and non- 
neoplastic diseases. A number of serious diseases are dominated by abnormal 
neovascularization including solid tumor growth and metastases, arthritis, some types of 
eye disorders, and psoriasis. See, e.g., reviews by Moses et al., Biotech. 9:630-634 (1991);
Folkman et al., N. Engl. J. Med. 333:1757-1763 (1995); Auerbach et al., J. Microvasc.
Res. 29:401-411 (1985); Folkman, Advances in Cancer Research, eds. Klein and
Weinhouse, Academic Press, New York, pp. 175-203 (1985); Patz, Am. J. Opthalmol.
94:715-743 (1982); and Folkman et al., Science 221:719-725 (1983). In a number of
pathological conditions, the process of angiogenesis contributes to the disease state. For 
example, significant data have accumulated which suggest that the growth of solid tumors 
is dependent on angiogenesis. Folkman and Klagsbrun, Science 235:442-447 (1987). 

The present invention provides for treatment of diseases or disorders associated 
with neovascularization by administration of the polynucleotides and/or polypeptides of 
the invention, as well as agonists or antagonists of the present invention. Malignant and 
metastatic conditions which can be treated with the polynucleotides and polypeptides, or 
agonists or antagonists of the invention include, but are not limited to, malignancies, solid 
tumors, and cancers described herein and otherwise known in the art (for a review of such 
disorders, see Fishman et al., Medicine, 2d Ed., J. B. Lippincott Co., Philadelphia
(1985)). Thus, the present invention provides a method of treating an angiogenesis-related
disease and/or disorder, comprising administering to an individual in need thereof a 
therapeutically effective amount of a polynucleotide, polypeptide, antagonist and/or 
agonist of the invention. For example, polynucleotides, polypeptides, antagonists and/or 
agonists may be utilized in a variety of additional methods in order to therapeutically treat 
a cancer or tumor. Cancers which may be treated with polynucleotides, polypeptides, 
antagonists and/or agonists include, but are not limited to solid tumors, including prostate, 
lung, breast, ovarian, stomach, pancreas, larynx, esophagus, testes, liver, parotid, biliary 
tract, colon, rectum, cervix, uterus, endometrium, kidney, bladder, thyroid cancer; primary 
tumors and metastases; melanomas; glioblastoma; Kaposi's sarcoma; leiomyosarcoma; 
non-small cell lung cancer; colorectal cancer; advanced malignancies; and blood-borne
tumors such as leukemias. For example, polynucleotides, polypeptides, antagonists and/or 
agonists may be delivered topically, in order to treat cancers such as skin cancer, head and 
neck tumors, breast tumors, and Kaposi's sarcoma. 

Within yet other aspects, polynucleotides, polypeptides, antagonists and/or agonists 
may be utilized to treat superficial forms of bladder cancer by, for example, intravesical 
administration. Polynucleotides, polypeptides, antagonists and/or agonists may be
delivered directly into the tumor, or near the tumor site, via injection or a catheter. Of 
course, as the artisan of ordinary skill will appreciate, the appropriate mode of 
administration will vary according to the cancer to be treated. Other modes of delivery are 
discussed herein. 

Polynucleotides, polypeptides, antagonists and/or agonists may be useful in treating 
other disorders, besides cancers, which involve angiogenesis. These disorders include, but 
are not limited to: benign tumors, for example hemangiomas, acoustic neuromas, 
neurofibromas, trachomas, and pyogenic granulomas; atherosclerotic plaques; ocular
angiogenic diseases, for example, diabetic retinopathy, retinopathy of prematurity, macular
degeneration, corneal graft rejection, neovascular glaucoma, retrolental fibroplasia,
rubeosis, retinoblastoma, uveitis and Pterygia (abnormal blood vessel growth) of the eye;
rheumatoid arthritis; psoriasis; delayed wound healing; endometriosis; vasculogenesis;
granulations; hypertrophic scars (keloids); nonunion fractures; scleroderma; trachoma;
vascular adhesions; myocardial angiogenesis; coronary collaterals; cerebral collaterals;
arteriovenous malformations; ischemic limb angiogenesis; Osler-Weber Syndrome;
plaque neovascularization; telangiectasia; hemophiliac joints; angiofibroma; fibromuscular 
dysplasia; wound granulation; Crohn's disease; and atherosclerosis. 

For example, within one aspect of the present invention methods are provided for 
treating hypertrophic scars and keloids, comprising the step of administering a 
polynucleotide, polypeptide, antagonist and/or agonist of the invention to a hypertrophic 
scar or keloid. 

Within one embodiment of the present invention polynucleotides, polypeptides, 
antagonists and/or agonists are directly injected into a hypertrophic scar or keloid, in order 
to prevent the progression of these lesions. This therapy is of particular value in the 
prophylactic treatment of conditions which are known to result in the development of 
hypertrophic scars and keloids (e.g., burns), and is preferably initiated after the 
proliferative phase has had time to progress (approximately 14 days after the initial injury), 
but before hypertrophic scar or keloid development. As noted above, the present invention
also provides methods for treating neovascular diseases of the eye, including for example, 
corneal neovascularization, neovascular glaucoma, proliferative diabetic retinopathy, 
retrolental fibroplasia and macular degeneration. 

Moreover, ocular disorders associated with neovascularization which can be
treated with the polynucleotides and polypeptides of the present invention (including
agonists and/or antagonists) include, but are not limited to: neovascular glaucoma, diabetic
retinopathy, retinoblastoma, retrolental fibroplasia, uveitis, retinopathy of prematurity,
macular degeneration, corneal graft neovascularization, as well as other eye inflammatory
diseases, ocular tumors and diseases associated with choroidal or iris neovascularization.
See, e.g., reviews by Waltman et al., Am. J. Ophthal. 85:704-710 (1978) and Gartner et
al., Surv. Ophthal. 22:291-312 (1978).

Thus, within one aspect of the present invention methods are provided for treating 
neovascular diseases of the eye such as corneal neovascularization (including corneal graft 
neovascularization), comprising the step of administering to a patient a therapeutically 
effective amount of a compound (as described above) to the cornea, such that the 
formation of blood vessels is inhibited. Briefly, the cornea is a tissue which normally lacks 
blood vessels. In certain pathological conditions however, capillaries may extend into the 
cornea from the pericorneal vascular plexus of the limbus. When the cornea becomes 
vascularized, it also becomes clouded, resulting in a decline in the patient's visual acuity. 
Visual loss may become complete if the cornea completely opacitates. A wide variety of 
disorders can result in corneal neovascularization, including for example, corneal 
infections (e.g., trachoma, herpes simplex keratitis, leishmaniasis and onchocerciasis), 
immunological processes (e.g., graft rejection and Stevens-Johnson's syndrome), alkali 
burns, trauma, inflammation (of any cause), toxic and nutritional deficiency states, and as a 
complication of wearing contact lenses. 

Within particularly preferred embodiments of the invention, the compound may be prepared for
topical administration in saline (combined with any of the preservatives and antimicrobial
agents commonly used in ocular preparations), and administered in eyedrop form. The
solution or suspension may be prepared in its pure form and administered several times
daily. Alternatively, anti-angiogenic compositions, prepared as described above, may also
be administered directly to the cornea. Within preferred embodiments, the anti-angiogenic
composition is prepared with a muco-adhesive polymer which binds to cornea. Within
further embodiments, the anti-angiogenic factors or anti-angiogenic compositions may be
utilized as an adjunct to conventional steroid therapy. Topical therapy may also be useful
prophylactically in corneal lesions which are known to have a high probability of inducing
an angiogenic response (such as chemical burns). In these instances the treatment, likely
in combination with steroids, may be instituted immediately to help prevent subsequent
complications. 

Within other embodiments, the compounds described above may be injected 
directly into the corneal stroma by an ophthalmologist under microscopic guidance. The 
preferred site of injection may vary with the morphology of the individual lesion, but the
goal of the administration would be to place the composition at the advancing front of the
vasculature (i.e., interspersed between the blood vessels and the normal cornea). In most 
cases this would involve perilimbic corneal injection to "protect" the cornea from the 
advancing blood vessels. This method may also be utilized shortly after a corneal insult in 
order to prophylactically prevent corneal neovascularization. In this situation the material
could be injected in the perilimbic cornea interspersed between the corneal lesion and its
undesired potential limbic blood supply. Such methods may also be utilized in a similar 
fashion to prevent capillary invasion of transplanted corneas. In a sustained-release form 
injections might only be required 2-3 times per year. A steroid could also be added to the 
injection solution to reduce inflammation resulting from the injection itself. 

Within another aspect of the present invention, methods are provided for treating
neovascular glaucoma, comprising the step of administering to a patient a therapeutically
effective amount of a polynucleotide, polypeptide, antagonist and/or agonist to the eye,
such that the formation of blood vessels is inhibited. In one embodiment, the compound
may be administered topically to the eye in order to treat early forms of neovascular
glaucoma. Within other embodiments, the compound may be implanted by injection into
the region of the anterior chamber angle. Within other embodiments, the compound may
also be placed in any location such that the compound is continuously released into the
aqueous humor. Within another aspect of the present invention, methods are provided for
treating proliferative diabetic retinopathy, comprising the step of administering to a patient 
a therapeutically effective amount of a polynucleotide, polypeptide, antagonist and/or 
agonist to the eyes, such that the formation of blood vessels is inhibited. 

Within particularly preferred embodiments of the invention, proliferative diabetic
retinopathy may be treated by injection into the aqueous humor or the vitreous, in order to
increase the local concentration of the polynucleotide, polypeptide, antagonist and/or 
agonist in the retina. Preferably, this treatment should be initiated prior to the acquisition 
of severe disease requiring photocoagulation. 

Within another aspect of the present invention, methods are provided for treating
retrolental fibroplasia, comprising the step of administering to a patient a therapeutically
effective amount of a polynucleotide, polypeptide, antagonist and/or agonist to the eye, 
such that the formation of blood vessels is inhibited. The compound may be administered 
topically, via intravitreous injection and/or via intraocular implants. 

Additionally, disorders which can be treated with the polynucleotides,
polypeptides, antagonists and/or agonists include, but are not limited to, hemangioma,
arthritis, psoriasis, angiofibroma, atherosclerotic plaques, delayed wound healing,
granulations, hemophilic joints, hypertrophic scars, nonunion fractures, Osler-Weber
syndrome, pyogenic granuloma, scleroderma, trachoma, and vascular adhesions.

Moreover, disorders and/or states which can be treated with the
polynucleotides, polypeptides, antagonists and/or agonists include, but are not limited to, solid
tumors, blood-borne tumors such as leukemias, tumor metastasis, Kaposi's sarcoma, benign
tumors (for example, hemangiomas, acoustic neuromas, neurofibromas, trachomas, and
pyogenic granulomas), rheumatoid arthritis, psoriasis, ocular angiogenic diseases (for
example, diabetic retinopathy, retinopathy of prematurity, macular degeneration, corneal
graft rejection, neovascular glaucoma, retrolental fibroplasia, rubeosis, retinoblastoma, and
uveitis), delayed wound healing, endometriosis, vasculogenesis, granulations, hypertrophic
scars (keloids), nonunion fractures, scleroderma, trachoma, vascular adhesions, myocardial
angiogenesis, coronary collaterals, cerebral collaterals, arteriovenous malformations,
ischemic limb angiogenesis, Osler-Weber syndrome, plaque neovascularization,
telangiectasia, hemophiliac joints, angiofibroma, fibromuscular dysplasia, wound
granulation, Crohn's disease, atherosclerosis, birth control agent by preventing
vascularization required for embryo implantation, controlling menstruation, diseases that
have angiogenesis as a pathologic consequence such as cat scratch disease (Rochalimaea
quintana), ulcers (Helicobacter pylori), Bartonellosis and bacillary angiomatosis.

In one aspect of the birth control method, an amount of the compound sufficient to 
block embryo implantation is administered before or after intercourse and fertilization have 
occurred, thus providing an effective method of birth control, possibly a "morning after" 
method. Polynucleotides, polypeptides, antagonists and/or agonists may also be used in
controlling menstruation or administered as either a peritoneal lavage fluid or for 
peritoneal implantation in the treatment of endometriosis. 

Polynucleotides, polypeptides, antagonists and/or agonists of the present invention
may be incorporated into surgical sutures in order to prevent stitch granulomas. 

Polynucleotides, polypeptides, antagonists and/or agonists may be utilized in a wide
variety of surgical procedures. For example, within one aspect of the present invention a
composition (in the form of, for example, a spray or film) may be utilized to coat or spray
an area prior to removal of a tumor, in order to isolate normal surrounding tissues from
malignant tissue, and/or to prevent the spread of disease to surrounding tissues. Within
other aspects of the present invention, compositions (e.g., in the form of a spray) may be
delivered via endoscopic procedures in order to coat tumors, or inhibit angiogenesis in a
desired locale. Within yet other aspects of the present invention, surgical meshes which
have been coated with anti-angiogenic compositions of the present invention may be
utilized in any procedure wherein a surgical mesh might be utilized. For example, within
one embodiment of the invention a surgical mesh laden with an anti-angiogenic 
composition may be utilized during abdominal cancer resection surgery (e.g., subsequent 
to colon resection) in order to provide support to the structure, and to release an amount of 
the anti-angiogenic factor. 

Within further aspects of the present invention, methods are provided for treating 
tumor excision sites, comprising administering a polynucleotide, polypeptide, antagonist
and/or agonist to the resection margins of a tumor subsequent to excision, such that the
local recurrence of cancer and the formation of new blood vessels at the site is inhibited. 
Within one embodiment of the invention, the anti-angiogenic compound is administered 
directly to the tumor excision site (e.g., applied by swabbing, brushing or otherwise 
coating the resection margins of the tumor with the anti-angiogenic compound). 
Alternatively, the anti-angiogenic compounds may be incorporated into known surgical 
pastes prior to administration. Within particularly preferred embodiments of the invention, 
the anti-angiogenic compounds are applied after hepatic resections for malignancy, and 
after neurosurgical operations. 

Within one aspect of the present invention, polynucleotides, polypeptides, antagonists
and/or agonists may be administered to the resection margin of a wide variety of tumors,
including for example, breast, colon, brain and hepatic tumors. For example, within one
embodiment of the invention, anti-angiogenic compounds may be administered to the site
of a neurological tumor subsequent to excision, such that the formation of new blood
vessels at the site is inhibited.

The polynucleotides, polypeptides, antagonists and/or agonists of the present invention
may also be administered along with other anti-angiogenic factors. Representative
examples of other anti-angiogenic factors include: Anti-Invasive Factor, retinoic acid and
derivatives thereof, paclitaxel, Suramin, Tissue Inhibitor of Metalloproteinase-1, Tissue
Inhibitor of Metalloproteinase-2, Plasminogen Activator Inhibitor-1, Plasminogen
Activator Inhibitor-2, and various forms of the lighter "d group" transition metals.

Lighter "d group" transition metals include, for example, vanadium, molybdenum,
tungsten, titanium, niobium, and tantalum species. Such transition metal species may form 
transition metal complexes. Suitable complexes of the above-mentioned transition metal 
species include oxo transition metal complexes. 

Representative examples of vanadium complexes include oxo vanadium complexes 
such as vanadate and vanadyl complexes. Suitable vanadate complexes include 
metavanadate and orthovanadate complexes such as, for example, ammonium 
metavanadate, sodium metavanadate, and sodium orthovanadate. Suitable vanadyl 
complexes include, for example, vanadyl acetylacetonate and vanadyl sulfate including 
vanadyl sulfate hydrates such as vanadyl sulfate mono- and trihydrates. 

Representative examples of tungsten and molybdenum complexes also include oxo 
complexes. Suitable oxo tungsten complexes include tungstate and tungsten oxide 
complexes. Suitable tungstate complexes include ammonium tungstate, calcium tungstate, 
sodium tungstate dihydrate, and tungstic acid. Suitable tungsten oxides include tungsten 
(IV) oxide and tungsten (VI) oxide. Suitable oxo molybdenum complexes include 
molybdate, molybdenum oxide, and molybdenyl complexes. Suitable molybdate 
complexes include ammonium molybdate and its hydrates, sodium molybdate and its 
hydrates, and potassium molybdate and its hydrates. Suitable molybdenum oxides include 
molybdenum (VI) oxide and molybdic acid. Suitable
molybdenyl complexes include, for example, molybdenyl acetylacetonate. Other suitable 
tungsten and molybdenum complexes include hydroxo derivatives derived from, for 
example, glycerol, tartaric acid, and sugars. 

A wide variety of other anti-angiogenic factors may also be utilized within the 
context of the present invention. Representative examples include platelet factor 4; 
protamine sulphate; sulphated chitin derivatives (prepared from queen crab shells), 
(Murata et al., Cancer Res. 51:22-26, 1991); Sulphated Polysaccharide Peptidoglycan 
Complex (SP-PG) (the function of this compound may be enhanced by the presence of
steroids such as estrogen, and tamoxifen citrate); Staurosporine; modulators of matrix
metabolism, including for example, proline analogs, cishydroxyproline,
d,L-3,4-dehydroproline, Thiaproline, alpha,alpha-dipyridyl, aminopropionitrile fumarate;
4-propyl-5-(4-pyridinyl)-2(3H)-oxazolone; Methotrexate; Mitoxantrone; Heparin; Interferons; 2
Macroglobulin-serum; ChIMP-3 (Pavloff et al., J. Biol. Chem. 267:17321-17326, 1992);
Chymostatin (Tomkinson et al., Biochem J. 286:475-480, 1992); Cyclodextrin
Tetradecasulfate; Eponemycin; Camptothecin; Fumagillin (Ingber et al., Nature
348:555-557, 1990); Gold Sodium Thiomalate ("GST"; Matsubara and Ziff, J. Clin. Invest.
79:1440-1446, 1987); anticollagenase-serum; alpha2-antiplasmin (Holmes et al., J. Biol.
Chem. 262(4):1659-1664, 1987); Bisantrene (National Cancer Institute); Lobenzarit
disodium (N-(2)-carboxyphenyl-4-chloroanthronilic acid disodium or "CCA"; Takeuchi et
al., Agents Actions 36:312-316, 1992); Thalidomide; Angiostatic steroid; AGM-1470;
carboxynaminolmidazole; and metalloproteinase inhibitors such as BB94. 

Diseases at the Cellular Level 

Diseases associated with increased cell survival or the inhibition of apoptosis that
could be treated or detected by the polynucleotides or polypeptides and/or antagonists or
agonists of the invention, include cancers (such as follicular lymphomas, carcinomas with
p53 mutations, and hormone-dependent tumors, including, but not limited to colon cancer,
cardiac tumors, pancreatic cancer, melanoma, retinoblastoma, glioblastoma, lung cancer,
intestinal cancer, testicular cancer, stomach cancer, neuroblastoma, myxoma, myoma,
lymphoma, endothelioma, osteoblastoma, osteoclastoma, osteosarcoma, chondrosarcoma,
adenoma, breast cancer, prostate cancer, Kaposi's sarcoma and ovarian cancer);
autoimmune disorders (such as, multiple sclerosis, Sjogren's syndrome, Hashimoto's
thyroiditis, biliary cirrhosis, Behcet's disease, Crohn's disease, polymyositis, systemic
lupus erythematosus and immune-related glomerulonephritis and rheumatoid arthritis) and
viral infections (such as herpes viruses, pox viruses and adenoviruses), inflammation, graft
v. host disease, acute graft rejection, and chronic graft rejection. In preferred
embodiments, the polynucleotides or polypeptides, and/or agonists or antagonists of the
invention are used to inhibit growth, progression, and/or metastasis of cancers, in particular
those listed above.

Additional diseases or conditions associated with increased cell survival that could 
be treated or detected by the polynucleotides or polypeptides, or agonists or antagonists of 
the invention, include, but are not limited to, progression, and/or metastases of
malignancies and related disorders such as leukemia (including acute leukemias (e.g.,
acute lymphocytic leukemia, acute myelocytic leukemia (including myeloblastic,
promyelocytic, myelomonocytic, monocytic, and erythroleukemia)) and chronic leukemias
(e.g., chronic myelocytic (granulocytic) leukemia and chronic lymphocytic leukemia)),
polycythemia vera, lymphomas (e.g., Hodgkin's disease and non-Hodgkin's disease),
multiple myeloma, Waldenstrom's macroglobulinemia, heavy chain disease, and solid
tumors including, but not limited to, sarcomas and carcinomas such as fibrosarcoma,
myxosarcoma, liposarcoma, chondrosarcoma, osteogenic sarcoma, chordoma,
angiosarcoma, endotheliosarcoma, lymphangiosarcoma, lymphangioendotheliosarcoma,
synovioma, mesothelioma, Ewing's tumor, leiomyosarcoma, rhabdomyosarcoma, colon
carcinoma, pancreatic cancer, breast cancer, ovarian cancer, prostate cancer, squamous cell
carcinoma, basal cell carcinoma, adenocarcinoma, sweat gland carcinoma, sebaceous gland
carcinoma, papillary carcinoma, papillary adenocarcinomas, cystadenocarcinoma,
medullary carcinoma, bronchogenic carcinoma, renal cell carcinoma, hepatoma, bile duct
carcinoma, choriocarcinoma, seminoma, embryonal carcinoma, Wilms' tumor, cervical
cancer, testicular tumor, lung carcinoma, small cell lung carcinoma, bladder carcinoma,
epithelial carcinoma, glioma, astrocytoma, medulloblastoma, craniopharyngioma,
ependymoma, pinealoma, hemangioblastoma, acoustic neuroma, oligodendroglioma,
meningioma, melanoma, neuroblastoma, and retinoblastoma.

Diseases associated with increased apoptosis that could be treated or detected by
the polynucleotides or polypeptides, and/or agonists or antagonists of the invention,
include AIDS; neurodegenerative disorders (such as Alzheimer's disease, Parkinson's
disease, Amyotrophic lateral sclerosis, Retinitis pigmentosa, Cerebellar degeneration and
brain tumor or prior associated disease); autoimmune disorders (such as, multiple sclerosis,
Sjogren's syndrome, Hashimoto's thyroiditis, biliary cirrhosis, Behcet's disease, Crohn's
disease, polymyositis, systemic lupus erythematosus and immune-related
glomerulonephritis and rheumatoid arthritis); myelodysplastic syndromes (such as aplastic
anemia), graft v. host disease, ischemic injury (such as that caused by myocardial
infarction, stroke and reperfusion injury), liver injury (e.g., hepatitis related liver injury,
ischemia/reperfusion injury, cholestasis (bile duct injury) and liver cancer); toxin-induced
liver disease (such as that caused by alcohol), septic shock, cachexia and anorexia.

Wound Healing and Epithelial Cell Proliferation

In accordance with yet a further aspect of the present invention, there is provided a 
process for utilizing the polynucleotides or polypeptides, and/or agonists or antagonists of 
the invention, for therapeutic purposes, for example, to stimulate epithelial cell 
proliferation and basal keratinocytes for the purpose of wound healing, and to stimulate
hair follicle production and healing of dermal wounds. Polynucleotides or polypeptides, as
well as agonists or antagonists of the invention, may be clinically useful in stimulating
wound healing including surgical wounds, excisional wounds, deep wounds involving
damage of the dermis and epidermis, eye tissue wounds, dental tissue wounds, oral cavity
wounds, diabetic ulcers, dermal ulcers, decubitus ulcers, arterial ulcers, venous stasis ulcers,
burns resulting from heat exposure or chemicals, and other abnormal wound healing
conditions such as uremia, malnutrition, vitamin deficiencies and complications associated
with systemic treatment with steroids, radiation therapy and antineoplastic drugs and
antimetabolites. Polynucleotides or polypeptides, and/or agonists or antagonists of the
invention, could be used to promote dermal reestablishment subsequent to dermal loss.

The polynucleotides or polypeptides, and/or agonists or antagonists of the
invention, could be used to increase the adherence of skin grafts to a wound bed and to
stimulate re-epithelialization from the wound bed. The following is a non-exhaustive list
of grafts that polynucleotides or polypeptides, agonists or antagonists of the invention,
could be used to increase adherence to a wound bed: autografts, artificial skin, allografts,
autodermic graft, autoepidermic grafts, avascular grafts, Blair-Brown grafts, bone graft,
brephoplastic grafts, cutis graft, delayed graft, dermic graft, epidermic graft, fascia graft,
full thickness graft, heterologous graft, xenograft, homologous graft, hyperplastic graft,
lamellar graft, mesh graft, mucosal graft, Ollier-Thiersch graft, omental graft, patch graft,
pedicle graft, penetrating graft, split skin graft, thick split graft. The polynucleotides or
polypeptides, and/or agonists or antagonists of the invention, can be used to promote skin
strength and to improve the appearance of aged skin.

It is believed that the polynucleotides or polypeptides, and/or agonists or 
antagonists of the invention, will also produce changes in hepatocyte proliferation, and 
epithelial cell proliferation in the lung, breast, pancreas, stomach, small intestine, and large
intestine. The polynucleotides or polypeptides, and/or agonists or antagonists of the 
invention, could promote proliferation of epithelial cells such as sebocytes, hair follicles, 
hepatocytes, type II pneumocytes, mucin-producing goblet cells, and other epithelial cells 
and their progenitors contained within the skin, lung, liver, and gastrointestinal tract. The 
polynucleotides or polypeptides, and/or agonists or antagonists of the invention, may 
promote proliferation of endothelial cells, keratinocytes, and basal keratinocytes. 

The polynucleotides or polypeptides, and/or agonists or antagonists of the 
invention, could also be used to reduce the side effects of gut toxicity that result from 
radiation, chemotherapy treatments or viral infections. The polynucleotides or 
polypeptides, and/or agonists or antagonists of the invention, may have a cytoprotective 
effect on the small intestine mucosa. The polynucleotides or polypeptides, and/or agonists 
or antagonists of the invention, may also stimulate healing of mucositis (mouth ulcers) that 
result from chemotherapy and viral infections. 

The polynucleotides or polypeptides, and/or agonists or antagonists of the 
invention, could further be used in full regeneration of skin in full and partial thickness
skin defects, including burns (i.e., repopulation of hair follicles, sweat glands, and
sebaceous glands), and in treatment of other skin defects such as psoriasis. The polynucleotides or
polypeptides, and/or agonists or antagonists of the invention, could be used to treat
epidermolysis bullosa, a defect in adherence of the epidermis to the underlying dermis
which results in frequent, open and painful blisters, by accelerating reepithelialization of
these lesions. The polynucleotides or polypeptides, and/or agonists or antagonists of the
invention, could also be used to treat gastric and duodenal ulcers and help heal by scar
formation of the mucosal lining and regeneration of glandular mucosa and duodenal
mucosal lining more rapidly. Inflammatory bowel diseases, such as Crohn's disease and
ulcerative colitis, are diseases which result in destruction of the mucosal surface of the
small or large intestine, respectively. Thus, the polynucleotides or polypeptides, and/or
agonists or antagonists of the invention, could be used to promote the resurfacing of the
mucosal surface to aid more rapid healing and to prevent progression of inflammatory
bowel disease. Treatment with the polynucleotides or polypeptides, and/or agonists or
antagonists of the invention, is expected to have a significant effect on the production of
mucus throughout the gastrointestinal tract and could be used to protect the intestinal
mucosa from injurious substances that are ingested or following surgery. The
polynucleotides or polypeptides, and/or agonists or antagonists of the invention, could be
used to treat diseases associated with the under expression of the polynucleotides of the
invention.

Moreover, the polynucleotides or polypeptides, and/or agonists or antagonists of the 
invention, could be used to prevent and heal damage to the lungs due to various 
pathological states. A growth factor such as the polynucleotides or polypeptides, and/or
agonists or antagonists of the invention, could stimulate proliferation and
differentiation and promote the repair of alveoli and bronchiolar epithelium to prevent or
treat acute or chronic lung damage. For example, emphysema, which results in the
progressive loss of alveoli, and inhalation injuries, i.e., resulting from smoke inhalation and
burns, that cause necrosis of the bronchiolar epithelium and alveoli could be effectively
treated using the polynucleotides or polypeptides, and/or agonists or antagonists of the
invention. Also, the polynucleotides or polypeptides, and/or agonists or antagonists of the
invention, could be used to stimulate the proliferation and differentiation of type II
pneumocytes, which may help treat or prevent diseases such as hyaline membrane diseases,
such as infant respiratory distress syndrome and bronchopulmonary dysplasia, in premature
infants. 

The polynucleotides or polypeptides, and/or agonists or antagonists of the
invention, could stimulate the proliferation and differentiation of hepatocytes and, thus,
could be used to alleviate or treat liver diseases and pathologies such as fulminant liver
failure caused by cirrhosis, liver damage caused by viral hepatitis and toxic substances
(i.e., acetaminophen, carbon tetrachloride and other hepatotoxins known in the art).

In addition, the polynucleotides or polypeptides, and/or agonists or antagonists of
the invention, could be used to treat or prevent the onset of diabetes mellitus. In patients with
newly diagnosed Types I and II diabetes, where some islet cell function remains, the
polynucleotides or polypeptides, and/or agonists or antagonists of the invention, could be
used to maintain the islet function so as to alleviate, delay or prevent permanent
manifestation of the disease. Also, the polynucleotides or polypeptides, and/or agonists or
antagonists of the invention, could be used as an auxiliary in islet cell transplantation to 
improve or promote islet cell function. 

Infectious Disease 

A polypeptide or polynucleotide and/or agonist or antagonist of the present
invention can be used to treat or detect infectious agents. For example, by increasing the
immune response, particularly increasing the proliferation and differentiation of B and/or T 
cells, infectious diseases may be treated. The immune response may be increased by either 
enhancing an existing immune response, or by initiating a new immune response.
Alternatively, a polypeptide or polynucleotide and/or agonist or antagonist of the present
invention may also directly inhibit the infectious agent, without necessarily eliciting an 
immune response.

Viruses are one example of an infectious agent that can cause disease or symptoms
that can be treated or detected by a polynucleotide or polypeptide and/or agonist or
antagonist of the present invention. Examples of viruses include, but are not limited to,
the following DNA and RNA viruses
and viral families: Arbovirus, Adenoviridae, Arenaviridae, Arterivirus, Birnaviridae,
Bunyaviridae, Caliciviridae, Circoviridae, Coronaviridae, Dengue, EBV, HIV,
Flaviviridae, Hepadnaviridae (Hepatitis), Herpesviridae (such as Cytomegalovirus, Herpes
Simplex, Herpes Zoster), Mononegavirus (e.g., Paramyxoviridae, Morbillivirus,
Rhabdoviridae), Orthomyxoviridae (e.g., Influenza A, Influenza B, and parainfluenza),
Papilloma virus, Papovaviridae, Parvoviridae, Picornaviridae, Poxviridae (such as
Smallpox or Vaccinia), Reoviridae (e.g., Rotavirus), Retroviridae (HTLV-I, HTLV-II,
Lentivirus), and Togaviridae (e.g., Rubivirus). Viruses falling within these families can
cause a variety of diseases or symptoms, including, but not limited to: arthritis,
bronchiolitis, respiratory syncytial virus, encephalitis, eye infections (e.g., conjunctivitis,
keratitis), chronic fatigue syndrome, hepatitis (A, B, C, E, Chronic Active, Delta),
Japanese B encephalitis, Junin, Chikungunya, Rift Valley fever, yellow fever, meningitis,
opportunistic infections (e.g., AIDS), pneumonia, Burkitt's Lymphoma, chickenpox,
hemorrhagic fever, Measles, Mumps, Parainfluenza, Rabies, the common cold, Polio,
leukemia, Rubella, sexually transmitted diseases, skin diseases (e.g., Kaposi's, warts), and
viremia. Polynucleotides or polypeptides, or agonists or antagonists of the invention, can
be used to treat or detect any of these symptoms or diseases. In specific embodiments,
polynucleotides, polypeptides, or agonists or antagonists of the invention are used to treat:
meningitis, Dengue, EBV, and/or hepatitis (e.g., hepatitis B). In an additional specific
embodiment, polynucleotides, polypeptides, or agonists or antagonists of the invention are
used to treat patients nonresponsive to one or more other commercially available hepatitis
vaccines. In a further specific embodiment, polynucleotides, polypeptides, or agonists or
antagonists of the invention are used to treat AIDS.

Similarly, bacterial or fungal agents that can cause disease or symptoms and that
can be treated or detected by a polynucleotide or polypeptide and/or agonist or antagonist
of the present invention include, but are not limited to, the
following Gram-negative and Gram-positive bacteria and bacterial families and fungi:
Actinomycetales (e.g., Corynebacterium, Mycobacterium, Nocardia), Cryptococcus
neoformans, Aspergillosis, Bacillaceae (e.g., Anthrax, Clostridium), Bacteroidaceae,
Blastomycosis, Bordetella, Borrelia (e.g., Borrelia burgdorferi), Brucellosis, Candidiasis,
Campylobacter, Coccidioidomycosis, Cryptococcosis, Dermatomycoses, E. coli (e.g.,
Enterotoxigenic E. coli and Enterohemorrhagic E. coli), Enterobacteriaceae (Klebsiella,
Salmonella (e.g., Salmonella typhi, and Salmonella paratyphi), Serratia, Yersinia),
Erysipelothrix, Helicobacter, Legionellosis, Leptospirosis, Listeria, Mycoplasmatales,
Mycobacterium leprae, Vibrio cholerae, Neisseriaceae (e.g., Acinetobacter, Gonorrhea,
Meningococcal), Neisseria meningitidis, Pasteurellaceae Infections (e.g., Actinobacillus,
Haemophilus (e.g., Haemophilus influenza type B), Pasteurella), Pseudomonas,
Rickettsiaceae, Chlamydiaceae, Syphilis, Shigella spp., Staphylococcal, Meningococcal,
Pneumococcal and Streptococcal (e.g., Streptococcus pneumoniae and Group B
Streptococcus). These bacterial or fungal families can cause the following diseases or
symptoms, including, but not limited to: bacteremia, endocarditis, eye infections
(conjunctivitis, tuberculosis, uveitis), gingivitis, opportunistic infections (e.g., AIDS
related infections), paronychia, prosthesis-related infections, Reiter's Disease, respiratory
tract infections, such as Whooping Cough or Empyema, sepsis, Lyme Disease, Cat-Scratch
Disease, Dysentery, Paratyphoid Fever, food poisoning, Typhoid, pneumonia, Gonorrhea,
meningitis (e.g., meningitis types A and B), Chlamydia, Syphilis, Diphtheria, Leprosy,
Paratuberculosis, Tuberculosis, Lupus, Botulism, gangrene, tetanus, impetigo, Rheumatic
Fever, Scarlet Fever, sexually transmitted diseases, skin diseases (e.g., cellulitis,
dermatomycoses), toxemia, urinary tract infections, wound infections. Polynucleotides or
polypeptides, agonists or antagonists of the invention, can be used to treat or detect any of
these symptoms or diseases. In specific embodiments, polynucleotides, polypeptides,
agonists or antagonists of the invention are used to treat: tetanus, Diphtheria, botulism,
and/or meningitis type B.

Moreover, parasitic agents causing disease or symptoms that can be treated or 
detected by a polynucleotide or polypeptide and/or agonist or antagonist of the present 
invention include, but are not limited to, the following families or class: Amebiasis,
Babesiosis, Coccidiosis, Cryptosporidiosis, Dientamoebiasis, Dourine, Ectoparasitic,
Giardiasis, Helminthiasis, Leishmaniasis, Theileriasis, Toxoplasmosis, Trypanosomiasis,
and Trichomonas and Sporozoans (e.g., Plasmodium vivax, Plasmodium falciparum,
Plasmodium malariae and Plasmodium ovale). These parasites can cause a variety of
diseases or symptoms, including, but not limited to: Scabies, Trombiculiasis, eye
infections, intestinal disease (e.g., dysentery, giardiasis), liver disease, lung disease,
opportunistic infections (e.g., AIDS related), malaria, pregnancy complications, and
toxoplasmosis. Polynucleotides or polypeptides, or agonists or antagonists of the invention,
can be used to treat or detect any of these symptoms or diseases. In specific embodiments, 
polynucleotides, polypeptides, or agonists or antagonists of the invention are used to treat 
malaria. 

Preferably, treatment using a polypeptide or polynucleotide and/or agonist or 
antagonist of the present invention could either be by administering an effective amount of 
a polypeptide to the patient, or by removing cells from the patient, supplying the cells with 
a polynucleotide of the present invention, and returning the engineered cells to the patient 
(ex vivo therapy). Moreover, the polypeptide or polynucleotide of the present invention 
can be used as an antigen in a vaccine to raise an immune response against infectious 
disease. 

Regeneration 

A polynucleotide or polypeptide and/or agonist or antagonist of the present 
invention can be used to differentiate, proliferate, and attract cells, leading to the 
regeneration of tissues. (See, Science 276:59-87 (1997).) The regeneration of tissues
could be used to repair, replace, or protect tissue damaged by congenital defects, trauma
(wounds, burns, incisions, or ulcers), age, disease (e.g. osteoporosis, osteoarthritis,
periodontal disease, liver failure), surgery, including cosmetic plastic surgery, fibrosis, 
reperfusion injury, or systemic cytokine damage. 

Tissues that could be regenerated using the present invention include organs (e.g., 
pancreas, liver, intestine, kidney, skin, endothelium), muscle (smooth, skeletal or cardiac), 
vasculature (including vascular and lymphatics), nervous, hematopoietic, and skeletal 
(bone, cartilage, tendon, and ligament) tissue. Preferably, regeneration occurs without or
with decreased scarring. Regeneration also may include angiogenesis.

Moreover, a polynucleotide or polypeptide and/or agonist or antagonist of the 
present invention may increase regeneration of tissues difficult to heal. For example, 
increased tendon/ligament regeneration would quicken recovery time after damage. A 
polynucleotide or polypeptide and/or agonist or antagonist of the present invention could 
also be used prophylactically in an effort to avoid damage. Specific diseases that could be 
treated include tendinitis, carpal tunnel syndrome, and other tendon or ligament defects.
A further example of tissue regeneration of non-healing wounds includes pressure ulcers, 
ulcers associated with vascular insufficiency, surgical, and traumatic wounds. 

Similarly, nerve and brain tissue could also be regenerated by using a 
polynucleotide or polypeptide and/or agonist or antagonist of the present invention to 
proliferate and differentiate nerve cells. Diseases that could be treated using this method 
include central and peripheral nervous system diseases, neuropathies, or mechanical and 
traumatic disorders (e.g., spinal cord disorders, head trauma, cerebrovascular disease, and 
stroke). Specifically, diseases associated with peripheral nerve injuries, peripheral
neuropathy (e.g., resulting from chemotherapy or other medical therapies), localized 
neuropathies, and central nervous system diseases (e.g., Alzheimer's disease, Parkinson's 
disease, Huntington's disease, amyotrophic lateral sclerosis, and Shy-Drager syndrome), 
could all be treated using the polynucleotide or polypeptide and/or agonist or antagonist of 
the present invention.

Chemotaxis

A polynucleotide or polypeptide and/or agonist or antagonist of the present 
invention may have chemotaxis activity. A chemotactic molecule attracts or mobilizes 
cells (e.g., monocytes, fibroblasts, neutrophils, T-cells, mast cells, eosinophils, epithelial 
and/or endothelial cells) to a particular site in the body, such as a site of inflammation, 
infection, or hyperproliferation. The mobilized cells can then fight off and/or heal the 
particular trauma or abnormality. 

A polynucleotide or polypeptide and/or agonist or antagonist of the present 
invention may increase chemotactic activity of particular cells. These chemotactic 
molecules can then be used to treat inflammation, infection, hyperproliferative disorders, 
or any immune system disorder by increasing the number of cells targeted to a particular 
location in the body. For example, chemotactic molecules can be used to treat wounds and 
other trauma to tissues by attracting immune cells to the injured location. Chemotactic 
molecules of the present invention can also attract fibroblasts, which can be used to treat 
wounds. 

It is also contemplated that a polynucleotide or polypeptide and/or agonist or 
antagonist of the present invention may inhibit chemotactic activity. These molecules 
could also be used to treat disorders. Thus, a polynucleotide or polypeptide and/or agonist 
or antagonist of the present invention could be used as an inhibitor of chemotaxis. 

Binding Activity 

A polypeptide of the present invention may be used to screen for molecules that 
bind to the polypeptide or for molecules to which the polypeptide binds. The binding of 
the polypeptide and the molecule may activate (agonist), increase, inhibit (antagonist), or 
decrease activity of the polypeptide or the molecule bound. Examples of such molecules 
include antibodies, oligonucleotides, proteins (e.g., receptors), or small molecules. 

Preferably, the molecule is closely related to the natural ligand of the polypeptide, 
e.g., a fragment of the ligand, or a natural substrate, a ligand, or a structural or functional 
mimetic. (See, Coligan et al., Current Protocols in Immunology 1(2): Chapter 5 (1991).) 
Similarly, the molecule can be closely related to the natural receptor to which the 
polypeptide binds, or at least, a fragment of the receptor capable of being bound by the 
polypeptide (e.g., active site). In either case, the molecule can be rationally designed using 
known techniques. 

Preferably, the screening for these molecules involves producing appropriate cells 
which express the polypeptide, either as a secreted protein or on the cell membrane. 
Preferred cells include cells from mammals, yeast, Drosophila, or E. coli. Cells expressing 
the polypeptide (or cell membrane containing the expressed polypeptide) are then 
preferably contacted with a test compound potentially containing the molecule to observe 
binding, stimulation, or inhibition of activity of either the polypeptide or the molecule. 

The assay may simply test binding of a candidate compound to the polypeptide, 
wherein binding is detected by a label, or in an assay involving competition with a labeled 
competitor. Further, the assay may test whether the candidate compound results in a signal 
generated by binding to the polypeptide. 

Alternatively, the assay can be carried out using cell-free preparations, 
polypeptide/molecule affixed to a solid support, chemical libraries, or natural product 
mixtures. The assay may also simply comprise the steps of mixing a candidate compound 
with a solution containing a polypeptide, measuring polypeptide/molecule activity or 
binding, and comparing the polypeptide/molecule activity or binding to a standard. 

Preferably, an ELISA assay can measure polypeptide level or activity in a sample 
(e.g., biological sample) using a monoclonal or polyclonal antibody. The antibody can 
measure polypeptide level or activity by either binding, directly or indirectly, to the 
polypeptide or by competing with the polypeptide for a substrate. 

Additionally, the receptor to which a polypeptide of the invention binds can be 
identified by numerous methods known to those of skill in the art, for example, ligand 
panning and FACS sorting (Coligan, et al., Current Protocols in Immun., 1(2), Chapter 5, 
(1991)). For example, expression cloning is employed wherein polyadenylated RNA is 
prepared from a cell responsive to the polypeptides, for example, NIH3T3 cells, which are 
known to contain multiple receptors for the FGF family proteins, and SC-3 cells. A 
cDNA library created from this RNA is divided into pools and used to transfect COS cells 
or other cells that are not responsive to the polypeptides. Transfected cells, which are 
grown on glass slides, are exposed to the polypeptide of the present invention after the 
polypeptide has been labeled. The polypeptides can be labeled by a variety of means 
including iodination or inclusion of a recognition site for a site-specific protein kinase. 

Following fixation and incubation, the slides are subjected to autoradiographic 
analysis. Positive pools are identified and sub-pools are prepared and re-transfected using 
an iterative sub-pooling and re-screening process, eventually yielding a single clone that 
encodes the putative receptor. 
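
The iterative sub-pooling and re-screening logic described above can be illustrated with a minimal sketch. It is not part of the disclosed method: the function names, the pool count, and the `pool_is_positive` callback (which stands in for transfection, exposure to labeled polypeptide, and autoradiography of one pool) are hypothetical and chosen only for illustration.

```python
from typing import Callable, List, Optional, Sequence


def split_into_pools(clones: Sequence[str], n_pools: int) -> List[List[str]]:
    """Divide clones into at most n_pools non-empty pools of similar size."""
    n_pools = min(n_pools, len(clones))
    return [list(clones[i::n_pools]) for i in range(n_pools)]


def find_positive_clone(
    clones: Sequence[str],
    pool_is_positive: Callable[[List[str]], bool],
    n_pools: int = 4,  # hypothetical pool count per screening round
) -> Optional[str]:
    """Repeatedly sub-pool the first positive pool until one clone remains.

    pool_is_positive is an assumed stand-in for the wet-lab readout of a
    transfected pool; it is not an API of any real library.
    """
    current = list(clones)
    while len(current) > 1:
        positives = [
            pool for pool in split_into_pools(current, n_pools)
            if pool_is_positive(pool)
        ]
        if not positives:
            return None  # no pool gave a signal in this round
        current = positives[0]
    # Confirm the single remaining clone on its own before reporting it.
    if current and pool_is_positive(current):
        return current[0]
    return None
```

With a library of N clones and k pools per round, roughly log base k of N screening rounds are needed under these assumptions, which is why pooling is preferred to screening each clone individually.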

As an alternative approach for receptor identification, the labeled polypeptides can 
be photoaffinity linked with cell membrane or extract preparations that express the 
receptor molecule. Cross-linked material is resolved by PAGE analysis and exposed to 
X-ray film. The labeled complex containing the receptors of the polypeptides can be excised, 
resolved into peptide fragments, and subjected to protein microsequencing. The amino 
acid sequence obtained from microsequencing would be used to design a set of degenerate 
oligonucleotide probes to screen a cDNA library to identify the genes encoding the 
putative receptors. 
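
As an illustration of the probe-design step above, the following sketch estimates how degenerate an oligonucleotide probe set would be for a given stretch of microsequenced peptide and picks the least degenerate window. It assumes the standard genetic code; the function names, the window length, and the example peptide are hypothetical and not taken from this disclosure.

```python
from math import prod
from typing import Tuple

# Number of codons per amino acid in the standard genetic code
# (one-letter amino acid codes; stop codons are not included).
CODON_COUNT = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}


def probe_degeneracy(peptide: str) -> int:
    """Number of distinct DNA sequences that could encode the peptide."""
    return prod(CODON_COUNT[aa] for aa in peptide.upper())


def least_degenerate_window(peptide: str, window: int = 6) -> Tuple[int, str, int]:
    """Return (start index, sub-peptide, degeneracy) of the best window.

    The window length (in amino acids) is an illustrative parameter.
    Ties are resolved in favor of the window closest to the N-terminus.
    """
    peptide = peptide.upper()
    if window < 1 or window > len(peptide):
        raise ValueError("window must be between 1 and the peptide length")
    candidates = [
        (probe_degeneracy(peptide[i:i + window]), i)
        for i in range(len(peptide) - window + 1)
    ]
    degeneracy, start = min(candidates)
    return start, peptide[start:start + window], degeneracy
```

For the hypothetical peptide `LSRMWKDL` and a three-residue window, the stretch `MWK` is selected because methionine and tryptophan each have a single codon, so the corresponding probe set has only two members.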

Moreover, the techniques of gene-shuffling, motif-shuffling, exon-shuffling, and/or 
codon-shuffling (collectively referred to as "DNA shuffling") may be employed to 
modulate the activities of polypeptides of the invention, thereby effectively generating 
agonists and antagonists of polypeptides of the invention. See generally, U.S. Patent Nos. 
5,605,793, 5,811,238, 5,830,721, 5,834,252, and 5,837,458, and Patten, P. A., et al., Curr. 
Opinion Biotechnol. 8:724-33 (1997); Harayama, S. Trends Biotechnol. 16(2):76-82 
(1998); Hansson, L. O., et al., J. Mol. Biol. 287:265-76 (1999); and Lorenzo, M. M. and 
Blasco, R. Biotechniques 24(2):308-13 (1998) (each of these patents and publications is 
hereby incorporated by reference). In one embodiment, alteration of polynucleotides and 
corresponding polypeptides of the invention may be achieved by DNA shuffling. DNA 
shuffling involves the assembly of two or more DNA segments into a desired 
polynucleotide molecule of the invention by homologous, or site-specific, 
recombination. In another embodiment, polynucleotides and corresponding polypeptides 
of the invention may be altered by being subjected to random mutagenesis by error-prone 
PCR, random nucleotide insertion or other methods prior to recombination. In another 
embodiment, one or more components, motifs, sections, parts, domains, fragments, etc., of 
the polypeptides of the invention may be recombined with one or more components, 
motifs, sections, parts, domains, fragments, etc. of one or more heterologous molecules. In 
preferred embodiments, the heterologous molecules are family members. In further 
preferred embodiments, the heterologous molecule is a growth factor such as, for example, 
platelet-derived growth factor (PDGF), insulin-like growth factor (IGF-I), transforming 
growth factor (TGF)-alpha, epidermal growth factor (EGF), fibroblast growth factor 
(FGF), TGF-beta, bone morphogenetic protein (BMP)-2, BMP-4, BMP-5, BMP-6, BMP-7, 
activins A and B, decapentaplegic (dpp), 60A, OP-2, dorsalin, growth differentiation 
factors (GDFs), nodal, MIS, inhibin-alpha, TGF-beta1, TGF-beta2, TGF-beta3, 
TGF-beta5, and glial-derived neurotrophic factor (GDNF). 

Other preferred fragments are biologically active fragments of the polypeptides of 
the invention. Biologically active fragments are those exhibiting activity similar, but not 
necessarily identical, to an activity of the polypeptide. The biological activity of the 
fragments may include an improved desired activity, or a decreased undesirable activity. 

Additionally, this invention provides a method of screening compounds to identify 
those which modulate the action of the polypeptide of the present invention. An example 
of such an assay comprises combining a mammalian fibroblast cell, the polypeptide of 
the present invention, the compound to be screened and [3H]thymidine under cell culture 
conditions where the fibroblast cell would normally proliferate. A control assay may be 
performed in the absence of the compound to be screened and compared to the amount of 
fibroblast proliferation in the presence of the compound to determine if the compound 
stimulates proliferation by determining the uptake of [3H]thymidine in each case. The 
amount of fibroblast cell proliferation is measured by liquid scintillation counting, 
which measures the incorporation of [3H]thymidine. Both agonist and antagonist 
compounds may be identified by this procedure. 
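
The comparison between the control assay and the assay run in the presence of the compound can be sketched as follows. This is only an illustrative calculation: the function name, the use of replicate means, and the 20% cutoff are assumptions made for the example and are not values taught by this disclosure.

```python
from statistics import mean
from typing import Sequence


def classify_compound(
    control_cpm: Sequence[float],
    treated_cpm: Sequence[float],
    threshold: float = 0.2,  # hypothetical cutoff: 20% change vs. control
) -> str:
    """Classify a compound from thymidine incorporation counts.

    control_cpm: counts per minute from replicate wells without compound.
    treated_cpm: counts per minute from replicate wells with compound.
    Returns "agonist", "antagonist", or "no effect".
    """
    control = mean(control_cpm)
    treated = mean(treated_cpm)
    if control <= 0:
        raise ValueError("control incorporation must be positive")
    relative_change = (treated - control) / control
    if relative_change > threshold:
        return "agonist"      # compound increased proliferation
    if relative_change < -threshold:
        return "antagonist"   # compound decreased proliferation
    return "no effect"
```

In practice a statistical test across replicates would likely replace the fixed cutoff; the sketch only shows the direction of the comparison.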

In another method, a mammalian cell or membrane preparation expressing a 
receptor for a polypeptide of the present invention is incubated with a labeled polypeptide 
of the present invention in the presence of the compound. The ability of the compound to 
enhance or block this interaction could then be measured. Alternatively, the response of a 
known second messenger system following interaction of a compound to be screened and 
the receptor is measured and the ability of the compound to bind to the receptor and elicit a 
second messenger response is measured to determine if the compound is a potential agonist 
or antagonist. Such second messenger systems include, but are not limited to, cAMP, 
guanylate cyclase, ion channels or phosphoinositide hydrolysis. 

All of the above assays can be used as diagnostic or prognostic markers. The 
molecules discovered using these assays can be used to treat disease or to bring about a 
particular result in a patient (e.g., blood vessel growth) by activating or inhibiting the 
polypeptide/molecule. Moreover, the assays can discover agents which may inhibit or 
enhance the production of the polypeptides of the invention from suitably manipulated 
cells or tissues. Therefore, the invention includes a method of identifying compounds 
which bind to the polypeptides of the invention comprising the steps of: (a) incubating a 
candidate binding compound with the polypeptide; and (b) determining if binding has 
occurred. Moreover, the invention includes a method of identifying agonists/antagonists 
comprising the steps of: (a) incubating a candidate compound with the polypeptide, (b) 
assaying a biological activity, and (c) determining if a biological activity of the 
polypeptide has been altered. 

Also, one could identify molecules that bind a polypeptide of the invention 
experimentally by using the beta-pleated sheet regions contained in the polypeptide 
sequence of the protein. Accordingly, specific embodiments of the invention are directed 
to polynucleotides encoding polypeptides which comprise, or alternatively consist of, the 
amino acid sequence of each of the beta pleated sheet regions in a disclosed polypeptide 
sequence. Additional embodiments of the invention are directed to polynucleotides 
encoding polypeptides which comprise, or alternatively consist of, any combination or all 
of the beta pleated sheet regions contained in the polypeptide sequences of the invention. 
Additional preferred 
embodiments of the invention are directed to polypeptides which comprise, or alternatively 
consist of, the amino acid sequence of each of the beta pleated sheet regions in one of the 
polypeptide sequences of the invention. Additional embodiments of the invention are 
directed to polypeptides which comprise, or alternatively consist of, any combination or all 
of the beta pleated sheet regions in one of the polypeptide sequences of the invention. 

Targeted Delivery 

In another embodiment, the invention provides a method of delivering 
compositions to targeted cells expressing a receptor for a polypeptide of the invention, or 
cells expressing a cell bound form of a polypeptide of the invention. 

As discussed herein, polypeptides or antibodies of the invention may be associated 
with heterologous polypeptides, heterologous nucleic acids, toxins, or prodrugs via 
hydrophobic, hydrophilic, ionic and/or covalent interactions. 

In one embodiment, the invention provides a method for the specific delivery of 
compositions of the invention to cells by administering polypeptides of the invention 
(including antibodies) that are associated with heterologous polypeptides or nucleic acids. 
In one example, the invention provides a method for delivering a therapeutic protein into 
the targeted cell. In another example, the invention provides a method for delivering a 
single stranded nucleic acid (e.g., antisense or ribozymes) or double stranded nucleic acid 
(e.g., DNA that can integrate into the cell's genome or replicate episomally and that can be 
transcribed) into the targeted cell. 

In another embodiment, the invention provides a method for the specific 
destruction of cells (e.g., the destruction of tumor cells) by administering polypeptides of 
the invention (e.g., polypeptides of the invention or antibodies of the invention) in 
association with toxins or cytotoxic prodrugs. 

By "toxin" is meant compounds that bind and activate endogenous cytotoxic 
effector systems, radioisotopes, holotoxins, modified toxins, catalytic subunits of toxins, or 
any molecules or enzymes not normally present in or on the surface of a cell that under 
defined conditions cause the cell's death. Toxins that may be used according to the 
methods of the invention include, but are not limited to, radioisotopes known in the art, 
compounds such as, for example, antibodies (or complement fixing containing portions 
thereof) that bind an inherent or induced endogenous cytotoxic effector system, thymidine 
kinase, endonuclease, RNAse, alpha toxin, ricin, abrin, Pseudomonas exotoxin A, 
diphtheria toxin, saporin, momordin, gelonin, pokeweed antiviral protein, alpha-sarcin and 
cholera toxin. By "cytotoxic prodrug" is meant a non-toxic compound that is converted by 
an enzyme, normally present in the cell, into a cytotoxic compound. Cytotoxic prodrugs 
that may be used according to the methods of the invention include, but are not limited to, 
glutamyl derivatives of benzoic acid mustard alkylating agent, phosphate derivatives of 
etoposide or mitomycin C, cytosine arabinoside, daunorubicin, and phenoxyacetamide 
derivatives of doxorubicin. 


Drug Screening 

Further contemplated is the use of the polypeptides of the present invention, or the 
polynucleotides encoding these polypeptides, to screen for molecules which modify the 
activities of the polypeptides of the present invention. Such a method would include 
contacting the polypeptide of the present invention with a selected compound(s) suspected 
of having antagonist or agonist activity, and assaying the activity of these polypeptides 
following binding. 

This invention is particularly useful for screening therapeutic compounds by using 
the polypeptides of the present invention, or binding fragments thereof, in any of a variety 
of drug screening techniques. The polypeptide or fragment employed in such a test may be 
affixed to a solid support, expressed on a cell surface, free in solution, or located 
intracellularly. One method of drug screening utilizes eukaryotic or prokaryotic host cells 
which are stably transformed with recombinant nucleic acids expressing the polypeptide or 
fragment. Drugs are screened against such transformed cells in competitive binding assays. 
One may measure, for example, the formation of complexes between the agent being 
tested and a polypeptide of the present invention. 

Thus, the present invention provides methods of screening for drugs or any other 
agents which affect activities mediated by the polypeptides of the present invention. These 
methods comprise contacting such an agent with a polypeptide of the present invention or 
a fragment thereof and assaying for the presence of a complex between the agent and the 
polypeptide or a fragment thereof, by methods well known in the art. In such a competitive 
binding assay, the agents to screen are typically labeled. Following incubation, free agent 
is separated from that present in bound form, and the amount of free or uncomplexed label 
is a measure of the ability of a particular agent to bind to the polypeptides of the present 
invention. 

Another technique for drug screening provides high throughput screening for 
compounds having suitable binding affinity to the polypeptides of the present invention, 
and is described in great detail in European Patent Application 84/03564, published on 
September 13, 1984, which is incorporated herein by reference. Briefly stated, large 
numbers of different small peptide test compounds are synthesized on a solid substrate, 
such as plastic pins or some other surface. The peptide test compounds are reacted with 
polypeptides of the present invention and washed. Bound polypeptides are then detected by 
methods well known in the art. Purified polypeptides are coated directly onto plates for use 
in the aforementioned drug screening techniques. In addition, non-neutralizing antibodies 
may be used to capture the peptide and immobilize it on the solid support. 

This invention also contemplates the use of competitive drug screening assays in 
which neutralizing antibodies capable of binding polypeptides of the present invention 
specifically compete with a test compound for binding to the polypeptides or fragments 
thereof. In this manner, the antibodies are used to detect the presence of any peptide which 
shares one or more antigenic epitopes with a polypeptide of the invention. 

Antisense And Ribozyme (Antagonists) 

In specific embodiments, antagonists according to the present invention are nucleic 
acids corresponding to the sequences contained in SEQ ID NO:X, or the complementary 
strand thereof, and/or to nucleotide sequences contained in a deposited clone. In one 
embodiment, the antisense sequence is generated internally by the organism; in another 
embodiment, the antisense sequence is separately administered (see, for example, 
O'Connor, Neurochem., 56:560 (1991); Oligodeoxynucleotides as Antisense Inhibitors of 
Gene Expression, CRC Press, Boca Raton, FL (1988)). Antisense technology can be used 
to control gene expression through antisense DNA or RNA, or through triple-helix 
formation. Antisense techniques are discussed, for example, in Okano, Neurochem., 
56:560 (1991); Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC 
Press, Boca Raton, FL (1988). Triple helix formation is discussed in, for instance, Lee et 
al., Nucleic Acids Research, 6:3073 (1979); Cooney et al., Science, 241:456 (1988); and 
Dervan et al., Science, 251:1300 (1991). The methods are based on binding of a 
polynucleotide to a complementary DNA or RNA. 

For example, the use of c-myc and c-myb antisense RNA constructs to inhibit the 
growth of the non-lymphocytic leukemia cell line HL-60 and other cell lines was 
previously described. (Wickstrom et al. (1988); Anfossi et al. (1989)). These experiments 
were performed in vitro by incubating cells with the oligoribonucleotide. A similar 
procedure for in vivo use is described in WO 91/15580. Briefly, a pair of oligonucleotides 
for a given antisense RNA is produced as follows: A sequence complementary to the first 
15 bases of the open reading frame is flanked by an EcoRI site on the 5' end and a HindIII 
site on the 3' end. Next, the pair of oligonucleotides is heated at 90 °C for one minute and 
then annealed in 2X ligation buffer (20 mM Tris-HCl pH 7.5, 10 mM MgCl2, 10 mM 
dithiothreitol (DTT) and 0.2 mM ATP) and then ligated to the EcoRI/HindIII site of the 
retroviral vector PMV7 (WO 91/15580). 

For example, the 5' coding portion of a polynucleotide that encodes the mature 
polypeptide of the present invention may be used to design an antisense RNA 
oligonucleotide of from about 10 to 40 base pairs in length. A DNA oligonucleotide is 
designed to be complementary to a region of the gene involved in transcription thereby 
preventing transcription and the production of the receptor. The antisense RNA 
oligonucleotide hybridizes to the mRNA in vivo and blocks translation of the mRNA 
molecule into receptor polypeptide. 

In one embodiment, the antisense nucleic acid of the invention is produced 
intracellularly by transcription from an exogenous sequence. For example, a vector or a 
portion thereof, is transcribed, producing an antisense nucleic acid (RNA) of the invention. 
Such a vector would contain a sequence encoding the antisense nucleic acid of the 
invention. Such a vector can remain episomal or become chromosomally integrated, as 
long as it can be transcribed to produce the desired antisense RNA. Such vectors can be 
constructed by recombinant DNA technology methods standard in the art. Vectors can be 
plasmid, viral, or others known in the art, used for replication and expression in vertebrate 
cells. Expression of the sequence encoding a polypeptide of the invention, or fragments 
thereof, can be by any promoter known in the art to act in vertebrate, preferably human, 
cells. Such promoters can be inducible or constitutive. Such promoters include, but are 
not limited to, the SV40 early promoter region (Bernoist and Chambon, Nature, 
29:304-310 (1981)), the promoter contained in the 3' long terminal repeat of Rous sarcoma virus 
(Yamamoto et al., Cell, 22:787-797 (1980)), the herpes thymidine kinase promoter (Wagner et al., 
Proc. Natl. Acad. Sci. U.S.A., 78:1441-1445 (1981)), the regulatory sequences of the 
metallothionein gene (Brinster et al., Nature, 296:39-42 (1982)), etc. 

The antisense nucleic acids of the invention comprise a sequence complementary to 
at least a portion of an RNA transcript of a gene of interest. However, absolute 
complementarity, although preferred, is not required. A sequence "complementary to at 
least a portion of an RNA," referred to herein, means a sequence having sufficient 
complementarity to be able to hybridize with the RNA, forming a stable duplex; in the case 
of double stranded antisense nucleic acids of the invention, a single strand of the duplex 
DNA may thus be tested, or triplex formation may be assayed. The ability to hybridize 
will depend on both the degree of complementarity and the length of the antisense nucleic 
acid. Generally, the larger the hybridizing nucleic acid, the more base mismatches with an 
RNA sequence of the invention it may contain and still form a stable duplex (or triplex as 
the case may be). One skilled in the art can ascertain a tolerable degree of mismatch by 
use of standard procedures to determine the melting point of the hybridized complex. 
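
As a rough illustration of how a melting point might be estimated before it is measured, the sketch below applies the Wallace rule, a common approximation for short oligonucleotides (about 14 to 20 nucleotides) in which each A or T contributes 2 °C and each G or C contributes 4 °C. The rule is not taken from this disclosure, it is used here as an assumption, and it does not replace the experimental determination referred to above.

```python
def wallace_tm(oligo: str) -> int:
    """Estimate melting temperature (degrees C) with the Wallace rule.

    Tm = 2 * (A + T) + 4 * (G + C). U is treated as T so that RNA
    sequences can be passed in. This is a rough estimate for short
    oligonucleotides only and ignores salt and mismatch effects.
    """
    seq = oligo.upper().replace("U", "T")
    unexpected = set(seq) - set("ACGT")
    if unexpected:
        raise ValueError(f"unexpected characters: {sorted(unexpected)}")
    at_count = seq.count("A") + seq.count("T")
    gc_count = seq.count("G") + seq.count("C")
    return 2 * at_count + 4 * gc_count
```

A longer or more GC-rich oligonucleotide gives a higher estimate, which is consistent with the statement above that larger hybridizing nucleic acids can tolerate more mismatches while still forming a stable duplex.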

Oligonucleotides that are complementary to the 5' end of the message, e.g., the 5' 
untranslated sequence up to and including the AUG initiation codon, should work most 
efficiently at inhibiting translation. However, sequences complementary to the 3' 
untranslated sequences of mRNAs have been shown to be effective at inhibiting translation 
of mRNAs as well. See generally, Wagner, R., Nature, 372:333-335 (1994). Thus, 
oligonucleotides complementary to either the 5'- or 3'-non-translated, non-coding 
regions of a polynucleotide sequence of the invention could be used in an antisense 
approach to inhibit translation of endogenous mRNA. Oligonucleotides complementary to 
the 5' untranslated region of the mRNA should include the complement of the AUG start 
codon. Antisense oligonucleotides complementary to mRNA coding regions are less 
efficient inhibitors of translation but could be used in accordance with the invention. 
Whether designed to hybridize to the 5'-, 3'- or coding region of mRNA, antisense 
nucleic acids should be at least six nucleotides in length, and are preferably 
oligonucleotides ranging from 6 to about 50 nucleotides in length. In specific aspects the 
oligonucleotide is at least 10 nucleotides, at least 17 nucleotides, at least 25 nucleotides or 
at least 50 nucleotides. 

The polynucleotides of the invention can be DNA or RNA or chimeric mixtures or 
derivatives or modified versions thereof, single-stranded or double-stranded. The 
oligonucleotide can be modified at the base moiety, sugar moiety, or phosphate backbone, 
for example, to improve stability of the molecule, hybridization, etc. The oligonucleotide 
may include other appended groups such as peptides (e.g., for targeting host cell receptors 
in vivo), or agents facilitating transport across the cell membrane (see, e.g., Letsinger et al., 
Proc. Natl. Acad. Sci. U.S.A. 86:6553-6556 (1989); Lemaitre et al., Proc. Natl. Acad. Sci., 
84:648-652 (1987); PCT Publication No. WO88/09810, published December 15, 1988) or 
the blood-brain barrier (see, e.g., PCT Publication No. WO89/10134, published April 25, 
1988), hybridization-triggered cleavage agents (see, e.g., Krol et al., BioTechniques, 
6:958-976 (1988)), or intercalating agents (see, e.g., Zon, Pharm. Res., 5:539-549 
(1988)). To this end, the oligonucleotide may be conjugated to another molecule, e.g., a 
peptide, hybridization triggered cross-linking agent, transport agent, 
hybridization-triggered cleavage agent, etc. 

The antisense oligonucleotide may comprise at least one modified base moiety 
which is selected from the group including, but not limited to, 5-fluorouracil, 
5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine, 
5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine, 
5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, 
N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 
2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 
7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, 
beta-D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 
2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, 
pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 
5-methyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 
5-methyl-2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. 

The antisense oligonucleotide may also comprise at least one modified sugar 
moiety selected from the group including, but not limited to, arabinose, 2-fluoroarabinose, 
xylulose, and hexose. 

In yet another embodiment, the antisense oligonucleotide comprises at least one 
modified phosphate backbone selected from the group including, but not limited to, a 
phosphorothioate, a phosphorodithioate, a phosphoramidothioate, a phosphoramidate, a 
phosphorodiamidate, a methylphosphonate, an alkyl phosphotriester, and a formacetal or 
analog thereof. 

In yet another embodiment, the antisense oligonucleotide is an α-anomeric 
oligonucleotide. An α-anomeric oligonucleotide forms specific double-stranded hybrids 
with complementary RNA in which, contrary to the usual β-units, the strands run parallel 
to each other (Gautier et al., Nucl. Acids Res., 15:6625-6641 (1987)). The oligonucleotide 
is a 2'-O-methylribonucleotide (Inoue et al., Nucl. Acids Res., 15:6131-6148 (1987)), or a 
chimeric RNA-DNA analogue (Inoue et al., FEBS Lett. 215:327-330 (1987)). 

Polynucleotides of the invention may be synthesized by standard methods known in 
the art, e.g. by use of an automated DNA synthesizer (such as are commercially available 
from Biosearch, Applied Biosystems, etc.). As examples, phosphorothioate 
oligonucleotides may be synthesized by the method of Stein et al. (Nucl. Acids Res., 
16:3209 (1988)), methylphosphonate oligonucleotides can be prepared by use of controlled 
pore glass polymer supports (Sarin et al., Proc. Natl. Acad. Sci. U.S.A., 85:7448-7451 
(1988)), etc. 

While antisense nucleotides complementary to the coding region sequence of the 
invention could be used, those complementary to the transcribed untranslated region are 
most preferred. 
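
The design guidance above (an oligonucleotide complementary to the 5' untranslated sequence up to and including the AUG initiation codon, from 6 to about 50 nucleotides in length) can be illustrated with a minimal sketch. The function names and the default length of 17 are hypothetical, and the sketch simply takes the first AUG in the supplied sequence as the initiation codon, which is an assumption that would have to be checked against the actual open reading frame.

```python
# Watson-Crick complement table for RNA.
_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def reverse_complement_rna(seq: str) -> str:
    """Return the reverse complement of an RNA sequence, written 5' to 3'."""
    return seq.translate(_COMPLEMENT)[::-1]


def design_antisense(mrna: str, length: int = 17) -> str:
    """Design an antisense RNA oligonucleotide ending at the AUG codon.

    mrna is the target message written 5' to 3' (T is read as U).
    The target window covers the AUG and the bases immediately upstream,
    so the returned oligonucleotide begins with CAU, the complement of AUG.
    The window is shortened if the 5' leader is shorter than requested.
    """
    if not 6 <= length <= 50:
        raise ValueError("length should be between 6 and 50 nucleotides")
    mrna = mrna.upper().replace("T", "U")
    start = mrna.find("AUG")  # assumption: first AUG is the start codon
    if start < 0:
        raise ValueError("no AUG initiation codon found")
    end = start + 3
    target = mrna[max(0, end - length):end]
    if len(target) < 6:
        raise ValueError("5' leader too short for a 6-nucleotide oligo")
    return reverse_complement_rna(target)
```

For the hypothetical message `GGGAAACCCAUGGCC` and a length of 8, the target window is `AACCCAUG` and the resulting antisense oligonucleotide is `CAUGGGUU`.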

Potential antagonists according to the invention also include catalytic RNA, or a 
ribozyme (see, e.g., PCT International Publication WO 90/11364, published October 4, 
1990; Sarver et al., Science, 247:1222-1225 (1990)). While ribozymes that cleave mRNA at 
site specific recognition sequences can be used to destroy mRNAs corresponding to the 
polynucleotides of the invention, the use of hammerhead ribozymes is preferred. 
Hammerhead ribozymes cleave mRNAs at locations dictated by flanking regions that form 
complementary base pairs with the target mRNA. The sole requirement is that the target 
mRNA have the following sequence of two bases: 5'-UG-3'. The construction and 
production of hammerhead ribozymes is well known in the art and is described more fully 
in Haseloff and Gerlach, Nature, 334:585-591 (1988). There are numerous potential 
hammerhead ribozyme cleavage sites within each nucleotide sequence disclosed in the 
sequence listing. Preferably, the ribozyme is engineered so that the cleavage recognition 
site is located near the 5' end of the mRNA corresponding to the polynucleotides of the 
invention; i.e., to increase efficiency and minimize the intracellular accumulation of non- 
functional mRNA transcripts. 
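
A simple scan for candidate cleavage sites, using the two-base 5'-UG-3' requirement stated above, might look like the following sketch. The function names are hypothetical; the preference for the site nearest the 5' end follows the text, while everything else (0-based indexing, reading T as U) is an illustrative assumption.

```python
from typing import List, Optional


def find_ug_sites(mrna: str) -> List[int]:
    """Return 0-based positions of every 5'-UG-3' dinucleotide in the mRNA.

    The sequence is read 5' to 3'; T is treated as U so that cDNA
    sequences from a sequence listing can be passed in directly.
    """
    seq = mrna.upper().replace("T", "U")
    return [i for i in range(len(seq) - 1) if seq[i:i + 2] == "UG"]


def preferred_site(mrna: str) -> Optional[int]:
    """Return the candidate site closest to the 5' end, or None if absent."""
    sites = find_ug_sites(mrna)
    return sites[0] if sites else None
```

For the hypothetical message `AUGCCUGA`, candidate sites are found at positions 1 and 5, and position 1 would be preferred because it lies nearer the 5' end.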

As in the antisense approach, the ribozymes of the invention can be composed of 
modified oligonucleotides (e.g. for improved stability, targeting, etc.) and should be 
delivered to cells which express the polynucleotides of the invention in vivo. DNA 
constructs encoding the ribozyme may be introduced into the cell in the same manner as 
described above for the introduction of antisense encoding DNA. A preferred method of 
delivery involves using a DNA construct "encoding" the ribozyme under the control of a 
strong constitutive promoter, such as, for example, pol III or pol II promoter, so that 
transfected cells will produce sufficient quantities of the ribozyme to destroy endogenous 
messages and inhibit translation. Since ribozymes, unlike antisense molecules, are 
catalytic, a lower intracellular concentration is required for efficiency. 

Antagonist/agonist compounds may be employed to inhibit the cell growth and 
proliferation effects of the polypeptides of the present invention on neoplastic cells and 
tissues, i.e. stimulation of angiogenesis of tumors, and, therefore, retard or prevent 
abnormal cellular growth and proliferation, for example, in tumor formation or growth. 

The antagonist/agonist may also be employed to prevent hyper-vascular diseases, 
and prevent the proliferation of epithelial lens cells after extracapsular cataract surgery. 
Prevention of the mitogenic activity of the polypeptides of the present invention may also 
be desirable in cases such as restenosis after balloon angioplasty. 

The antagonist/agonist may also be employed to prevent the growth of scar tissue 
during wound healing. 

The antagonist/agonist may also be employed to treat the diseases described herein. 
Thus, the invention provides a method of treating disorders or diseases, including but not 
limited to the disorders or diseases listed throughout this application, associated with 
overexpression of a polynucleotide of the present invention by administering to a patient 

(a) an antisense molecule directed to the polynucleotide of the present invention, and/or 

(b) a ribozyme directed to the polynucleotide of the present invention. 

Other Activities 

The polypeptide of the present invention, as a result of the ability to stimulate 
vascular endothelial cell growth, may be employed in treatment for stimulating re- 
vascularization of ischemic tissues due to various disease conditions such as thrombosis, 
arteriosclerosis, and other cardiovascular conditions. The polypeptide may also be 
employed to stimulate angiogenesis and limb regeneration, as discussed above. 

The polypeptide may also be employed for treating wounds due to injuries, burns, 
post-operative tissue repair, and ulcers, since it is mitogenic to various cells of different 
origins, such as fibroblast cells and skeletal muscle cells, and therefore facilitates the repair 
or replacement of damaged or diseased tissue. 

The polypeptide of the present invention may also be employed to stimulate neuronal 
growth and to treat and prevent neuronal damage which occurs in certain neuronal 
disorders or neuro-degenerative conditions such as Alzheimer's disease, Parkinson's 
disease, and AIDS-related complex. The polypeptide of the invention may have the ability 
to stimulate chondrocyte growth; therefore, it may be employed to enhance bone and 
periodontal regeneration and aid in tissue transplants or bone grafts. 

The polypeptide of the present invention may also be employed to prevent skin 
aging due to sunburn by stimulating keratinocyte growth. 

The polypeptide of the invention may also be employed for preventing hair loss, 
since FGF family members activate hair-forming cells and promote melanocyte growth. 
Along the same lines, the polypeptides of the present invention may be employed to 
stimulate growth and differentiation of hematopoietic cells and bone marrow cells when 
used in combination with other cytokines. 

The polypeptide of the invention may also be employed to maintain organs before 
transplantation or for supporting cell culture of primary tissues. 

The polypeptide of the present invention may also be employed for inducing tissue 
of mesodermal origin to differentiate in early embryos. 

The polypeptide or polynucleotides and/or agonist or antagonists of the present 
invention may also increase or decrease the differentiation or proliferation of embryonic 
stem cells, in addition to, as discussed above, cells of the hematopoietic lineage. 

The polypeptide or polynucleotides and/or agonist or antagonists of the present 
invention may also be used to modulate mammalian characteristics, such as body height, 
weight, hair color, eye color, skin, percentage of adipose tissue, pigmentation, size, and 
shape (e.g., cosmetic surgery). Similarly, polypeptides or polynucleotides and/or agonist 
or antagonists of the present invention may be used to modulate mammalian metabolism 
affecting catabolism, anabolism, processing, utilization, and storage of energy. 

Polypeptide or polynucleotides and/or agonist or antagonists of the present 
invention may be used to change a mammal's mental state or physical state by influencing 
biorhythms, circadian rhythms, depression (including depressive disorders), tendency for 
violence, tolerance for pain, reproductive capabilities (preferably by Activin or Inhibin-like 
activity), hormonal or endocrine levels, appetite, libido, memory, stress, or other cognitive 
qualities. 

Polypeptide or polynucleotides and/or agonist or antagonists of the present 
invention may also be used as a food additive or preservative, such as to increase or 
decrease storage capabilities, fat content, lipid, protein, carbohydrate, vitamins, minerals, 
cofactors or other nutritional components. 

Other Preferred Embodiments 

Other preferred embodiments of the claimed invention include an isolated nucleic 
acid molecule comprising a nucleotide sequence which is at least 95% identical to a 
sequence of at least about 50 contiguous nucleotides in the nucleotide sequence of SEQ ID 
NO:X wherein X is any integer as defined in Table XIII. 

Also preferred is a nucleic acid molecule wherein said sequence of contiguous 
nucleotides is included in the nucleotide sequence of SEQ ID NO:X in the range of 
positions beginning with the nucleotide at about the position of the 5' Nucleotide of the 
Clone Sequence and ending with the nucleotide at about the position of the 3' Nucleotide 
of the Clone Sequence as defined for SEQ ID NO:X in Table XIII. 

Also preferred is a nucleic acid molecule wherein said sequence of contiguous 
nucleotides is included in the nucleotide sequence of SEQ ID NO:X in the range of 
positions beginning with the nucleotide at about the position of the 5' Nucleotide of the 
Start Codon and ending with the nucleotide at about the position of the 3' Nucleotide of 
the Clone Sequence as defined for SEQ ID NO:X in Table XIII. 

Similarly preferred is a nucleic acid molecule wherein said sequence of contiguous 
nucleotides is included in the nucleotide sequence of SEQ ID NO:X in the range of 
positions beginning with the nucleotide at about the position of the 5' Nucleotide of the 
First Amino Acid of the Signal Peptide and ending with the nucleotide at about the 
position of the 3' Nucleotide of the Clone Sequence as defined for SEQ ID NO:X in Table 
XIII. 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to a sequence of at least about 150 contiguous 
nucleotides in the nucleotide sequence of SEQ ID NO:X. 

Further preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to a sequence of at least about 500 contiguous 
nucleotides in the nucleotide sequence of SEQ ID NO:X. 

A further preferred embodiment is a nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to the nucleotide sequence of SEQ ID NO:X 
beginning with the nucleotide at about the position of the 5' Nucleotide of the First Amino 
Acid of the Signal Peptide and ending with the nucleotide at about the position of the 3' 
Nucleotide of the Clone Sequence as defined for SEQ ID NO:X in Table XIII. 

A further preferred embodiment is an isolated nucleic acid molecule comprising a 
nucleotide sequence which is at least 95% identical to the complete nucleotide sequence of 
SEQ ID NO:X. 

Also preferred is an isolated nucleic acid molecule which hybridizes under 
stringent hybridization conditions to a nucleic acid molecule, wherein said nucleic acid 
molecule which hybridizes does not hybridize under stringent hybridization conditions to a 
nucleic acid molecule having a nucleotide sequence consisting of only A residues or of 
only T residues. 

Also preferred is a composition of matter comprising a DNA molecule which 
comprises a human cDNA clone identified by a cDNA Clone Identifier in Table XIII, 
which DNA molecule is contained in the material deposited with the American Type 
Culture Collection and given the ATCC Deposit Number shown in Table XIII for said 
cDNA Clone Identifier. 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to a sequence of at least 50 contiguous nucleotides 
in the nucleotide sequence of a human cDNA clone identified by a cDNA Clone Identifier 
in Table XIII, which DNA molecule is contained in the deposit given the ATCC Deposit 
Number shown in Table XIII. 

Also preferred is an isolated nucleic acid molecule, wherein said sequence of at 
least 50 contiguous nucleotides is included in the nucleotide sequence of the complete 
open reading frame sequence encoded by said human cDNA clone. 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to a sequence of at least 150 contiguous nucleotides 
in the nucleotide sequence encoded by said human cDNA clone. 

A further preferred embodiment is an isolated nucleic acid molecule comprising a 
nucleotide sequence which is at least 95% identical to a sequence of at least 500 contiguous 
nucleotides in the nucleotide sequence encoded by said human cDNA clone. 

A further preferred embodiment is an isolated nucleic acid molecule comprising a 
nucleotide sequence which is at least 95% identical to the complete nucleotide sequence 
encoded by said human cDNA clone. 

A further preferred embodiment is a method for detecting in a biological sample a 
nucleic acid molecule comprising a nucleotide sequence which is at least 95% identical to 
a sequence of at least 50 contiguous nucleotides in a sequence selected from the group 
consisting of: a nucleotide sequence of SEQ ID NO:X wherein X is any integer as defined 
in Table XIII; and a nucleotide sequence encoded by a human cDNA clone identified by a 
cDNA Clone Identifier in Table XIII and contained in the deposit with the ATCC Deposit 
Number shown for said cDNA clone in Table XIII; which method comprises a step of 
comparing a nucleotide sequence of at least one nucleic acid molecule in said sample with 
a sequence selected from said group and determining whether the sequence of said nucleic 
acid molecule in said sample is at least 95% identical to said selected sequence. 
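
The comparison step recited above can be pictured with a short Python sketch. It checks whether any stretch of a sample sequence is at least 95% identical to a stretch of 50 contiguous nucleotides in a reference sequence. The function names, the use of fixed-length windows, and the ungapped position-by-position comparison are simplifying assumptions made for illustration; in practice a sequence alignment program that handles insertions and deletions would normally be used to determine percent identity.

```python
def percent_identity(a, b):
    """Percent of positions at which two equal-length sequences agree."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


def has_identical_stretch(sample, reference, min_len=50, threshold=95.0):
    """True if some ungapped window of `min_len` residues in `sample` is at
    least `threshold` percent identical to some window of `reference`.

    Simplification (an assumption of this sketch): only windows of exactly
    `min_len` residues are compared, and gaps are not considered.
    """
    sample, reference = sample.upper(), reference.upper()
    if len(sample) < min_len or len(reference) < min_len:
        return False
    for i in range(len(reference) - min_len + 1):
        ref_window = reference[i:i + min_len]
        for j in range(len(sample) - min_len + 1):
            if percent_identity(sample[j:j + min_len], ref_window) >= threshold:
                return True
    return False


if __name__ == "__main__":
    reference = "ACGT" * 20            # hypothetical 80-nucleotide reference
    sample = "TT" + reference[2:]      # same sequence with two mismatches
    print(has_identical_stretch(sample, reference))
```

The same helper applies to amino acid sequences by passing, for example, `min_len=10` and `threshold=90.0`.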

Also preferred is the above method wherein said step of comparing sequences 
comprises determining the extent of nucleic acid hybridization between nucleic acid 
molecules in said sample and a nucleic acid molecule comprising said sequence selected 
from said group. Similarly, also preferred is the above method wherein said step of 
comparing sequences is performed by comparing the nucleotide sequence determined from 
a nucleic acid molecule in said sample with said sequence selected from said group. The 
nucleic acid molecules can comprise DNA molecules or RNA molecules. 

A further preferred embodiment is a method for identifying the species, tissue or 
cell type of a biological sample which method comprises a step of detecting nucleic acid 
molecules in said sample, if any, comprising a nucleotide sequence that is at least 95% 
identical to a sequence of at least 50 contiguous nucleotides in a sequence selected from 
the group consisting of: a nucleotide sequence of SEQ ID NO:X wherein X is any integer 
as defined in Table XIII; and a nucleotide sequence encoded by a human cDNA clone 
identified by a cDNA Clone Identifier in Table XIII and contained in the deposit with the 
ATCC Deposit Number shown for said cDNA clone in Table XIII. 

The method for identifying the species, tissue or cell type of a biological sample 
can comprise a step of detecting nucleic acid molecules comprising a nucleotide sequence 
in a panel of at least two nucleotide sequences, wherein at least one sequence in said panel 
is at least 95% identical to a sequence of at least 50 contiguous nucleotides in a sequence 
selected from said group. 

Also preferred is a method for diagnosing in a subject a pathological condition 
associated with abnormal structure or expression of a gene encoding a secreted protein 
identified in Table XIII, which method comprises a step of detecting in a biological sample 
obtained from said subject nucleic acid molecules, if any, comprising a nucleotide 
sequence that is at least 95% identical to a sequence of at least 50 contiguous nucleotides 
in a sequence selected from the group consisting of: a nucleotide sequence of SEQ ID 
NO:X wherein X is any integer as defined in Table XIII; and a nucleotide sequence 
encoded by a human cDNA clone identified by a cDNA Clone Identifier in Table XIII and 
contained in the deposit with the ATCC Deposit Number shown for said cDNA clone in 
Table XIII. 

The method for diagnosing a pathological condition can comprise a step of 
detecting nucleic acid molecules comprising a nucleotide sequence in a panel of at least 
two nucleotide sequences, wherein at least one sequence in said panel is at least 95% 
identical to a sequence of at least 50 contiguous nucleotides in a sequence selected from 
said group. 

Also preferred is a composition of matter comprising isolated nucleic acid 
molecules wherein the nucleotide sequences of said nucleic acid molecules comprise a 
panel of at least two nucleotide sequences, wherein at least one sequence in said panel is at 
least 95% identical to a sequence of at least 50 contiguous nucleotides in a sequence 
selected from the group consisting of: a nucleotide sequence of SEQ ID NO:X wherein X 
is any integer as defined in Table XIII; and a nucleotide sequence encoded by a human 
cDNA clone identified by a cDNA Clone Identifier in Table XIII and contained in the 
deposit with the ATCC Deposit Number shown for said cDNA clone in Table XIII. The 
nucleic acid molecules can comprise DNA molecules or RNA molecules. 

Also preferred is an isolated polypeptide comprising an amino acid sequence at 
least 90% identical to a sequence of at least about 10 contiguous amino acids in the amino 
acid sequence of SEQ ID NO: Y wherein Y is any integer as defined in Table XIII. 

Also preferred is a polypeptide, wherein said sequence of contiguous amino acids is 
included in the amino acid sequence of SEQ ID NO: Y in the range of positions beginning 
with the residue at about the position of the First Amino Acid of the Secreted Portion and 
ending with the residue at about the Last Amino Acid of the Open Reading Frame as set 
forth for SEQ ID NO: Y in Table XIII. 

Also preferred is an isolated polypeptide comprising an amino acid sequence at 
least 95% identical to a sequence of at least about 30 contiguous amino acids in the amino 
acid sequence of SEQ ID NO:Y. 

Further preferred is an isolated polypeptide comprising an amino acid sequence at 
least 95% identical to a sequence of at least about 100 contiguous amino acids in the amino 
acid sequence of SEQ ID NO:Y. 

Further preferred is an isolated polypeptide comprising an amino acid sequence at 
least 95% identical to the complete amino acid sequence of SEQ ID NO: Y. 

Further preferred is an isolated polypeptide comprising an amino acid sequence at 
least 90% identical to a sequence of at least about 10 contiguous amino acids in the 
complete amino acid sequence of a secreted protein encoded by a human cDNA clone 
identified by a cDNA Clone Identifier in Table XIII and contained in the deposit with the 
ATCC Deposit Number shown for said cDNA clone in Table XIII. 

Also preferred is a polypeptide wherein said sequence of contiguous amino acids is 
included in the amino acid sequence of a secreted portion of the secreted protein encoded 
by a human cDNA clone identified by a cDNA Clone Identifier in Table XIII and 
contained in the deposit with the ATCC Deposit Number shown for said cDNA clone in 
Table XIII. 

Also preferred is an isolated polypeptide comprising an amino acid sequence at 
least 95% identical to a sequence of at least about 30 contiguous amino acids in the amino 
acid sequence of the secreted portion of the protein encoded by a human cDNA clone 
identified by a cDNA Clone Identifier in Table XIII and contained in the deposit with the 
ATCC Deposit Number shown for said cDNA clone in Table XIII. 

Also preferred is an isolated polypeptide comprising an amino acid sequence at 
least 95% identical to a sequence of at least about 100 contiguous amino acids in the amino 
acid sequence of the secreted portion of the protein encoded by a human cDNA clone 
identified by a cDNA Clone Identifier in Table XIII and contained in the deposit with the 
ATCC Deposit Number shown for said cDNA clone in Table XIII. 

Also preferred is an isolated polypeptide comprising an amino acid sequence at 
least 95% identical to the amino acid sequence of the secreted portion of the protein 
encoded by a human cDNA clone identified by a cDNA Clone Identifier in Table XIII and 
contained in the deposit with the ATCC Deposit Number shown for said cDNA clone in 
Table XIII. 

Further preferred is an isolated antibody which binds specifically to a polypeptide 
comprising an amino acid sequence that is at least 90% identical to a sequence of at least 
10 contiguous amino acids in a sequence selected from the group consisting of: an amino 
acid sequence of SEQ ID NO: Y wherein Y is any integer as defined in Table XIII; and a 
complete amino acid sequence of a protein encoded by a human cDNA clone identified by 
a cDNA Clone Identifier in Table XIII and contained in the deposit with the ATCC 
Deposit Number shown for said cDNA clone in Table XIII. 

Further preferred is a method for detecting in a biological sample a polypeptide 
comprising an amino acid sequence which is at least 90% identical to a sequence of at least 
10 contiguous amino acids in a sequence selected from the group consisting of: an amino 
acid sequence of SEQ ID NO: Y wherein Y is any integer as defined in Table XIII; and a 
complete amino acid sequence of a protein encoded by a human cDNA clone identified by 
a cDNA Clone Identifier in Table XIII and contained in the deposit with the ATCC 
Deposit Number shown for said cDNA clone in Table XIII; which method comprises a 
step of comparing an amino acid sequence of at least one polypeptide molecule in said 
sample with a sequence selected from said group and determining whether the sequence of 
said polypeptide molecule in said sample is at least 90% identical to said sequence of at 
least 10 contiguous amino acids. 

Also preferred is the above method wherein said step of comparing an amino acid 
sequence of at least one polypeptide molecule in said sample with a sequence selected 
from said group comprises determining the extent of specific binding of polypeptides in 
said sample to an antibody which binds specifically to a polypeptide comprising an amino 
acid sequence that is at least 90% identical to a sequence of at least 10 contiguous amino 
acids in a sequence selected from the group consisting of: an amino acid sequence of SEQ 
ID NO: Y wherein Y is any integer as defined in Table XIII; and a complete amino acid 
sequence of a protein encoded by a human cDNA clone identified by a cDNA Clone 
Identifier in Table XIII and contained in the deposit with the ATCC Deposit Number 
shown for said cDNA clone in Table XIII. 

Also preferred is the above method wherein said step of comparing sequences is 
performed by comparing the amino acid sequence determined from a polypeptide molecule 
in said sample with said sequence selected from said group. 

Also preferred is a method for identifying the species, tissue or cell type of a 
biological sample which method comprises a step of detecting polypeptide molecules in 
said sample, if any, comprising an amino acid sequence that is at least 90% identical to a 
sequence of at least 10 contiguous amino acids in a sequence selected from the group 
consisting of: an amino acid sequence of SEQ ID NO:Y wherein Y is any integer as 
defined in Table XIII; and a complete amino acid sequence of a secreted protein encoded 
by a human cDNA clone identified by a cDNA Clone Identifier in Table XIII and 
contained in the deposit with the ATCC Deposit Number shown for said cDNA clone in 
Table XIII. 

Also preferred is the above method for identifying the species, tissue or cell type of 
a biological sample, which method comprises a step of detecting polypeptide molecules 
comprising an amino acid sequence in a panel of at least two amino acid sequences, 
wherein at least one sequence in said panel is at least 90% identical to a sequence of at 
least 10 contiguous amino acids in a sequence selected from the above group. 

Also preferred is a method for diagnosing in a subject a pathological condition 
associated with abnormal structure or expression of a gene encoding a secreted protein 
identified in Table XIII, which method comprises a step of detecting in a biological sample 
obtained from said subject polypeptide molecules comprising an amino acid sequence in a 
panel of at least two amino acid sequences, wherein at least one sequence in said panel is 
at least 90% identical to a sequence of at least 10 contiguous amino acids in a sequence 
selected from the group consisting of: an amino acid sequence of SEQ ID NO:Y wherein Y 
is any integer as defined in Table XIII; and a complete amino acid sequence of a secreted 
protein encoded by a human cDNA clone identified by a cDNA Clone Identifier in Table 
XIII and contained in the deposit with the ATCC Deposit Number shown for said cDNA 
clone in Table XIII. 

In any of these methods, the step of detecting said polypeptide molecules includes 
using an antibody. 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to a nucleotide sequence encoding a polypeptide 
wherein said polypeptide comprises an amino acid sequence that is at least 90% identical 
to a sequence of at least 10 contiguous amino acids in a sequence selected from the group 
consisting of: an amino acid sequence of SEQ ID NO:Y wherein Y is any integer as 
defined in Table XIII; and a complete amino acid sequence of a secreted protein encoded 
by a human cDNA clone identified by a cDNA Clone Identifier in Table XIII and 
contained in the deposit with the ATCC Deposit Number shown for said cDNA clone in 
Table XIII. 

Also preferred is an isolated nucleic acid molecule, wherein said nucleotide 
sequence encoding a polypeptide has been optimized for expression of said polypeptide in 
a prokaryotic host. 

Also preferred is an isolated nucleic acid molecule, wherein said polypeptide 
comprises an amino acid sequence selected from the group consisting of: an amino acid 
sequence of SEQ ID NO: Y wherein Y is any integer as defined in Table XIII; and a 
complete amino acid sequence of a secreted protein encoded by a human cDNA clone 
identified by a cDNA Clone Identifier in Table XIII and contained in the deposit with the 
ATCC Deposit Number shown for said cDNA clone in Table XIII. 

Further preferred is a method of making a recombinant vector comprising inserting 
any of the above isolated nucleic acid molecules into a vector. Also preferred is the 
recombinant vector produced by this method. Also preferred is a method of making a 
recombinant host cell comprising introducing the vector into a host cell, as well as the 
recombinant host cell produced by this method. 

Also preferred is a method of making an isolated polypeptide comprising culturing 
this recombinant host cell under conditions such that said polypeptide is expressed and 
recovering said polypeptide. Also preferred is this method of making an isolated 
polypeptide, wherein said recombinant host cell is a eukaryotic cell and said polypeptide is 
a secreted portion of a human secreted protein comprising an amino acid sequence selected 
from the group consisting of: an amino acid sequence of SEQ ID NO: Y beginning with the 
residue at the position of the First Amino Acid of the Secreted Portion of SEQ ID NO: Y 
wherein Y is an integer set forth in Table XIII and said position of the First Amino Acid of 
the Secreted Portion of SEQ ID NO:Y is defined in Table XIII; and an amino acid 
sequence of a secreted portion of a protein encoded by a human cDNA clone identified by 
a cDNA Clone Identifier in Table XIII and contained in the deposit with the ATCC 
Deposit Number shown for said cDNA clone in Table XIII. The isolated polypeptide 
produced by this method is also preferred. 

Also preferred is a method of treatment of an individual in need of an increased 
level of a secreted protein activity, which method comprises administering to such an 
individual a pharmaceutical composition comprising an amount of an isolated polypeptide, 
polynucleotide, or antibody of the claimed invention effective to increase the level of said 
protein activity in said individual. 

The above-recited applications have uses in a wide variety of hosts. Such hosts 
include, but are not limited to, human, murine, rabbit, goat, guinea pig, camel, horse, 
mouse, rat, hamster, pig, micro-pig, chicken, goat, cow, sheep, dog, cat, non-human 
primate, and human. In specific embodiments, the host is a mouse, rabbit, goat, guinea 
pig, chicken, rat, hamster, pig, sheep, dog or cat. In preferred embodiments, the host is a 
mammal. In most preferred embodiments, the host is a human. 

Having generally described the invention, the same will be more readily understood 
by reference to the following examples, which are provided by way of illustration and are 
not intended as limiting. 

Examples 

Example 1: Isolation of a Selected cDNA Clone From the Deposited Sample 

Each cDNA clone in a cited ATCC deposit is contained in a plasmid vector. Table 
XIII identifies the vectors used to construct the cDNA library from which each clone was 
isolated. In many cases, the vector used to construct the library is a phage vector from 
which a plasmid has been excised. The table immediately below correlates the related 
plasmid for each phage vector used in constructing the cDNA library. For example, where 
a particular clone is identified in Table XIII as being isolated in the vector "Lambda Zap," 
the corresponding deposited clone is in "pBluescript." 

Vector Used to Construct Library    Corresponding Deposited Plasmid 
Lambda Zap                          pBluescript (pBS) 
Uni-Zap XR                          pBluescript (pBS) 
Zap Express                         pBK 
lafmid BA                           plafmid BA 
pSport1                             pSport1 
pCMVSport 2.0                       pCMVSport 2.0 
pCMVSport 3.0                       pCMVSport 3.0 
pCR®2.1                             pCR®2.1 


Vectors Lambda Zap (U.S. Patent Nos. 5,128,256 and 5,286,636), Uni-Zap XR 
(U.S. Patent Nos. 5,128,256 and 5,286,636), Zap Express (U.S. Patent Nos. 5,128,256 and 
5,286,636), pBluescript (pBS) (Short, J. M. et al., Nucleic Acids Res. 16:7583-7600 
(1988); Alting-Mees, M. A. and Short, J. M., Nucleic Acids Res. 17:9494 (1989)) and pBK 
(Alting-Mees, M. A. et al., Strategies 5:58-61 (1992)) are commercially available from 
Stratagene Cloning Systems, Inc., 11011 N. Torrey Pines Road, La Jolla, CA, 92037. pBS 
contains an ampicillin resistance gene and pBK contains a neomycin resistance gene. Both 
can be transformed into E. coli strain XL-1 Blue, also available from Stratagene. pBS 
comes in 4 forms: SK+, SK-, KS+ and KS-. The "S" and "K" refer to the orientation of the 
polylinker relative to the T7 and T3 primer sequences which flank the polylinker region ("S" is for 
SacI and "K" is for KpnI, which are the first sites on each respective end of the linker). 
"+" and "-" refer to the orientation of the f1 origin of replication ("ori"), such that in one 
orientation, single stranded rescue initiated from the f1 ori generates sense strand DNA and 
in the other, antisense. 

Vectors pSport1, pCMVSport 2.0 and pCMVSport 3.0 were obtained from Life 
Technologies, Inc., P. O. Box 6009, Gaithersburg, MD 20897. All Sport vectors contain 
an ampicillin resistance gene and may be transformed into E. coli strain DH10B, also 
available from Life Technologies. (See, for instance, Gruber, C. E., et al., Focus 15:59 
(1993).) Vector lafmid BA (Bento Soares, Columbia University, NY) contains an 
ampicillin resistance gene and can be transformed into E. coli strain XL-1 Blue. Vector 
pCR®2.1, which is available from Invitrogen, 1600 Faraday Avenue, Carlsbad, CA 92008, 
contains an ampicillin resistance gene and may be transformed into E. coli strain DH10B, 
available from Life Technologies. (See, for instance, Clark, J. M., Nuc. Acids Res. 
16:9677-9686 (1988) and Mead, D. et al., Bio/Technology 9: (1991).) Preferably, a 
polynucleotide of the present invention does not comprise the phage vector sequences 
identified for the particular clone in Table XIII, as well as the corresponding plasmid 
vector sequences designated above. 
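
For readers who track clones electronically, the correspondence between library vector, deposited plasmid, selection agent and host strain described above can be captured in a small lookup table. The Python sketch below is one possible arrangement; the names `Deposit`, `DEPOSITS` and `deposited_form` are hypothetical, while the entries themselves restate the table and the two preceding paragraphs.

```python
from collections import namedtuple

# One record per library vector: the plasmid actually present in the deposit,
# the antibiotic resistance it carries, and an E. coli strain named in the text.
Deposit = namedtuple("Deposit", ["plasmid", "selection", "host_strain"])

DEPOSITS = {
    "Lambda Zap":    Deposit("pBluescript (pBS)", "ampicillin", "XL-1 Blue"),
    "Uni-Zap XR":    Deposit("pBluescript (pBS)", "ampicillin", "XL-1 Blue"),
    "Zap Express":   Deposit("pBK",               "neomycin",   "XL-1 Blue"),
    "lafmid BA":     Deposit("plafmid BA",        "ampicillin", "XL-1 Blue"),
    "pSport1":       Deposit("pSport1",           "ampicillin", "DH10B"),
    "pCMVSport 2.0": Deposit("pCMVSport 2.0",     "ampicillin", "DH10B"),
    "pCMVSport 3.0": Deposit("pCMVSport 3.0",     "ampicillin", "DH10B"),
    "pCR\u00ae2.1":  Deposit("pCR\u00ae2.1",      "ampicillin", "DH10B"),
}


def deposited_form(library_vector):
    """Return the Deposit record for a library vector name as listed above."""
    try:
        return DEPOSITS[library_vector]
    except KeyError:
        raise KeyError(
            f"no deposited plasmid recorded for vector {library_vector!r}"
        ) from None


if __name__ == "__main__":
    print(deposited_form("Zap Express"))
```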

The deposited material in the sample assigned the ATCC Deposit Number cited in 
Table XIII for any given cDNA clone also may contain one or more additional plasmids, 
each comprising a cDNA clone different from that given clone. Thus, deposits sharing the 
same ATCC Deposit Number contain at least a plasmid for each cDNA clone identified in 
Table XIII. Typically, each ATCC deposit sample cited in Table XIII comprises a mixture 
of approximately equal amounts (by weight) of about 50 plasmid DNAs, each containing a 
different cDNA clone; but such a deposit sample may include plasmids for more or fewer 
than 50 cDNA clones, up to about 500 cDNA clones. 

Two approaches can be used to isolate a particular clone from the deposited sample 
of plasmid DNAs cited for that clone in Table XIII. First, a plasmid is directly isolated by 
screening the clones using a polynucleotide probe corresponding to SEQ ID NO:X. 

Particularly, a specific polynucleotide with 30-40 nucleotides is synthesized using 
an Applied Biosystems DNA synthesizer according to the sequence reported. The 
oligonucleotide is labeled, for instance, with 32P-γ-ATP using T4 polynucleotide kinase and 
purified according to routine methods. (E.g., Maniatis et al., Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Press, Cold Spring Harbor, NY (1982).) The plasmid 
mixture is transformed into a suitable host, as indicated above (such as XL-1 Blue 
(Stratagene)) using techniques known to those of skill in the art, such as those provided by 
the vector supplier or in related publications or patents cited above. The transformants are 
plated on 1.5% agar plates (containing the appropriate selection agent, e.g., ampicillin) to a 
density of about 150 transformants (colonies) per plate. These plates are screened using 
Nylon membranes according to routine methods for bacterial colony screening (e.g., 
Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd Edit., (1989), Cold Spring 
Harbor Laboratory Press, pages 1.93 to 1.104), or other techniques known to those of skill 
in the art. 
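
A rough feel for how many colonies need to be screened can be obtained from a simple probability estimate. The Python sketch below assumes, purely for illustration, that every plasmid in the deposited mixture is equally likely to give rise to a transformant; this is an idealization, since the mixture is described as approximately equal by weight rather than by number of molecules. The function names are hypothetical.

```python
import math


def prob_clone_present(n_clones, colonies):
    """Probability that at least one of `colonies` transformants carries a
    given clone, assuming each of `n_clones` plasmids is equally represented."""
    return 1.0 - (1.0 - 1.0 / n_clones) ** colonies


def colonies_needed(n_clones, confidence):
    """Smallest number of colonies giving at least `confidence` probability
    of including the desired clone, under the same equal-representation
    assumption."""
    return math.ceil(math.log(1.0 - confidence) / math.log(1.0 - 1.0 / n_clones))


if __name__ == "__main__":
    # One plate at about 150 colonies, from a mixture of about 50 plasmids.
    print(round(prob_clone_present(50, 150), 3))
    # Colonies needed for a 99% chance with a mixture of about 500 plasmids.
    print(colonies_needed(500, 0.99))
```

Under these assumptions a single plate of about 150 colonies from a 50-plasmid mixture has a probability of about 0.95 of containing the desired clone, so larger mixtures call for additional plates.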

Alternatively, two primers of 17-20 nucleotides derived from both ends of SEQ 
ID NO:X (i.e., within the region of SEQ ID NO:X bounded by the 5' NT and the 3' NT 
of the clone defined in Table XIII) are synthesized and used to amplify the desired cDNA 
using the deposited cDNA plasmid as a template. The polymerase chain reaction is carried 
out under routine conditions, for instance, in 25 µl of reaction mixture with 0.5 µg of the 
above cDNA template. A convenient reaction mixture is 1.5-5 mM MgCl2, 0.01% (w/v) 
gelatin, 20 µM each of dATP, dCTP, dGTP, dTTP, 25 pmol of each primer and 0.25 Unit 
of Taq polymerase. Thirty-five cycles of PCR (denaturation at 94°C for 1 min; 
annealing at 55°C for 1 min; elongation at 72°C for 1 min) are performed 
with a Perkin-Elmer Cetus automated thermal cycler. The amplified product is analyzed 
by agarose gel electrophoresis and the DNA band with expected molecular weight is 
excised and purified. The PCR product is verified to be the selected sequence by 
subcloning and sequencing the DNA product. 
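
As an illustration of the primer selection described above, the following minimal sketch derives a forward and a reverse primer of 17-20 nucleotides from the two ends of a region of interest. The function names and the example sequence are hypothetical and are not taken from SEQ ID NO:X; practical primer design would also consider melting temperature, GC content, and secondary structure, which are not modeled here.

```python
# Sketch only: pick end primers from a DNA region given 5' to 3' on the sense strand.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def end_primers(region, length=20):
    """Return (forward, reverse) primers taken from both ends of the region.

    The forward primer is the first `length` bases of the sense strand.
    The reverse primer is the reverse complement of the last `length` bases,
    so that both primers are written 5' to 3'.
    """
    if not 17 <= length <= 20:
        raise ValueError("primer length should be 17-20 nucleotides")
    if len(region) < 2 * length:
        raise ValueError("region is too short for two non-overlapping primers")
    region = region.upper()
    return region[:length], reverse_complement(region[-length:])


if __name__ == "__main__":
    # Hypothetical 80-nt region used purely for demonstration.
    demo = "ATGGCCAAGCTTGGATCCGT" + "ACGT" * 10 + "TTGACCGGTAAGCTTCCATG"
    fwd, rev = end_primers(demo, 20)
    print("forward:", fwd)
    print("reverse:", rev)
```
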

Several methods are available for the identification of the 5' or 3' non-coding 
portions of a gene which may not be present in the deposited clone. These methods include 
but are not limited to, filter probing, clone enrichment using specific probes, and protocols 
similar or identical to 5' and 3' "RACE" protocols which are well known in the art. For 
instance, a method similar to 5' RACE is available for generating the missing 5' end of a 
desired full-length transcript. (Fromont-Racine et al., Nucleic Acids Res. 21(7):1683-1684
(1993).) 

Briefly, a specific RNA oligonucleotide is ligated to the 5' ends of a population of 
RNA presumably containing full-length gene RNA transcripts. A primer set containing a 
primer specific to the ligated RNA oligonucleotide and a primer specific to a known 
sequence of the gene of interest is used to PCR amplify the 5' portion of the desired full- 
length gene. This amplified product may then be sequenced and used to generate the full 
length gene.

The above method starts with total RNA isolated from the desired source, although
poly-A+ RNA can be used. The RNA preparation can then be treated with phosphatase if 
necessary to eliminate 5' phosphate groups on degraded or damaged RNA which may 
interfere with the later RNA ligase step. The phosphatase should then be inactivated and 
the RNA treated with tobacco acid pyrophosphatase in order to remove the cap structure
present at the 5' ends of messenger RNAs. This reaction leaves a 5' phosphate group at
the 5' end of the cap-cleaved RNA which can then be ligated to an RNA oligonucleotide
using T4 RNA ligase. 

This modified RNA preparation is used as a template for first strand cDNA 
synthesis using a gene specific oligonucleotide. The first strand synthesis reaction is used
as a template for PCR amplification of the desired 5' end using a primer specific to the
ligated RNA oligonucleotide and a primer specific to the known sequence of the gene of
interest. The resultant product is then sequenced and analyzed to confirm that the 5' end 
sequence belongs to the desired gene. 

Example 2: Isolation of Genomic Clones Corresponding to a Polynucleotide

A human genomic P1 library (Genomic Systems, Inc.) is screened by PCR using
primers selected for the cDNA sequence corresponding to SEQ ID NO:X, according to the
method described in Example 1. (See also, Sambrook.)

Example 3: Tissue Distribution of Polypeptide

Tissue distribution of mRNA expression of polynucleotides of the present 
invention is determined using protocols for Northern blot analysis, described by, among 
others, Sambrook et al. For example, a cDNA probe produced by the method described in 
Example 1 is labeled with 32P using the rediprime™ DNA labeling system (Amersham
Life Science), according to manufacturer's instructions. After labeling, the probe is 
purified using a CHROMA SPIN-100™ column (Clontech Laboratories, Inc.), according to
manufacturer's protocol number PT1200-1. The purified labeled probe is then used to
examine various human tissues for mRNA expression. 

Multiple Tissue Northern (MTN) blots containing various human tissues (H) or 
human immune system tissues (IM) (Clontech) are examined with the labeled probe using 
ExpressHyb™ hybridization solution (Clontech) according to manufacturer's protocol 
number PT1190-1. Following hybridization and washing, the blots are mounted and
exposed to film at -70 degree C overnight, and the films developed according to standard 
procedures. 

Example 4: Chromosomal Mapping of the Polynucleotides

An oligonucleotide primer set is designed according to the sequence at the 5' end of 
SEQ ID NO:X. This primer set preferably spans about 100 nucleotides. This primer set is
then used in a polymerase chain reaction under the following set of conditions: 30
seconds, 95 degree C; 1 minute, 56 degree C; 1 minute, 70 degree C. This cycle is repeated
32 times followed by one 5 minute cycle at 70 degree C. Human, mouse, and hamster
DNA is used as template in addition to a somatic cell hybrid panel containing individual
chromosomes or chromosome fragments (Bios, Inc). The reactions are analyzed on either
8% polyacrylamide gels or 3.5% agarose gels. Chromosome mapping is determined by
the presence of an approximately 100 bp PCR fragment in the particular somatic cell 
hybrid. 
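
The assignment step described above can be sketched as a simple concordance test: a chromosome is a candidate if the approximately 100 bp fragment is present in every hybrid that retains that chromosome and absent from every hybrid that lacks it. The panel composition and hybrid names below are hypothetical and do not describe any commercial panel; real panels may require tolerance for discordant hybrids.

```python
# Sketch only: assign a chromosome from PCR results on a somatic cell hybrid panel.

def map_chromosome(panel, pcr_positive):
    """Return chromosomes whose presence matches the PCR result in every hybrid.

    panel        -- dict mapping hybrid name to the set of human chromosomes it retains
    pcr_positive -- set of hybrid names that yielded the expected PCR fragment
    """
    chromosomes = set().union(*panel.values())
    hits = []
    for chrom in sorted(chromosomes):
        concordant = all(
            (chrom in retained) == (name in pcr_positive)
            for name, retained in panel.items()
        )
        if concordant:
            hits.append(chrom)
    return hits


if __name__ == "__main__":
    # Hypothetical four-hybrid panel and hypothetical PCR outcome.
    demo_panel = {
        "H1": {"1", "17"},
        "H2": {"17", "X"},
        "H3": {"1", "X"},
        "H4": {"2"},
    }
    print(map_chromosome(demo_panel, {"H1", "H2"}))
```
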

Example 5: Bacterial Expression of a Polypeptide

A polynucleotide encoding a polypeptide of the present invention is amplified 
using PCR oligonucleotide primers corresponding to the 5' and 3' ends of the DNA 
sequence, as outlined in Example 1, to synthesize insertion fragments. The primers used to 
amplify the cDNA insert should preferably contain restriction sites, such as BamHI and
XbaI, at the 5' end of the primers in order to clone the amplified product into the
expression vector. For example, BamHI and XbaI correspond to the restriction enzyme
sites on the bacterial expression vector pQE-9 (Qiagen, Inc., Chatsworth, CA). This
plasmid vector encodes antibiotic resistance (Ampr), a bacterial origin of replication (ori),
an IPTG-regulatable promoter/operator (P/O), a ribosome binding site (RBS), a 6-histidine
tag (6-His), and restriction enzyme cloning sites.

The pQE-9 vector is digested with BamHI and XbaI and the amplified fragment is
ligated into the pQE-9 vector maintaining the reading frame initiated at the bacterial RBS.
The ligation mixture is then used to transform the E. coli strain M15/rep4 (Qiagen, Inc.)
which contains multiple copies of the plasmid pREP4, which expresses the lacI repressor
and also confers kanamycin resistance (Kanr). Transformants are identified by their ability
to grow on LB plates and ampicillin/kanamycin resistant colonies are selected. Plasmid 
DNA is isolated and confirmed by restriction analysis. 

Clones containing the desired constructs are grown overnight (O/N) in liquid 
culture in LB media supplemented with both Amp (100 ug/ml) and Kan (25 ug/ml). The 
O/N culture is used to inoculate a large culture at a ratio of 1:100 to 1:250. The cells are
grown to an optical density at 600 nm (O.D.600) of between 0.4 and 0.6. IPTG (isopropyl-β-D-
thiogalactopyranoside) is then added to a final concentration of 1 mM. IPTG induces by
inactivating the lacI repressor, clearing the P/O and leading to increased gene expression.
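
The volumes implied by the inoculation ratio and the IPTG addition above follow from simple dilution arithmetic (C1 x V1 = C2 x V2). The sketch below works through it; the 1 liter culture volume and the 1 M IPTG stock concentration are assumptions chosen for illustration and are not specified in this example.

```python
# Sketch only: dilution arithmetic for inoculation and IPTG induction.

def inoculum_volume_ml(culture_ml, dilution):
    """Volume of overnight culture needed for a 1:dilution inoculation."""
    return culture_ml / dilution


def stock_volume_ml(final_mM, culture_ml, stock_mM):
    """Stock volume giving the desired final concentration (C1*V1 = C2*V2).

    The small volume contributed by the stock itself is ignored.
    """
    return final_mM * culture_ml / stock_mM


def ready_to_induce(od600):
    """True when the culture is within the O.D.600 window of 0.4 to 0.6."""
    return 0.4 <= od600 <= 0.6


if __name__ == "__main__":
    culture_ml = 1000.0   # assumed culture volume
    iptg_stock_mM = 1000.0  # assumed 1 M IPTG stock
    print("O/N inoculum at 1:100:", inoculum_volume_ml(culture_ml, 100), "ml")
    print("O/N inoculum at 1:250:", inoculum_volume_ml(culture_ml, 250), "ml")
    print("IPTG stock for 1 mM:", stock_volume_ml(1.0, culture_ml, iptg_stock_mM), "ml")
```
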

Cells are grown for an extra 3 to 4 hours. Cells are then harvested by 
centrifugation (20 mins at 6000 x g). The cell pellet is solubilized in the chaotropic agent 6
Molar Guanidine HCl by stirring for 3-4 hours at 4 degree C. The cell debris is removed
by centrifugation, and the supernatant containing the polypeptide is loaded onto a nickel- 
nitrilo-tri-acetic acid ("Ni-NTA") affinity resin column (available from QIAGEN, Inc., 
supra). Proteins with a 6 x His tag bind to the Ni-NTA resin with high affinity and can be 
purified in a simple one-step procedure (for details see: The QIAexpressionist (1995) 
QIAGEN, Inc., supra). 

Briefly, the supernatant is loaded onto the column in 6 M guanidine-HCl, pH 8, the
column is first washed with 10 volumes of 6 M guanidine-HCl, pH 8, then washed with 10
volumes of 6 M guanidine-HCl, pH 6, and finally the polypeptide is eluted with 6 M
guanidine-HCl, pH 5.

The purified protein is then renatured by dialyzing it against phosphate-buffered
saline (PBS) or 50 mM Na-acetate, pH 6 buffer plus 200 mM NaCl. Alternatively, the 
protein can be successfully refolded while immobilized on the Ni-NTA column. The 
recommended conditions are as follows: renature using a linear 6M-1M urea gradient in 
500 mM NaCl, 20% glycerol, 20 mM Tris/HCl pH 7.4, containing protease inhibitors. The 
renaturation should be performed over a period of 1.5 hours or more. After renaturation 
the proteins are eluted by the addition of 250 mM imidazole. Imidazole is removed by
a final dialyzing step against PBS or 50 mM sodium acetate pH 6 buffer plus 200 mM 
NaCl. The purified protein is stored at 4 degree C or frozen at -80 degree C. 

In addition to the above expression vector, the present invention further includes an 
expression vector comprising phage operator and promoter elements operatively linked to 
a polynucleotide of the present invention, called pHE4a. (ATCC Accession Number 
209645, deposited on February 25, 1998.) This vector contains: 1) a
neomycinphosphotransferase gene as a selection marker, 2) an E. coli origin of replication,
3) a T5 phage promoter sequence, 4) two lac operator sequences, 5) a Shine-Dalgarno
sequence, and 6) the lactose operon repressor gene (lacIq). The origin of replication (oriC)
is derived from pUC19 (LTI, Gaithersburg, MD). The promoter sequence and operator 
sequences are made synthetically. 

DNA can be inserted into the pHE4a by restricting the vector with NdeI and XbaI,
BamHI, XhoI, or Asp718, running the restricted product on a gel, and isolating the larger
fragment (the stuffer fragment should be about 310 base pairs). The DNA insert is
generated according to the PCR protocol described in Example 1, using PCR primers
having restriction sites for NdeI (5' primer) and XbaI, BamHI, XhoI, or Asp718 (3'
primer). The PCR insert is gel purified and restricted with compatible enzymes. The
insert and vector are ligated according to standard protocols.

The engineered vector could easily be substituted in the above protocol to express
protein in a bacterial system.


Example 6: Purification of a Polypeptide from an Inclusion Body

The following alternative method can be used to purify a polypeptide expressed in 
E. coli when it is present in the form of inclusion bodies. Unless otherwise specified, all of
the following steps are conducted at 4-10 degree C. 

Upon completion of the production phase of the E. coli fermentation, the cell 
culture is cooled to 4-10 degree C and the cells harvested by continuous centrifugation at 
15,000 rpm (Heraeus Sepatech). On the basis of the expected yield of protein per unit 
weight of cell paste and the amount of purified protein required, an appropriate amount of 
cell paste, by weight, is suspended in a buffer solution containing 100 mM Tris, 50 mM 
EDTA, pH 7.4. The cells are dispersed to a homogeneous suspension using a high shear
mixer. 

The cells are then lysed by passing the solution through a microfluidizer 
(Microfluidics Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogenate is
then mixed with NaCl solution to a final concentration of 0.5 M NaCl, followed by 
centrifugation at 7000 xg for 15 min. The resultant pellet is washed again using 0.5M 
NaCl, 100 mM Tris, 50 mM EDTA, pH 7.4. 

The resulting washed inclusion bodies are solubilized with 1.5 M guanidine
hydrochloride (GuHCl) for 2-4 hours. After 7000 xg centrifugation for 15 min., the pellet
is discarded and the polypeptide containing supernatant is incubated at 4 degree C
overnight to allow further GuHCl extraction.

Following high speed centrifugation (30,000 xg) to remove insoluble particles, the
GuHCl solubilized protein is refolded by quickly mixing the GuHCl extract with 20
volumes of buffer containing 50 mM sodium, pH 4.5, 150 mM NaCl, 2 mM EDTA by
vigorous stirring. The refolded diluted protein solution is kept at 4 degree C without
mixing for 12 hours prior to further purification steps.

To clarify the refolded polypeptide solution, a previously prepared tangential
filtration unit equipped with 0.16 um membrane filter with appropriate surface area (e.g., 
Filtron), equilibrated with 40 mM sodium acetate, pH 6.0 is employed. The filtered 
sample is loaded onto a cation exchange resin (e.g., Poros HS-50, Perseptive Biosystems). 
The column is washed with 40 mM sodium acetate, pH 6.0 and eluted with 250 mM, 500
mM, 1000 mM, and 1500 mM NaCl in the same buffer, in a stepwise manner. The 
absorbance at 280 nm of the effluent is continuously monitored. Fractions are collected 
and further analyzed by SDS-PAGE. 

Fractions containing the polypeptide are then pooled and mixed with 4 volumes of 
water. The diluted sample is then loaded onto a previously prepared set of tandem 
columns of strong anion (Poros HQ-50, Perseptive Biosystems) and weak anion (Poros 
CM-20, Perseptive Biosystems) exchange resins. The columns are equilibrated with 40 
mM sodium acetate, pH 6.0. Both columns are washed with 40 mM sodium acetate, pH 
6.0, 200 mM NaCl. The CM-20 column is then eluted using a 10 column volume linear 
gradient ranging from 0.2 M NaCl, 50 mM sodium acetate, pH 6.0 to 1.0 M NaCl, 50 mM 
sodium acetate, pH 6.5. Fractions are collected under constant A280 monitoring of the
effluent. Fractions containing the polypeptide (determined, for instance, by 16% SDS- 
PAGE) are then pooled. 

The resultant polypeptide should exhibit greater than 95% purity after the above 
refolding and purification steps. No major contaminant bands should be observed from 
Coomassie blue stained 16% SDS-PAGE gel when 5 ug of purified protein is loaded. The
purified protein can also be tested for endotoxin/LPS contamination, and typically the LPS 
content is less than 0.1 ng/ml according to LAL assays. 

Example 7: Cloning and Expression of a Polypeptide in a Baculovirus Expression
System

In this example, the plasmid shuttle vector pA2 is used to insert a polynucleotide 
into a baculovirus to express a polypeptide. This expression vector contains the strong
polyhedrin promoter of the Autographa californica nuclear polyhedrosis virus (AcMNPV)
followed by convenient restriction sites such as BamHI, Xba I and Asp718. The
polyadenylation site of the simian virus 40 ("SV40") is used for efficient polyadenylation. 
For easy selection of recombinant virus, the plasmid contains the beta-galactosidase gene 
from E. coli under control of a weak Drosophila promoter in the same orientation, 
followed by the polyadenylation signal of the polyhedrin gene. The inserted genes are 
flanked on both sides by viral sequences for cell-mediated homologous recombination with 
wild-type viral DNA to generate a viable virus that expresses the cloned polynucleotide.

Many other baculovirus vectors can be used in place of the vector above, such as 
pAc373, pVL941, and pAcIM1, as one skilled in the art would readily appreciate, as long
as the construct provides appropriately located signals for transcription, translation, 
secretion and the like, including a signal peptide and an in-frame AUG as required. Such 
vectors are described, for instance, in Luckow et al., Virology 170:31-39 (1989).

Specifically, the cDNA sequence contained in the deposited clone, including the 
AUG initiation codon and the naturally associated leader sequence identified in Table XIII, 
is amplified using the PCR protocol described in Example 1. If the naturally occurring 
signal sequence is used to produce the secreted protein, the pA2 vector does not need a 
second signal peptide. Alternatively, the vector can be modified (pA2 GP) to include a 
baculovirus leader sequence, using the standard methods described in Summers et al., "A 
Manual of Methods for Baculovirus Vectors and Insect Cell Culture Procedures," Texas 
Agricultural Experimental Station Bulletin No. 1555 (1987). 

The amplified fragment is isolated from a 1% agarose gel using a commercially 
available kit ("Geneclean," BIO 101 Inc., La Jolla, Ca.). The fragment then is digested 
with appropriate restriction enzymes and again purified on a 1% agarose gel. 

The plasmid is digested with the corresponding restriction enzymes and optionally, 
can be dephosphorylated using calf intestinal phosphatase, using routine procedures known 
in the art. The DNA is then isolated from a 1% agarose gel using a commercially available 
kit ("Geneclean" BIO 101 Inc., La Jolla, Ca.).

The fragment and the dephosphorylated plasmid are ligated together with T4 DNA
ligase. E. coli HB101 or other suitable E. coli hosts such as XL-1 Blue (Stratagene 
Cloning Systems, La Jolla, CA) cells are transformed with the ligation mixture and spread 
on culture plates. Bacteria containing the plasmid are identified by digesting DNA from 
individual colonies and analyzing the digestion product by gel electrophoresis. The 
sequence of the cloned fragment is confirmed by DNA sequencing. 

Five ug of a plasmid containing the polynucleotide is co-transfected with 1.0 ug of 
a commercially available linearized baculovirus DNA ("BaculoGold™ baculovirus DNA",
Pharmingen, San Diego, CA), using the lipofection method described by Felgner et al.,
Proc. Natl. Acad. Sci. USA 84:7413-7417 (1987). One ug of BaculoGold™ virus DNA
and 5 ug of the plasmid are mixed in a sterile well of a microtiter plate containing 50 ul of 
serum-free Grace's medium (Life Technologies Inc., Gaithersburg, MD). Afterwards, 10 
ul Lipofectin plus 90 ul Grace's medium are added, mixed and incubated for 15 minutes at 
room temperature. Then the transfection mixture is added drop-wise to Sf9 insect cells 
(ATCC CRL 1711) seeded in a 35 mm tissue culture plate with 1 ml Grace's medium 
without serum. The plate is then incubated for 5 hours at 27 degrees C. The transfection 
solution is then removed from the plate and 1 ml of Grace's insect medium supplemented 
with 10% fetal calf serum is added. Cultivation is then continued at 27 degrees C for four 
days. 

After four days the supernatant is collected and a plaque assay is performed, as 
described by Summers and Smith, supra. An agarose gel with "Blue Gal" (Life 
Technologies Inc., Gaithersburg) is used to allow easy identification and isolation of gal- 
expressing clones, which produce blue-stained plaques. (A detailed description of a 
"plaque assay" of this type can also be found in the user's guide for insect cell culture and 
baculovirology distributed by Life Technologies Inc., Gaithersburg, page 9-10.) After 
appropriate incubation, blue stained plaques are picked with the tip of a micropipettor (e.g., 
Eppendorf). The agar containing the recombinant viruses is then resuspended in a 
microcentrifuge tube containing 200 ul of Grace's medium and the suspension containing
the recombinant baculovirus is used to infect Sf9 cells seeded in 35 mm dishes. Four days
later the supernatants of these culture dishes are harvested and then they are stored at 4 
degree C. 

To verify the expression of the polypeptide, Sf9 cells are grown in Grace's medium 
supplemented with 10% heat-inactivated FBS. The cells are infected with the recombinant 
baculovirus containing the polynucleotide at a multiplicity of infection ("MOI") of about 2. 
If radiolabeled proteins are desired, 6 hours later the medium is removed and is replaced 
with SF900 II medium minus methionine and cysteine (available from Life Technologies 
Inc., Rockville, MD). After 42 hours, 5 uCi of 35 S-methionine and 5 uCi 35 S-cysteine 
(available from Amersham) are added. The cells are further incubated for 16 hours and 
then are harvested by centrifugation. The proteins in the supernatant as well as the 
intracellular proteins are analyzed by SDS-PAGE followed by autoradiography (if 
radiolabeled). 
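
The volume of virus stock needed to reach a given multiplicity of infection follows from MOI = (titer x volume) / number of cells. The sketch below applies this relation; the cell count and the virus titer are hypothetical values chosen for illustration, since neither is specified in this example.

```python
# Sketch only: virus stock volume for a target multiplicity of infection (MOI).

def virus_volume_ml(moi, n_cells, titer_pfu_per_ml):
    """Volume of virus stock (ml) that delivers `moi` plaque-forming units per cell."""
    if titer_pfu_per_ml <= 0:
        raise ValueError("titer must be positive")
    return moi * n_cells / titer_pfu_per_ml


if __name__ == "__main__":
    # Hypothetical: 1e6 Sf9 cells, stock titer of 1e8 pfu/ml, MOI of about 2.
    print(virus_volume_ml(2, 1e6, 1e8), "ml of virus stock")
```
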

Microsequencing of the amino acid sequence of the amino terminus of purified 
protein may be used to determine the amino terminal sequence of the produced protein. 


Example 8: Expression of a Polypeptide in Mammalian Cells

The polypeptide of the present invention can be expressed in a mammalian cell. A 
typical mammalian expression vector contains a promoter element, which mediates the 
initiation of transcription of mRNA, a protein coding sequence, and signals required for the 
termination of transcription and polyadenylation of the transcript. Additional elements 
include enhancers, Kozak sequences and intervening sequences flanked by donor and 
acceptor sites for RNA splicing. Highly efficient transcription is achieved with the early 
and late promoters from SV40, the long terminal repeats (LTRs) from Retroviruses, e.g., 
RSV, HTLVI, HIVI and the early promoter of the cytomegalovirus (CMV). However,
cellular elements can also be used (e.g., the human actin promoter).

Suitable expression vectors for use in practicing the present invention include, for
example, vectors such as pSVL and pMSG (Pharmacia, Uppsala, Sweden), pRSVcat 
(ATCC 37152), pSV2dhfr (ATCC 37146), pBC12MI (ATCC 67109), pCMVSport 2.0, 
and pCMVSport 3.0. Mammalian host cells that could be used include human HeLa, 293,
H9 and Jurkat cells, mouse NIH3T3 and C127 cells, Cos 1, Cos 7 and CV1, quail QC1-3 
cells, mouse L cells and Chinese hamster ovary (CHO) cells. 

Alternatively, the polypeptide can be expressed in stable cell lines containing the 
polynucleotide integrated into a chromosome. The co-transfection with a selectable 
marker such as dhfr, gpt, neomycin, hygromycin allows the identification and isolation of 
the transfected cells. 

The transfected gene can also be amplified to express large amounts of the encoded 
protein. The DHFR (dihydrofolate reductase) marker is useful in developing cell lines that 
carry several hundred or even several thousand copies of the gene of interest. (See, e.g.,
Alt, F. W., et al., J. Biol. Chem. 253:1357-1370 (1978); Hamlin, J. L. and Ma, C.,
Biochem. et Biophys. Acta, 1097:107-143 (1990); Page, M. J. and Sydenham, M. A., 
Biotechnology 9:64-68 (1991).) Another useful selection marker is the enzyme glutamine 
synthase (GS) (Murphy et al., Biochem J. 227:277-279 (1991); Bebbington et al., 
Bio/Technology 10:169-175 (1992)). Using these markers, the mammalian cells are grown
in selective medium and the cells with the highest resistance are selected. These cell lines 
contain the amplified gene(s) integrated into a chromosome. Chinese hamster ovary 
(CHO) and NSO cells are often used for the production of proteins. 

Derivatives of the plasmid pSV2-dhfr (ATCC Accession No. 37146), the 
expression vectors pC4 (ATCC Accession No. 209646) and pC6 (ATCC Accession 
No. 209647) contain the strong promoter (LTR) of the Rous Sarcoma Virus (Cullen et al.,
Molecular and Cellular Biology, 438-447 (March, 1985)) plus a fragment of the CMV- 
enhancer (Boshart et al., Cell 41:521-530 (1985).) Multiple cloning sites, e.g., with the 
restriction enzyme cleavage sites BamHI, Xbal and Asp718, facilitate the cloning of the 
gene of interest. The vectors also contain the 3' intron, the polyadenylation and
termination signal of the rat preproinsulin gene, and the mouse DHFR gene under control
of the SV40 early promoter.

Specifically, the plasmid pC6, for example, is digested with appropriate restriction
enzymes and then dephosphorylated using calf intestinal phosphatase by procedures known
in the art. The vector is then isolated from a 1% agarose gel.

A polynucleotide of the present invention is amplified according to the protocol
outlined in Example 1. If the naturally occurring signal sequence is used to produce the
secreted protein, the vector does not need a second signal peptide. Alternatively, if the
naturally occurring signal sequence is not used, the vector can be modified to include a
heterologous signal sequence. (See, e.g., WO 96/34891.)

The amplified fragment is isolated from a 1% agarose gel using a commercially 
available kit ("Geneclean," BIO 101 Inc., La Jolla, Ca.). The fragment then is digested 
with appropriate restriction enzymes and again purified on a 1% agarose gel. 

The amplified fragment is then digested with the same restriction enzyme and 
purified on a 1% agarose gel. The isolated fragment and the dephosphorylated vector are 
then ligated with T4 DNA ligase. E. coli HB101 or XL-1 Blue cells are then transformed 
and bacteria are identified that contain the fragment inserted into plasmid pC6 using, for 
instance, restriction enzyme analysis. 

Chinese hamster ovary cells lacking an active DHFR gene are used for transfection.
Five ug of the expression plasmid pC6 or pC4 is cotransfected with 0.5 ug of the plasmid
pSVneo using lipofectin (Felgner et al., supra). The plasmid pSV2-neo contains a
dominant selectable marker, the neo gene from Tn5 encoding an enzyme that confers
resistance to a group of antibiotics including G418. The cells are seeded in alpha minus
MEM supplemented with 1 mg/ml G418. After 2 days, the cells are trypsinized and
seeded in hybridoma cloning plates (Greiner, Germany) in alpha minus MEM
supplemented with 10, 25, or 50 ng/ml of methotrexate plus 1 mg/ml G418. After about
10-14 days single clones are trypsinized and then seeded in 6-well petri dishes or 10 ml
flasks using different concentrations of methotrexate (50 nM, 100 nM, 200 nM, 400 nM,
800 nM). Clones growing at the highest concentrations of methotrexate are then
transferred to new 6-well plates containing even higher concentrations of methotrexate (1
uM, 2 uM, 5 uM, 10 uM, 20 uM). The same procedure is repeated until clones are
obtained which grow at a concentration of 100 - 200 uM. Expression of the desired gene
product is analyzed, for instance, by SDS-PAGE and Western blot or by reversed phase 
HPLC analysis. 


Example 9: Protein Fusions 

The polypeptides of the present invention are preferably fused to other proteins. 
These fusion proteins can be used for a variety of applications. For example, fusion of the 
present polypeptides to His-tag, HA-tag, protein A, IgG domains, and maltose binding
protein facilitates purification. (See Example 5; see also EP A 394,827; Traunecker, et al.,
Nature 331:84-86 (1988).) Similarly, fusion to IgG-1, IgG-3, and albumin increases the
half-life in vivo. Nuclear localization signals fused to the polypeptides of the present
invention can target the protein to a specific subcellular localization, while covalent
heterodimers or homodimers can increase or decrease the activity of a fusion protein.
Fusion proteins can also create chimeric molecules having more than one function. 
Finally, fusion proteins can increase solubility and/or stability of the fused protein 
compared to the non-fused protein. All of the types of fusion proteins described above can 
be made by modifying the following protocol, which outlines the fusion of a polypeptide to 
an IgG molecule, or the protocol described in Example 5. 

Briefly, the human Fc portion of the IgG molecule can be PCR amplified, using 
primers that span the 5' and 3' ends of the sequence described below. These primers also 
should have convenient restriction enzyme sites that will facilitate cloning into an 
expression vector, preferably a mammalian expression vector. 

For example, if pC4 (Accession No. 209646) is used, the human Fc portion can be 
ligated into the BamHI cloning site. Note that the 3' BamHI site should be destroyed. 
Next, the vector containing the human Fc portion is re-restricted with BamHI, linearizing
the vector, and a polynucleotide of the present invention, isolated by the PCR protocol
described in Example 1, is ligated into this BamHI site. Note that the polynucleotide is 
cloned without a stop codon, otherwise a fusion protein will not be produced. 
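
The requirement that the insert carry no stop codon can be checked before cloning. The following minimal sketch tests whether a candidate open reading frame begins with ATG, consists of whole codons, and contains no in-frame stop codon, and it removes a terminal stop codon if one is present. The function names and the short example sequence are hypothetical; the sketch does not verify that the junction created at the BamHI site keeps the insert in frame with the Fc sequence, which must be checked separately.

```python
# Sketch only: confirm that an ORF is suitable for a C-terminal fusion.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def codons(seq):
    """Split a sequence into consecutive in-frame codons, dropping any remainder."""
    seq = seq.upper()
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


def strip_terminal_stop(orf):
    """Remove the final codon if the ORF is whole codons and ends in a stop codon."""
    orf = orf.upper()
    if len(orf) % 3 == 0 and orf[-3:] in STOP_CODONS:
        return orf[:-3]
    return orf


def fusion_ready(orf):
    """True if the ORF starts with ATG, is whole codons, and has no in-frame stop."""
    orf = orf.upper()
    if len(orf) % 3 != 0 or not orf.startswith("ATG"):
        return False
    return not any(codon in STOP_CODONS for codon in codons(orf))


if __name__ == "__main__":
    demo = "ATGGCCAAATGA"  # hypothetical ORF ending in the stop codon TGA
    print(fusion_ready(demo))
    print(fusion_ready(strip_terminal_stop(demo)))
```
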

If the naturally occurring signal sequence is used to produce the secreted protein, 
pC4 does not need a second signal peptide. Alternatively, if the naturally occurring signal 
sequence is not used, the vector can be modified to include a heterologous signal sequence. 
(See, e.g., WO 96/34891.) 

Human IgG Fc region: 

GGGATCCGGAGCCCAAATCTTCTGACAAAACTCACACATGCCCACCGTGCCCA 

GCACCTGAATTCGAGGGTGCACCGTCAGTCTTCCTCTTCCCCCCAAAACCCAA 

GGACACCCTCATGATCTCCCGGACTCCTGAGGTCACATGCGTGGTGGTGGACG 

TAAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCGTGGA 

GGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTAC 

CGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGA 

GTACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAACCCCCATCGAGAAAACCA 

TCTCCAAAGCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCA 

TCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGG 

CTTCTATCCAAGCGACATCGCCGTGGAGTGGGAGAGCAATGGGCAGCCGGAG 

AACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCCTC 

TACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCT 

CATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTC 

TCCCTGTCTCCGGGTAAATGAGTGCGACGGCCGCGACTCTAGAGGAT (SEQ ID 
NO:1)

Example 10: Production of an Antibody from a Polypeptide

The antibodies of the present invention can be prepared by a variety of methods. 
(See, Current Protocols, Chapter 2.) As one example of such methods, cells expressing a
polypeptide of the present invention are administered to an animal to induce the production
of sera containing polyclonal antibodies. In a preferred method, a preparation of the 
secreted protein is prepared and purified to render it substantially free of natural 
contaminants. Such a preparation is then introduced into an animal in order to produce 
polyclonal antisera of greater specific activity. 

In the most preferred method, the antibodies of the present invention are 
monoclonal antibodies (or protein binding fragments thereof). Such monoclonal 
antibodies can be prepared using hybridoma technology. (Köhler et al., Nature 256:495
(1975); Köhler et al., Eur. J. Immunol. 6:511 (1976); Köhler et al., Eur. J. Immunol. 6:292
(1976); Hammerling et al., in: Monoclonal Antibodies and T-Cell Hybridomas, Elsevier,
N.Y., pp. 563-681 (1981).) In general, such procedures involve immunizing an animal
(preferably a mouse) with polypeptide or, more preferably, with a secreted polypeptide- 
expressing cell. Such cells may be cultured in any suitable tissue culture medium; 
however, it is preferable to culture cells in Earle's modified Eagle's medium supplemented 
with 10% fetal bovine serum (inactivated at about 56 degrees C), and supplemented with 
about 10 g/l of nonessential amino acids, about 1,000 U/ml of penicillin, and about 100
ug/ml of streptomycin. 

The splenocytes of such mice are extracted and fused with a suitable myeloma cell 
line. Any suitable myeloma cell line may be employed in accordance with the present 
invention; however, it is preferable to employ the parent myeloma cell line (SP20), 
available from the ATCC. After fusion, the resulting hybridoma cells are selectively 
maintained in HAT medium, and then cloned by limiting dilution as described by Wands et 
al. (Gastroenterology 80:225-232 (1981).) The hybridoma cells obtained through such a
selection are then assayed to identify clones which secrete antibodies capable of binding 
the polypeptide. 

Alternatively, additional antibodies capable of binding to the polypeptide can be 
produced in a two-step procedure using anti-idiotypic antibodies. Such a method makes
use of the fact that antibodies are themselves antigens, and therefore, it is possible to obtain
an antibody which binds to a second antibody. In accordance with this method, protein
specific antibodies are used to immunize an animal, preferably a mouse. The splenocytes 
of such an animal are then used to produce hybridoma cells, and the hybridoma cells are 
screened to identify clones which produce an antibody whose ability to bind to the protein-
specific antibody can be blocked by the polypeptide. Such antibodies comprise anti-
idiotype antibodies to the protein-specific antibody and can be used to immunize an
animal to induce formation of further protein-specific antibodies. 

It will be appreciated that Fab and F(ab')2 and other fragments of the antibodies of
the present invention may be used according to the methods disclosed herein. Such 
fragments are typically produced by proteolytic cleavage, using enzymes such as papain 
(to produce Fab fragments) or pepsin (to produce F(ab')2 fragments). Alternatively, 
secreted protein-binding fragments can be produced through the application of 
recombinant DNA technology or through synthetic chemistry. 

For in vivo use of antibodies in humans, it may be preferable to use "humanized" 
chimeric monoclonal antibodies. Such antibodies can be produced using genetic 
constructs derived from hybridoma cells producing the monoclonal antibodies described 
above. Methods for producing chimeric antibodies are known in the art. (See, for review, 
Morrison, Science 229:1202 (1985); Oi et al., BioTechniques 4:214 (1986); Cabilly et al., 
U.S. Patent No. 4,816,567; Taniguchi et al., EP 171496; Morrison et al., EP 173494;
Neuberger et al., WO 8601533; Robinson et al., WO 8702671; Boulianne et al., Nature 
312:643 (1984); Neuberger et al., Nature 314:268 (1985).) 

Example 11: Production Of Secreted Protein For High-Throughput Screening
Assays

The following protocol produces a supernatant containing a polypeptide to be 
tested. This supernatant can then be used in the Screening Assays described in Examples 
13-20.

First, dilute Poly-D-Lysine (644 587 Boehringer-Mannheim) stock solution
(1 mg/ml in PBS) 1:20 in PBS (w/o calcium or magnesium 17-516F Biowhittaker) for a
working solution of 50 ug/ml. Add 200 ul of this solution to each well (24 well plates) and
incubate at RT for 20 minutes. Be sure to distribute the solution over each well (note: a
12-channel pipetter may be used with tips on every other channel). Aspirate off the
Poly-D-Lysine solution and rinse with 1 ml PBS (Phosphate Buffered Saline). The PBS should
remain in the well until just prior to plating the cells, and plates may be poly-lysine coated
in advance for up to two weeks.
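
The dilution arithmetic in the coating step can be checked with a short calculation. The following Python sketch is illustrative only: it assumes that "1:20" means one part stock in twenty parts total (which reproduces the stated 50 ug/ml), it ignores pipetting dead volume, and the function names and the four-plate example are hypothetical.

```python
def dilute(stock_conc: float, dilution_factor: float) -> float:
    """Concentration after a 1:dilution_factor dilution (same units as the stock)."""
    return stock_conc / dilution_factor


STOCK_UG_PER_ML = 1000.0  # 1 mg/ml Poly-D-Lysine stock, expressed in ug/ml
DILUTION_FACTOR = 20      # 1:20 in PBS
WORKING_UG_PER_ML = dilute(STOCK_UG_PER_ML, DILUTION_FACTOR)

WELLS_PER_PLATE = 24      # 24-well plates, as in the protocol
COAT_UL_PER_WELL = 200    # 200 ul of working solution per well


def coating_volumes_ml(n_plates: int) -> tuple[float, float]:
    """Return (stock_ml, pbs_ml) needed to coat n_plates, with no dead volume."""
    total_ml = n_plates * WELLS_PER_PLATE * COAT_UL_PER_WELL / 1000.0
    stock_ml = total_ml / DILUTION_FACTOR
    return stock_ml, total_ml - stock_ml


if __name__ == "__main__":
    stock_ml, pbs_ml = coating_volumes_ml(4)  # four plates: a hypothetical batch size
    print(f"working concentration: {WORKING_UG_PER_ML} ug/ml")
    print(f"stock: {stock_ml:.2f} ml, PBS: {pbs_ml:.2f} ml")
```

In practice a margin would be added to these volumes to allow for losses in the reservoir and pipette tips.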

Plate 293T cells (do not carry cells past P+20) at 2 x 10^3 cells/well in 0.5 ml
DMEM (Dulbecco's Modified Eagle Medium) (with 4.5 G/L glucose and L-glutamine
(12-604F Biowhittaker))/10% heat inactivated FBS (14-503F Biowhittaker)/1x Penstrep
(17-602E Biowhittaker). Let the cells grow overnight.

The next day, mix together in a sterile solution basin: 300 ul Lipofectamine 
(18324-012 Gibco/BRL) and 5ml Optimem I (31985070 Gibco/BRL)/96-well plate. With 
a small volume multi-channel pipetter, aliquot approximately 2ug of an expression vector 
containing a polynucleotide insert, produced by the methods described in Examples 8 or 9, 
into an appropriately labeled 96-well round bottom plate. With a multi-channel pipetter, 
add 50ul of the Lipofectamine/Optimem I mixture to each well. Pipette up and down 
gently to mix. Incubate at RT 15-45 minutes. After about 20 minutes, use a multi-channel 
pipetter to add 150ul Optimem I to each well. As a control, one plate of vector DNA 
lacking an insert should be transfected with each set of transfections. 
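
The volumes in the transfection step can be tallied as follows. This Python sketch is a bookkeeping aid under stated assumptions, not part of the protocol: the small volume contributed by the DNA is ignored, and the variable names are hypothetical.

```python
# Volumes taken from the protocol text, in microliters.
LIPOFECTAMINE_UL = 300.0   # Lipofectamine per 96-well plate
OPTIMEM_UL = 5000.0        # 5 ml Optimem I per 96-well plate
MIX_PER_WELL_UL = 50.0     # Lipofectamine/Optimem I mixture added to each well
OPTIMEM_TOPUP_UL = 150.0   # Optimem I added to each well after about 20 minutes
WELLS = 96

master_mix_ul = LIPOFECTAMINE_UL + OPTIMEM_UL   # total mixture prepared
mix_used_ul = MIX_PER_WELL_UL * WELLS           # mixture dispensed into the plate
spare_ul = master_mix_ul - mix_used_ul          # excess left in the basin

# Lipofectamine delivered to each well, scaled by its share of the mixture.
lipofectamine_per_well_ul = MIX_PER_WELL_UL * LIPOFECTAMINE_UL / master_mix_ul

# Volume of complex per well after the top-up (DNA volume ignored: an assumption).
final_volume_per_well_ul = MIX_PER_WELL_UL + OPTIMEM_TOPUP_UL

if __name__ == "__main__":
    print(f"master mix: {master_mix_ul:.0f} ul, used: {mix_used_ul:.0f} ul, "
          f"spare: {spare_ul:.0f} ul")
    print(f"Lipofectamine per well: {lipofectamine_per_well_ul:.2f} ul")
    print(f"complex per well: {final_volume_per_well_ul:.0f} ul")
```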

Preferably, the transfection should be performed by tag-teaming the following 
tasks. By tag-teaming, hands on time is cut in half, and the cells do not spend too much 
time on PBS. First, person A aspirates off the media from four 24-well plates of cells, and 
then person B rinses each well with 0.5-1 ml PBS. Person A then aspirates off PBS rinse,
and person B, using a 12-channel pipetter with tips on every other channel, adds the 200 ul
of DNA/Lipofectamine/Optimem I complex to the odd wells first, then to the even wells, 
to each row on the 24-well plates. Incubate at 37 degrees C for 6 hours.

While cells are incubating, prepare appropriate media, either 1% BSA in DMEM
with 1x penstrep, or CHO-5 media (116.6 mg/L of CaCl2 (anhyd); 0.00130 mg/L CuSO4-5H2O;
0.050 mg/L of Fe(NO3)3-9H2O; 0.417 mg/L of FeSO4-7H2O; 311.80 mg/L of KCl;
28.64 mg/L of MgCl2; 48.84 mg/L of MgSO4; 6995.50 mg/L of NaCl; 2400.0 mg/L of
NaHCO3; 62.50 mg/L of NaH2PO4-H2O; 71.02 mg/L of Na2HPO4; 0.4320 mg/L of
ZnSO4-7H2O; 0.002 mg/L of Arachidonic Acid; 1.022 mg/L of Cholesterol; 0.070 mg/L of
DL-alpha-Tocopherol-Acetate; 0.0520 mg/L of Linoleic Acid; 0.010 mg/L of Linolenic Acid;
0.010 mg/L of Myristic Acid; 0.010 mg/L of Oleic Acid; 0.010 mg/L of Palmitric Acid;
0.010 mg/L of Palmitic Acid; 100 mg/L of Pluronic F-68; 0.010 mg/L of Stearic Acid;
2.20 mg/L of Tween 80; 4551 mg/L of D-Glucose; 130.85 mg/ml of L-Alanine; 147.50
mg/ml of L-Arginine-HCL; 7.50 mg/ml of L-Asparagine-H2O; 6.65 mg/ml of L-Aspartic
Acid; 29.56 mg/ml of L-Cystine-2HCL-H2O; 31.29 mg/ml of L-Cystine-2HCL; 7.35 mg/ml
of L-Glutamic Acid; 365.0 mg/ml of L-Glutamine; 18.75 mg/ml of Glycine; 52.48 mg/ml
of L-Histidine-HCL-H2O; 106.97 mg/ml of L-Isoleucine; 111.45 mg/ml of L-Leucine;
163.75 mg/ml of L-Lysine HCL; 32.34 mg/ml of L-Methionine; 68.48 mg/ml of
L-Phenylalanine; 40.0 mg/ml of L-Proline; 26.25 mg/ml of L-Serine; 101.05 mg/ml of
L-Threonine; 19.22 mg/ml of L-Tryptophan; 91.79 mg/ml of L-Tyrosine-2Na-2H2O; 99.65
mg/ml of L-Valine; 0.0035 mg/L of Biotin; 3.24 mg/L of D-Ca Pantothenate; 11.78 mg/L
of Choline Chloride; 4.65 mg/L of Folic Acid; 15.60 mg/L of i-Inositol; 3.02 mg/L of
Niacinamide; 3.00 mg/L of Pyridoxal HCL; 0.031 mg/L of Pyridoxine HCL; 0.319 mg/L
of Riboflavin; 3.17 mg/L of Thiamine HCL; 0.365 mg/L of Thymidine; and 0.680 mg/L
of Vitamin B12; 25 mM of HEPES Buffer; 239 mg/L of Na Hypoxanthine; 0.105 mg/L of
Lipoic Acid; 0.081 mg/L of Sodium Putrescine-2HCL; 55.0 mg/L of Sodium Pyruvate;
0.0067 mg/L of Sodium Selenite; 20 uM of Ethanolamine; 0.122 mg/L of Ferric Citrate;
41.70 mg/L of Methyl-B-Cyclodextrin complexed with Linoleic Acid; 33.33 mg/L of
Methyl-B-Cyclodextrin complexed with Oleic Acid; and 10 mg/L of
Methyl-B-Cyclodextrin complexed with Retinal) with 2 mM glutamine and 1x penstrep. (BSA
(81-068-3 Bayer) 100 gm dissolved in 1L DMEM for a 10% BSA stock solution). Filter the
media and collect 50 ul for endotoxin assay in 15 ml polystyrene conical.

The transfection reaction is terminated, preferably by tag-teaming, at the end of the 
incubation period. Person A aspirates off the transfection media, while person B adds 
1.5 ml appropriate media to each well. Incubate at 37 degrees C for 45 or 72 hours
depending on the media used: 1% BSA for 45 hours or CHO-5 for 72 hours.

On day four, using a 300ul multichannel pipetter, aliquot 600ul in one 1ml deep 
well plate and the remaining supernatant into a 2ml deep well. The supernatants from each
well can then be used in the assays described in Examples 13-20. 

It is specifically understood that when activity is obtained in any of the assays 
described below using a supernatant, the activity originates from either the polypeptide 
directly (e.g., as a secreted protein) or by the polypeptide inducing expression of other 
proteins, which are then secreted into the supernatant. Thus, the invention further provides 
a method of identifying the protein in the supernatant characterized by an activity in a 
particular assay. 

Example 12: Construction of GAS Reporter Construct

One signal transduction pathway involved in the differentiation and proliferation of 
cells is called the Jaks-STATs pathway. Activated proteins in the Jaks-STATs pathway 
bind to gamma activation site ("GAS") elements or interferon-sensitive responsive element
("ISRE"), located in the promoter of many genes. The binding of a protein to these
elements alters the expression of the associated gene.

GAS and ISRE elements are recognized by a class of transcription factors called 
Signal Transducers and Activators of Transcription, or "STATs." There are six members 
of the STATs family. Stat1 and Stat3 are present in many cell types, as is Stat2 (as
response to IFN-alpha is widespread). Stat4 is more restricted and is not in many cell
types though it has been found in T helper class I cells after treatment with IL-12. Stat5
was originally called mammary growth factor, but has been found at higher concentrations
in other cells including myeloid cells. It can be activated in tissue culture cells by many
cytokines.

The STATs are activated to translocate from the cytoplasm to the nucleus upon 
tyrosine phosphorylation by a set of kinases known as the Janus Kinase ("Jaks") family. 
Jaks represent a distinct family of soluble tyrosine kinases and include Tyk2, Jak1, Jak2,
and Jak3. These kinases display significant sequence similarity and are generally 
catalytically inactive in resting cells. 

The Jaks are activated by a wide range of receptors summarized in the Table below. 
(Adapted from review by Schindler and Darnell, Ann. Rev. Biochem. 64:621-51 (1995).) A
cytokine receptor family, capable of activating Jaks, is divided into two groups: (a) Class 1 
includes receptors for IL-2, IL-3, IL-4, IL-6, IL-7, IL-9, IL-11, IL-12, IL-15, Epo, PRL, 
GH, G-CSF, GM-CSF, LIF, CNTF, and thrombopoietin; and (b) Class 2 includes IFN-a, 
IFN-g, and IL-10. The Class 1 receptors share a conserved cysteine motif (a set of four 
conserved cysteines and one tryptophan) and a WSXWS motif (a membrane proximal 
region encoding Trp-Ser-Xxx-Trp-Ser (SEQ ID NO:2)). 

Thus, on binding of a ligand to a receptor, Jaks are activated, which in turn activate 
STATs, which then translocate and bind to GAS elements. This entire process is 
encompassed in the Jaks-STATs signal transduction pathway. 

Therefore, activation of the Jaks-STATs pathway, reflected by the binding of the 
GAS or the ISRE element, can be used to indicate proteins involved in the proliferation 
and differentiation of cells. For example, growth factors and cytokines are known to 
activate the Jaks-STATs pathway. (See Table below.) Thus, by using GAS elements 
linked to reporter molecules, activators of the Jaks-STATs pathway can be identified.

Ligand                      JAKs: tyk2

IFN family
IFN-a/B                     +
IFN-g
IL-10                       +

gp130 family
IL-6 (Pleiotrophic)         +
IL-11 (Pleiotrophic)        ?
OnM (Pleiotrophic)          ?
LIF (Pleiotrophic)          ?
CNTF (Pleiotrophic)         -/+
G-CSF (Pleiotrophic)        ?
IL-12 (Pleiotrophic)        +

g-C family
IL-2 (lymphocytes)
IL-4 (lymph/myeloid)
IL-7 (lymphocytes)
IL-9 (lymphocytes)
IL-13 (lymphocyte)
IL-15                       ?

gp140 family
IL-3 (myeloid)
IL-5 (myeloid)
GM-CSF (myeloid)

Growth hormone family
GH                          ?
PRL                         ?
EPO                         ?

Receptor Tyrosine Kinases
EGF                         ?
PDGF                        ?
CSF-1                       ?

JAKs: Jak1  Jak2  Jak3

+/-

STATS

1,2,3
1

1,3

1,3
1,3
1,3
1,3

1,3
1,3

1,3,5

6
5
5
6
5

5

1,3,5

5

1,3

1,3
1,3

GAS (elements) or ISRE

ISRE

GAS (IRF1>Lys6>IFP)

GAS (IRF1>Lys6>IFP)

GAS

GAS (IRF1 = IFP >> Ly6)(IgH)

GAS

GAS

GAS

GAS

GAS (IRF1>IFP >> Ly6)

GAS

GAS

GAS (B-CAS > IRF1 = IFP >> Ly6)

GAS (IRF1)
GAS (not IRF1)

To construct a synthetic GAS containing promoter element, which is used in the
Biological Assays described in Examples 13-14, a PCR based strategy is employed to
generate a GAS-SV40 promoter sequence. The 5' primer contains four tandem copies of
the GAS binding site found in the IRF1 promoter and previously demonstrated to bind
STATs upon induction with a range of cytokines (Rothman et al., Immunity 1:457-468
(1994)), although other GAS or ISRE elements can be used instead. The 5' primer also
contains 18 bp of sequence complementary to the SV40 early promoter sequence and is
flanked with an XhoI site. The sequence of the 5' primer is:

5':GCGCCTCGAGATTTCCCCGAAATCTAGATTTCCCCGAAATGATTTCCCCGAA 
ATGATTTCCCCGAAATATCTGCCATCTCAATTAG:3' (SEQ ID NO:3) 
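
The stated features of this primer (four tandem GAS copies, an XhoI site, and an 18 bp region complementary to the SV40 early promoter) can be confirmed by simple string inspection. The Python sketch below is illustrative: the 11-base repeat TTTCCCCGAAA is read off the primer itself and is an assumption about where the GAS site begins and ends, and the function name is hypothetical.

```python
# 5' primer (SEQ ID NO:3), copied from the text above.
GAS_PRIMER = (
    "GCGCCTCGAGATTTCCCCGAAATCTAGATTTCCCCGAAATGATTTCCCCGAA"
    "ATGATTTCCCCGAAATATCTGCCATCTCAATTAG"
)

GAS_REPEAT = "TTTCCCCGAAA"  # assumed repeat unit, inferred from the primer
XHOI_SITE = "CTCGAG"        # XhoI recognition sequence


def summarize_primer(primer: str, repeat: str, site: str, anneal_len: int = 18) -> dict:
    """Collect basic features of a primer by plain substring search."""
    return {
        "length": len(primer),
        "repeat_copies": primer.count(repeat),      # non-overlapping occurrences
        "site_position": primer.find(site),         # 0-based index, -1 if absent
        "annealing_region": primer[-anneal_len:],   # 3' end that anneals to SV40
    }


if __name__ == "__main__":
    for key, value in summarize_primer(GAS_PRIMER, GAS_REPEAT, XHOI_SITE).items():
        print(f"{key}: {value}")
```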

The downstream primer is complementary to the SV40 promoter and is flanked
with a Hind III site: 5':GCGGCAAGCTTTTTGCAAAGCCTAGGC:3' (SEQ ID NO:4)

PCR amplification is performed using the SV40 promoter template present in the
B-gal:promoter plasmid obtained from Clontech. The resulting PCR fragment is digested
with XhoI/Hind III and subcloned into BLSK2-. (Stratagene.) Sequencing with forward
and reverse primers confirms that the insert contains the following sequence:
5':CTCGAGATTTCCCCGAAATCTAGATTTCCCCGAAATGATTTCCCCGAAATGA
TTTCCCCGAAATATCTGCCATCTCAATTAGTCAGCAACCATAGTCCCGCCCCTA
ACTCCGCCCATCCCGCCCCTAACTCCGCCCAGTTCCGCCCATTCTCCGCCCCAT
GGCTGACTAATTTTTTTTATTTATGCAGAGGCCGAGGCCGCCTCGGCCTCTGAG
CTATTCCAGAAGTAGTGAGGAGGCTTTTTTGGAGGCCTAGGCTTTTGCAAAAA
GCTT:3' (SEQ ID NO:5)

With this GAS promoter element linked to the SV40 promoter, a GAS:SEAP2
reporter construct is next engineered. Here, the reporter molecule is a secreted alkaline
phosphatase, or "SEAP." Clearly, however, any reporter molecule can be used instead of
SEAP, in this or in any of the other Examples. Well known reporter molecules that can be
used instead of SEAP include chloramphenicol acetyltransferase (CAT), luciferase,
alkaline phosphatase, B-galactosidase, green fluorescent protein (GFP), or any protein
detectable by an antibody.

The above sequence-confirmed synthetic GAS-SV40 promoter element is
subcloned into the pSEAP-Promoter vector obtained from Clontech using HindIII and
XhoI, effectively replacing the SV40 promoter with the amplified GAS:SV40 promoter
element, to create the GAS-SEAP vector. However, this vector does not contain a
neomycin resistance gene, and therefore, is not preferred for mammalian expression 
systems. 

Thus, in order to generate mammalian stable cell lines expressing the GAS-SEAP 
reporter, the GAS-SEAP cassette is removed from the GAS-SEAP vector using SalI and
NotI, and inserted into a backbone vector containing the neomycin resistance gene, such as 
pGFP-1 (Clontech), using these restriction sites in the multiple cloning site, to create the 
GAS-SEAP/Neo vector. Once this vector is transfected into mammalian cells, this vector 
can then be used as a reporter molecule for GAS binding as described in Examples 13-14. 

Other constructs can be made using the above description and replacing GAS with 
a different promoter sequence. For example, construction of reporter molecules containing
NF-KB and EGR promoter sequences is described in Examples 15 and 16. However,
many other promoters can be substituted using the protocols described in these Examples.
For instance, SRE, IL-2, NFAT, or Osteocalcin promoters can be substituted, alone or in
combination (e.g., GAS/NF-KB/EGR, GAS/NF-KB, IL-2/NFAT, or NF-KB/GAS).
Similarly, other cell lines can be used to test reporter construct activity, such as HELA 
(epithelial), HUVEC (endothelial), Reh (B-cell), Saos-2 (osteoblast), HUVAC (aortic), or 
Cardiomyocyte. 

Example 13: High-Throughput Screening Assay for T-cell Activity.

The following protocol is used to assess T-cell activity by identifying factors, and
determining whether supernate containing a polypeptide of the invention proliferates
and/or differentiates T-cells. T-cell activity is assessed using the GAS/SEAP/Neo
construct produced in Example 12. Thus, factors that increase SEAP activity indicate the
ability to activate the Jaks-STATS signal transduction pathway. The T-cells used in this
assay are Jurkat T-cells (ATCC Accession No. TIB-152), although Molt-3 cells (ATCC
Accession No. CRL-1552) and Molt-4 cells (ATCC Accession No. CRL-1582) can
also be used.

Jurkat T-cells are lymphoblastic CD4+ Th1 helper cells. In order to generate stable
cell lines, approximately 2 million Jurkat cells are transfected with the GAS-SEAP/neo
vector using DMRIE-C (Life Technologies) (transfection procedure described below). The
transfected cells are seeded to a density of approximately 20,000 cells per well and
transfectants resistant to 1 mg/ml Geneticin are selected. Resistant colonies are expanded and
then tested for their response to increasing concentrations of interferon gamma. The dose
response of a selected clone is demonstrated.

Specifically, the following protocol will yield sufficient cells for 75 wells 
containing 200 ul of cells. Thus, it is either scaled up, or performed in multiple to generate 
sufficient cells for multiple 96 well plates. Jurkat cells are maintained in RPMI + 10%
serum with 1% Pen-Strep. Combine 2.5 mls of OPTI-MEM (Life Technologies) with 10
ug of plasmid DNA in a T25 flask. Add 2.5 ml OPTI-MEM containing 50 ul of DMRIE-C 
and incubate at room temperature for 15-45 mins. 

During the incubation period, count cell concentration, spin down the required 
number of cells (10^7 per transfection), and resuspend in OPTI-MEM to a final
concentration of 10^7 cells/ml. Then add 1 ml of 1 x 10^7 cells in OPTI-MEM to T25 flask
and incubate at 37 degrees C for 6 hrs. After the incubation, add 10 ml of RPMI + 15% 
serum. 

The Jurkat:GAS-SEAP stable reporter lines are maintained in RPMI + 10% serum, 
1 mg/ml Geneticin, and 1% Pen-Strep. These cells are treated with supernatants containing
polypeptides of the invention and/or induced polypeptides of the invention as produced by
the protocol described in Example 11.

On the day of treatment with the supernatant, the cells should be washed and
resuspended in fresh RPMI + 10% serum to a density of 500,000 cells per ml. The exact
number of cells required will depend on the number of supernatants being screened. For
one 96 well plate, approximately 10 million cells (for 10 plates, 100 million cells) are
required.

Transfer the cells to a triangular reservoir boat, in order to dispense the cells into a 
96 well dish, using a 12 channel pipette. Using a 12 channel pipette, transfer 200 ul of 
cells into each well (therefore adding 100,000 cells per well).
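
The cell numbers quoted for seeding follow from the stated density and well volume. The Python sketch below is an illustrative check, assuming every well of a 96 well plate is filled and no allowance is made for reservoir dead volume; the function name is hypothetical.

```python
def seeding_requirements(n_plates: int,
                         wells_per_plate: int = 96,
                         ul_per_well: float = 200.0,
                         cells_per_ml: float = 500_000.0) -> dict:
    """Cells per well, suspension volume, and total cells for n_plates."""
    cells_per_well = cells_per_ml * ul_per_well / 1000.0
    total_wells = n_plates * wells_per_plate
    return {
        "cells_per_well": cells_per_well,
        "suspension_ml": total_wells * ul_per_well / 1000.0,
        "total_cells": cells_per_well * total_wells,
    }


if __name__ == "__main__":
    for plates in (1, 10):
        req = seeding_requirements(plates)
        print(f"{plates} plate(s): {req['cells_per_well']:.0f} cells/well, "
              f"{req['suspension_ml']:.1f} ml, {req['total_cells']:.2e} cells")
```

One plate works out to 9.6 million cells, consistent with the "approximately 10 million" figure given above.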

After all the plates have been seeded, 50 ul of the supernatants are transferred 
directly from the 96 well plate containing the supernatants into each well using a 12 
channel pipette. In addition, a dose of exogenous interferon gamma (0.1, 1.0, 10 ng) is 
added to wells H9, H10, and H11 to serve as additional positive controls for the assay.

The 96 well dishes containing Jurkat cells treated with supernatants are placed in an 
incubator for 48 hrs (note: this time is variable between 48-72 hrs). 35 ul samples from 
each well are then transferred to an opaque 96 well plate using a 12 channel pipette. The 
opaque plates should be covered (using cellophane covers) and stored at -20 degrees C
until SEAP assays are performed according to Example 17. The plates containing the 
remaining treated cells are placed at 4 degrees C and serve as a source of material for 
repeating the assay on a specific well if desired. 

As a positive control, 100 Unit/ml interferon gamma can be used which is known to 
activate Jurkat T cells. Over 30 fold induction is typically observed in the positive control 
wells. 

The above protocol may be used in the generation of both transient, as well as, 
stable transfected cells, which would be apparent to those of skill in the art. 

Example 14: High-Throughput Screening Assay Identifying Myeloid Activity

The following protocol is used to assess myeloid activity by determining whether
polypeptides of the invention proliferate and/or differentiate myeloid cells. Myeloid cell
activity is assessed using the GAS/SEAP/Neo construct produced in Example 12. Thus,
factors that increase SEAP activity indicate the ability to activate the Jaks-STATS signal
transduction pathway. The myeloid cell used in this assay is U937, a pre-monocyte cell
line, although TF-1, HL60, or KG1 can be used.

To transiently transfect U937 cells with the GAS/SEAP/Neo construct produced in 
Example 12, a DEAE-Dextran method (Kharbanda et al., 1994, Cell Growth &
Differentiation, 5:259-265) is used. First, harvest 2 x 10^7 U937 cells and wash with PBS.
The U937 cells are usually grown in RPMI 1640 medium containing 10% heat-inactivated 
fetal bovine serum (FBS) supplemented with 100 units/ml penicillin and 100 mg/ml 
streptomycin. 

Next, suspend the cells in 1 ml of 20 mM Tris-HCl (pH 7.4) buffer containing 0.5 
mg/ml DEAE-Dextran, 8 ug GAS-SEAP2 plasmid DNA, 140 mM NaCl, 5 mM KCl, 375
uM Na2HPO4.7H2O, 1 mM MgCl2, and 675 uM CaCl2. Incubate at 37 degrees C for 45
min. 

Wash the cells with RPMI 1640 medium containing 10% FBS and then resuspend 
in 10 ml complete medium and incubate at 37 degrees C for 36 hr. 

The GAS-SEAP/U937 stable cells are obtained by growing the cells in 400 ug/ml 
G418. The G418-free medium is used for routine growth but every one to two months, the
cells should be re-grown in 400 ug/ml G418 for a couple of passages.

These cells are tested by harvesting 1 x 10^8 cells (this is enough for ten 96-well
plate assays) and washing with PBS. Suspend the cells in 200 ml of the above described growth
medium, with a final density of 5 x 10^5 cells/ml. Plate 200 ul cells per well in the 96-well
plate (or 1 x 10^5 cells/well).

Add 50 ul of the supernatant prepared by the protocol described in Example 11. 
Incubate at 37 degrees C for 48 to 72 hr. As a positive control, 100 Unit/ml interferon 
gamma can be used which is known to activate U937 cells. Over 30 fold induction is 
typically observed in the positive control wells. SEAP assay the supernatant according to 
the protocol described in Example 17.

Example 15: High-Throughput Screening Assay Identifying Neuronal Activity.

When cells undergo differentiation and proliferation, a group of genes are activated 
through many different signal transduction pathways. One of these genes, EGR1 (early 
growth response gene 1), is induced in various tissues and cell types upon activation. The 
promoter of EGR1 is responsible for such induction. Using the EGR1 promoter linked to 
reporter molecules, activation of cells can be assessed. 

Particularly, the following protocol is used to assess neuronal activity in PC12 cell
lines. PC12 cells (rat pheochromocytoma cells) are known to proliferate and/or
differentiate by activation with a number of mitogens, such as TPA (tetradecanoyl phorbol 
acetate), NGF (nerve growth factor), and EGF (epidermal growth factor). The EGR1 gene 
expression is activated during this treatment. Thus, by stably transfecting PC12 cells with 
a construct containing an EGR promoter linked to SEAP reporter, activation of PC12 cells 
can be assessed. 

The EGR/SEAP reporter construct can be assembled by the following protocol. 
The EGR-1 promoter sequence (-633 to +1) (Sakamoto K et al., Oncogene 6:867-871
(1991)) can be PCR amplified from human genomic DNA using the following primers:
5' GCGCTCGAGGGATGACAGCGATAGAACCCCGG-3' (SEQ ID NO:6)
5' GCGAAGCTTCGCGAGTCCCCGGATCCGCCTC-3' (SEQ ID NO:7)
Using the GAS:SEAP/Neo vector produced in Example 12, EGR1 amplified
product can then be inserted into this vector. Linearize the GAS:SEAP/Neo vector using
restriction enzymes XhoI/HindIII, removing the GAS/SV40 stuffer. Restrict the EGR1
amplified product with these same enzymes. Ligate the vector and the EGR1 promoter.
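
Because the amplified product is cut with the same two enzymes as the vector, each primer should carry one of the corresponding recognition sequences near its 5' end. The Python sketch below locates them by substring search; it is an illustrative check only, the primer strings are copied from the text above, and the dictionary and function names are hypothetical.

```python
# Recognition sequences of the two enzymes named in the protocol.
SITES = {"XhoI": "CTCGAG", "HindIII": "AAGCTT"}

# EGR-1 promoter primers as given in the text.
PRIMERS = {
    "SEQ ID NO:6": "GCGCTCGAGGGATGACAGCGATAGAACCCCGG",
    "SEQ ID NO:7": "GCGAAGCTTCGCGAGTCCCCGGATCCGCCTC",
}


def find_sites(sequence: str) -> dict:
    """Map each enzyme whose site occurs in the sequence to its 0-based position."""
    return {
        enzyme: sequence.find(site)
        for enzyme, site in SITES.items()
        if site in sequence
    }


if __name__ == "__main__":
    for name, sequence in PRIMERS.items():
        print(name, find_sites(sequence))
```

Under this check the upstream primer carries the XhoI site and the downstream primer carries the HindIII site, each preceded by a short 5' clamp.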

To prepare 96 well-plates for cell culture, two mls of a coating solution (1:30
dilution of collagen type I (Upstate Biotech Inc. Cat#08-115) in 30% ethanol (filter
sterilized)) is added per one 10 cm plate or 50 ul per well of the 96-well plate, and allowed
to air dry for 2 hr.

PC12 cells are routinely grown in RPMI-1640 medium (Bio Whittaker) containing
10% horse serum (JRH BIOSCIENCES, Cat. # 12449-78P), 5% heat-inactivated fetal
bovine serum (FBS) supplemented with 100 units/ml penicillin and 100 ug/ml
streptomycin on a precoated 10 cm tissue culture dish. One to four split is done every
three to four days. Cells are removed from the plates by scraping and resuspended by
pipetting up and down more than 15 times.

Transfect the EGR/SEAP/Neo construct into PC12 using the Lipofectamine 
protocol described in Example 11. EGR-SEAP/PC12 stable cells are obtained by growing
the cells in 300 ug/ml G418. The G418-free medium is used for routine growth but every
one to two months, the cells should be re-grown in 300 ug/ml G418 for a couple of passages.

To assay for neuronal activity, a 10 cm plate with cells around 70 to 80% confluent 
is screened by removing the old medium. Wash the cells once with PBS (Phosphate 
buffered saline). Then starve the cells in low serum medium (RPMI-1640 containing 1%
horse serum and 0.5% FBS with antibiotics) overnight. 

The next morning, remove the medium and wash the cells with PBS. Scrape off 
the cells from the plate, suspend the cells well in 2 ml low serum medium. Count the cell
number and add more low serum medium to reach a final cell density of 5 x 10^5 cells/ml.

Add 200 ul of the cell suspension to each well of 96-well plate (equivalent to 1 x 10^5
cells/well). Add 50 ul supernatant produced by Example 11 and incubate at 37 degrees C
for 48 to 72 hr. As a
positive control, a growth factor known to activate PC12 cells through EGR can be used, 
such as 50 ng/ul of Neuronal Growth Factor (NGF). Over fifty-fold induction of SEAP is 
typically seen in the positive control wells. SEAP assay the supernatant according to 
Example 17. 


Example 16: High-Throughput Screening Assay for T-cell Activity

NF-KB (Nuclear Factor KB) is a transcription factor activated by a wide variety of
agents including the inflammatory cytokines IL-1 and TNF, CD30 and CD40,
lymphotoxin-alpha and lymphotoxin-beta, by exposure to LPS or thrombin, and by
expression of certain viral gene products. As a transcription factor, NF-KB regulates the
expression of genes involved in immune cell activation, control of apoptosis (NF-KB
appears to shield cells from apoptosis), B and T-cell development, anti-viral and
antimicrobial responses, and multiple stress responses.

In non-stimulated conditions, NF-KB is retained in the cytoplasm with I-KB
(Inhibitor KB). However, upon stimulation, I-KB is phosphorylated and degraded,
causing NF-KB to shuttle to the nucleus, thereby activating transcription of target genes.
Target genes activated by NF-KB include IL-2, IL-6, GM-CSF, ICAM-1 and class 1
MHC. 

Due to its central role and ability to respond to a range of stimuli, reporter 
constructs utilizing the NF-KB promoter element are used to screen the supernatants 
produced in Example 11. Activators or inhibitors of NF-KB would be useful in treating
diseases. For example, inhibitors of NF-KB could be used to treat those diseases related to 
the acute or chronic activation of NF-KB, such as rheumatoid arthritis. 

To construct a vector containing the NF-KB promoter element, a PCR based 
strategy is employed. The upstream primer contains four tandem copies of the NF-KB 
binding site (GGGGACTTTCCC) (SEQ ID NO:8), 18 bp of sequence complementary to 
the 5' end of the SV40 early promoter sequence, and is flanked with an XhoI site:

5':GCGGCCTCGAGGGGACTTTGCCGGGGACTTTCCGGGGACTTTCCGGGACTTT 
CCATCCTGCCATCTCAATTAG:3' (SEQ ID NO:9) 

The downstream primer is complementary to the 3' end of the SV40 promoter and 
is flanked with a Hind III site: 

5' :GCGGCAAGCTTTTTGCAAAGCCTAGGC:3' (SEQ ID NO:4) 

PCR amplification is performed using the SV40 promoter template present in the
pB-gal:promoter plasmid obtained from Clontech. The resulting PCR fragment is digested
with XhoI and Hind III and subcloned into BLSK2-. (Stratagene) Sequencing with the T7
and T3 primers confirms the insert contains the following sequence:

5':CTCGAGGGGACTTTCCCGGGGACTTTCCGGGGACTTTCCGGGACTTTCCATC
TGCCATCTCAATTAGTCAGCAACCATAGTCCCGCCCCTAACTCCGCCCATCCCG
CCCCTAACTCCGCCCAGTTCCGCCCATTCTCCGCCCCATGGCTGACTAATTTTT
TTTATTTATGCAGAGGCCGAGGCCGCCTCGGCCTCTGAGCTATTCCAGAAGTA
[...]TTTTGCAAAAAGCTT:3' (SEQ ID NO:10)

Next, replace the SV40 minimal promoter element present in the pSEAP2-promoter
plasmid (Clontech) with this NF-KB/SV40 fragment using XhoI and HindIII. However,
this vector does not contain a neomycin resistance gene, and therefore, is not preferred for
mammalian expression systems.

In order to generate stable mammalian cell lines, the NF-KB/SV40/SEAP cassette
is removed from the above NF-KB/SEAP vector using restriction enzymes SalI and NotI,
and inserted into a vector containing neomycin resistance. Particularly, the
NF-KB/SV40/SEAP cassette was inserted into pGFP-1 (Clontech), replacing the GFP gene,
after restricting pGFP-1 with SalI and NotI.

Once NF-KB/SV40/SEAP/Neo vector is created, stable Jurkat T-cells are created
and maintained according to the protocol described in Example 13. Similarly, the method
for assaying supernatants with these stable Jurkat T-cells is also described in Example 13.
As a positive control, exogenous TNF alpha (0.1, 1, 10 ng) is added to wells H9, H10, and
H11, with a 5-10 fold activation typically observed.

Example 17: Assay for SEAP Activity

As a reporter molecule for the assays described in Examples 13-16, SEAP activity
is assayed using the Tropix Phospho-light Kit (Cat. BP-400) according to the following
general procedure. The Tropix Phospho-light Kit supplies the Dilution, Assay, and
Reaction Buffers used below.

Prime a dispenser with the 2.5x Dilution Buffer and dispense 15 ul of 2.5x dilution
buffer into Optiplates containing 35 ul of a supernatant. Seal the plates with a plastic
sealer and incubate at 65 degrees C for 30 min. Separate the Optiplates to avoid uneven
heating.

Cool the samples to room temperature for 15 minutes. Empty the dispenser and 
prime with the Assay Buffer. Add 50 ul Assay Buffer and incubate at room temperature for
5 min. Empty the dispenser and prime with the Reaction Buffer (see the table below). Add
50 ul Reaction Buffer and incubate at room temperature for 20 minutes. Since the
intensity of the chemiluminescent signal is time dependent, and it takes about 10 minutes 
to read 5 plates on luminometer, one should treat 5 plates at each time and start the second 
set 10 minutes later. 

Read the relative light unit in the luminometer. Set H12 as blank, and print the
results. An increase in chemiluminescence indicates reporter activity.
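
One way the luminometer readings might be reduced to the fold induction figures quoted in Examples 13-16 is sketched below in Python. This is an assumption-laden illustration rather than the procedure of the kit: the choice of a vector-only well as the baseline, the function name, and the numerical readings are all hypothetical, and only the use of well H12 as the blank comes from the text.

```python
def fold_induction(sample_rlu: float, baseline_rlu: float, blank_rlu: float) -> float:
    """Blank-corrected signal of a sample well relative to a baseline well."""
    corrected_baseline = baseline_rlu - blank_rlu
    if corrected_baseline <= 0:
        raise ValueError("baseline reading must exceed the blank reading")
    return (sample_rlu - blank_rlu) / corrected_baseline


if __name__ == "__main__":
    # Hypothetical relative light unit (RLU) readings, not measured data.
    blank = 100.0      # well H12, set as blank
    baseline = 200.0   # well treated with vector-only supernatant (assumed control)
    positive = 3100.0  # positive control well
    print(f"fold induction: {fold_induction(positive, baseline, blank):.1f}")
```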


Reaction Buffer Formulation:

# of plates    Rxn buffer diluent (ml)    CSPD (ml)
10             60                         3
11             65                         3.25
12             70                         3.5
13             75                         3.75
14             80                         4
15             85                         4.25
16             90                         4.5
17             95                         4.75
18             100                        5
19             105                        5.25
20             110                        5.5
21             115                        5.75
22             120                        6
23             125                        6.25
24             130                        6.5
25             135                        6.75
26             140                        7
27             145                        7.25
28             150                        7.5
29             155                        7.75
30             160                        8
31             165                        8.25
32             170                        8.5
33             175                        8.75
34             180                        9
35             185                        9.25
36             190                        9.5
37             195                        9.75
38             200                        10
39             205                        10.25
40             210                        10.5
41             215                        10.75
42             220                        11
43             225                        11.25
44             230                        11.5
45             235                        11.75
46             240                        12
47             245                        12.25
48             250                        12.5
49             255                        12.75
50             260                        13
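
The entries of the Reaction Buffer Formulation table follow a simple linear pattern: each additional plate adds 5 ml of diluent, and the CSPD volume is one twentieth of the diluent volume. The Python sketch below reproduces the tabulated values on that basis. The formula is inferred from the table rather than stated in the text, so it is restricted to the tabulated range of 10 to 50 plates, and the function name is hypothetical.

```python
def reaction_buffer_ml(n_plates: int) -> tuple[int, float]:
    """Return (diluent_ml, cspd_ml) for n_plates, following the tabulated pattern."""
    if not 10 <= n_plates <= 50:
        raise ValueError("the table only covers 10 to 50 plates")
    diluent_ml = 5 * n_plates + 10   # 60 ml at 10 plates, plus 5 ml per extra plate
    cspd_ml = diluent_ml / 20        # CSPD is 1/20 of the diluent volume
    return diluent_ml, cspd_ml


if __name__ == "__main__":
    for plates in (10, 25, 39, 50):
        diluent, cspd = reaction_buffer_ml(plates)
        print(f"{plates} plates: {diluent} ml diluent, {cspd} ml CSPD")
```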


Example 18: High-Throughput Screening Assay Identifying Changes in Small
Molecule Concentration and Membrane Permeability 

Binding of a ligand to a receptor is known to alter intracellular levels of small 
molecules, such as calcium, potassium, sodium, and pH, as well as alter membrane 
potential. These alterations can be measured in an assay to identify supernatants which 
bind to receptors of a particular cell. Although the following protocol describes an assay 
for calcium, this protocol can easily be modified to detect changes in potassium, sodium, 
pH, membrane potential, or any other small molecule which is detectable by a fluorescent 
probe. 

The following assay uses a Fluorometric Imaging Plate Reader ("FLIPR") to 
measure changes in fluorescent molecules (Molecular Probes) that bind small molecules. 
Clearly, any fluorescent molecule detecting a small molecule can be used instead of the 
calcium fluorescent molecule, fluo-4 (Molecular Probes, Inc.; catalog no. F-14202), used 
here. 

For adherent cells, seed the cells at 10,000-20,000 cells/well in a Costar black 
96-well plate with a clear bottom. The plate is incubated in a CO2 incubator for 20 hours. 
The adherent cells are washed two times in a Biotek washer with 200 ul of HBSS (Hank's 
Balanced Salt Solution), leaving 100 ul of buffer after the final wash. 

A stock solution of 1 mg/ml fluo-4 is made in 10% pluronic acid DMSO. To load 
the cells with fluo-4, 50 ul of 12 ug/ml fluo-4 is added to each well. The plate is 
incubated at 37 degrees C in a CO2 incubator for 60 min. The plate is washed four times in 
the Biotek washer with HBSS, leaving 100 ul of buffer. 

For non-adherent cells, the cells are spun down from culture media. Cells are 
resuspended to 2-5x10^6 cells/ml with HBSS in a 50-ml conical tube. 4 ul of 1 mg/ml fluo-4 
solution in 10% pluronic acid DMSO is added to each ml of cell suspension. The tube is 
then placed in a 37 degrees C water bath for 30-60 min. The cells are washed twice with 
HBSS, resuspended to 1x10^6 cells/ml, and dispensed into a microplate, 100 ul/well. The 
plate is centrifuged at 1000 rpm for 5 min. The plate is then washed once in Denley 
CellWash with 200 ul, followed by an aspiration step to 100 ul final volume. 
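
The two loading procedures above can be compared by working out the dye concentration during loading. The following sketch is illustrative only: it assumes simple volumetric dilution (C1 x V1 = C2 x V2) and uses a hypothetical helper name. 

```python
def final_concentration(stock_ug_per_ml: float, added_ul: float, existing_ul: float) -> float:
    """Concentration after adding `added_ul` of stock to `existing_ul` of liquid.

    Assumption: volumes are additive and mixing is complete.
    """
    return stock_ug_per_ml * added_ul / (added_ul + existing_ul)


# Adherent cells: 50 ul of 12 ug/ml fluo-4 added to the 100 ul left in each well.
adherent = final_concentration(12, 50, 100)

# Non-adherent cells: 4 ul of 1 mg/ml (1000 ug/ml) stock per 1 ml (1000 ul) of suspension.
suspension = final_concentration(1000, 4, 1000)

if __name__ == "__main__":
    print(f"adherent loading: {adherent:.2f} ug/ml")
    print(f"suspension loading: {suspension:.2f} ug/ml")
```

Under these assumptions both procedures load the cells at roughly 4 ug/ml fluo-4. 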

For a non-cell based assay, each well contains a fluorescent molecule, such as 
fluo-4. The supernatant is added to the well, and a change in fluorescence is detected. 

To measure the fluorescence of intracellular calcium, the FLIPR is set for the 
following parameters: (1) System gain is 300-800 mW; (2) Exposure time is 0.4 second; 
(3) Camera F/stop is F/2; (4) Excitation is 488 nm; (5) Emission is 530 nm; and (6) Sample 
addition is 50 ul. Increased emission at 530 nm indicates an extracellular signaling event 
which has resulted in an increase in the intracellular Ca++ concentration. 

Example 19: High-Throughput Screening Assay Identifying Tyrosine Kinase Activity 

The Protein Tyrosine Kinases (PTK) represent a diverse group of transmembrane 
and cytoplasmic kinases. Within the Receptor Protein Tyrosine Kinase (RPTK) group are 
receptors for a range of mitogenic and metabolic growth factors including the PDGF, FGF, 
EGF, NGF, HGF and Insulin receptor subfamilies. In addition, there is a large family of 
RPTKs for which the corresponding ligand is unknown. Ligands for RPTKs include 
mainly secreted small proteins, but also membrane-bound and extracellular matrix 
proteins. 

Activation of RPTK by ligands involves ligand-mediated receptor dimerization, 
resulting in transphosphorylation of the receptor subunits and activation of the cytoplasmic 
tyrosine kinases. The cytoplasmic tyrosine kinases include receptor associated tyrosine 
kinases of the src-family (e.g., src, yes, lck, lyn, fyn) and non-receptor linked and cytosolic 
protein tyrosine kinases, such as the Jak family, members of which mediate signal 
transduction triggered by the cytokine superfamily of receptors (e.g., the Interleukins, 
Interferons, GM-CSF, and Leptin). 

Because of the wide range of known factors capable of stimulating tyrosine kinase 
activity, the identification of novel human secreted proteins capable of activating tyrosine 
kinase signal transduction pathways is of interest. Therefore, the following protocol is 
designed to identify those novel human secreted proteins capable of activating the tyrosine 
kinase signal transduction pathways. 

Seed target cells (e.g., primary keratinocytes) at a density of approximately 25,000 
cells per well in 96-well Loprodyne Silent Screen Plates purchased from Nalge Nunc 
(Naperville, IL). The plates are sterilized with two 30 minute rinses with 100% ethanol, 
rinsed with water and dried overnight. Some plates are coated for 2 hr with 100 ul of cell 
culture grade type I collagen (50 mg/ml), gelatin (2%) or polylysine (50 mg/ml), all of 
which can be purchased from Sigma Chemicals (St. Louis, MO), or 10% Matrigel 
purchased from Becton Dickinson (Bedford, MA), or calf serum, rinsed with PBS and 
stored at 4 degrees C. Cell growth on these plates is assayed by seeding 5,000 cells/well in 
growth medium and indirect quantitation of cell number through use of alamarBlue as 
described by the manufacturer, Alamar Biosciences, Inc. (Sacramento, CA), after 48 hr. 
Falcon plate covers #3071 from Becton Dickinson (Bedford, MA) are used to cover the 
Loprodyne Silent Screen Plates. Falcon Microtest III cell culture plates can also be used in 
some proliferation experiments. 

To prepare extracts, A431 cells are seeded onto the nylon membranes of Loprodyne 
plates (20,000/200 ul/well) and cultured overnight in complete medium. Cells are 
quiesced by incubation in serum-free basal medium for 24 hr. After 5-20 minutes of 
treatment with EGF (60 ng/ml) or 50 ul of the supernatant produced in Example 11, the 
medium is removed and 100 ul of extraction buffer (20 mM HEPES pH 7.5, 0.15 M 
NaCl, 1% Triton X-100, 0.1% SDS, 2 mM Na3VO4, 2 mM Na4P2O7 and a cocktail of 
protease inhibitors (# 1836170) obtained from Boehringer Mannheim (Indianapolis, IN)) is 
added to each well and the plate is shaken on a rotating shaker for 5 minutes at 4 degrees 
C. The plate is then placed in a vacuum transfer manifold and the extract filtered through 
the 0.45 um membrane bottoms of each well using house vacuum. Extracts are collected 
in a 96-well catch/assay plate in the bottom of the vacuum manifold and immediately 
placed on ice. To obtain extracts clarified by centrifugation, the content of each well, after 
detergent solubilization for 5 minutes, is removed and centrifuged for 15 minutes at 4 
degrees C at 16,000 x g. 

Test the filtered extracts for levels of tyrosine kinase activity. Although many 
methods of detecting tyrosine kinase activity are known, one method is described here. 

Generally, the tyrosine kinase activity of a supernatant is evaluated by determining 
its ability to phosphorylate a tyrosine residue on a specific substrate (a biotinylated 
peptide). Biotinylated peptides that can be used for this purpose include PSK1 
(corresponding to amino acids 6-20 of the cell division kinase cdc2-p34) and PSK2 
(corresponding to amino acids 1-17 of gastrin). Both peptides are substrates for a range of 
tyrosine kinases and are available from Boehringer Mannheim. 

The tyrosine kinase reaction is set up by adding the following components in order. 
First, add 10 ul of 5 uM Biotinylated Peptide, then 10 ul ATP/Mg2+ (5 mM ATP/50 mM 
MgCl2), then 10 ul of 5x Assay Buffer (40 mM imidazole hydrochloride, pH 7.3, 40 mM 
beta-glycerophosphate, 1 mM EGTA, 100 mM MgCl2, 5 mM MnCl2, 0.5 mg/ml BSA), 
then 5 ul of Sodium Vanadate (1 mM), and then 5 ul of water. Mix the components gently 
and preincubate the reaction mix at 30 degrees C for 2 min. Initiate the reaction by adding 
10 ul of the control enzyme or the filtered supernatant. 
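
The component volumes listed above add up to a 50 ul reaction, which is why the 5x Assay Buffer ends up at 1x. The following sketch works out the total volume and the final concentrations of the stock components. It is illustrative only: it assumes the listed volumes are the complete reaction and that volumes are additive, and the data layout and function names are hypothetical. 

```python
# Volume (ul), stock concentration and unit for each component whose stock
# concentration is given in the protocol above.
COMPONENTS = {
    "biotinylated peptide": (10, 5.0, "uM"),
    "ATP": (10, 5.0, "mM"),
    "sodium vanadate": (5, 1.0, "mM"),
}

# Remaining additions (ul) that only contribute volume in this sketch.
OTHER_VOLUMES_UL = {"5x assay buffer": 10, "water": 5, "enzyme or supernatant": 10}


def total_volume_ul() -> float:
    """Sum of all listed additions."""
    stock_volume = sum(vol for vol, _, _ in COMPONENTS.values())
    return stock_volume + sum(OTHER_VOLUMES_UL.values())


def final_concentrations() -> dict:
    """Final concentration of each stock component in the assembled reaction."""
    total = total_volume_ul()
    return {
        name: (stock * vol / total, unit)
        for name, (vol, stock, unit) in COMPONENTS.items()
    }


if __name__ == "__main__":
    print("total volume (ul):", total_volume_ul())
    for name, (conc, unit) in final_concentrations().items():
        print(f"{name}: {conc:g} {unit}")
    # Dilution of the 5x assay buffer in the assembled reaction.
    print("assay buffer strength: %gx" % (5 * OTHER_VOLUMES_UL["5x assay buffer"] / total_volume_ul()))
```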

The tyrosine kinase assay reaction is then terminated by adding 10 ul of 120 mM 
EDTA and placing the reactions on ice. 

Tyrosine kinase activity is determined by transferring a 50 ul aliquot of the reaction 
mixture to a microtiter plate (MTP) module and incubating at 37 degrees C for 20 min. 
This allows the biotinylated peptide to associate with the streptavidin-coated 96-well plate. 
Wash the MTP module with 300 ul/well of PBS four times. Next add 75 ul of 
anti-phosphotyrosine antibody conjugated to horseradish peroxidase (anti-P-Tyr-POD, 0.5 u/ml) 
to each well and incubate at 37 degrees C for one hour. Wash the wells as above. 

Next add 100 ul of peroxidase substrate solution (Boehringer Mannheim) and 
incubate at room temperature for at least 5 min (up to 30 min). Measure the absorbance 
of the sample at 405 nm using an ELISA reader. The level of bound peroxidase activity 
quantitated in this way reflects the level of tyrosine kinase activity. 

Example 20: High-Throughput Screening Assay Identifying Phosphorylation Activity 

As a potential alternative and/or complement to the assay of protein tyrosine kinase 
activity described in Example 19, an assay which detects activation (phosphorylation) of 
major intracellular signal transduction intermediates can also be used. For example, as 
described below one particular assay can detect tyrosine phosphorylation of the Erk-1 and 
Erk-2 kinases. However, phosphorylation of other molecules, such as Raf, JNK, p38 
MAP, Map kinase kinase (MEK), MEK kinase, Src, Muscle specific kinase (MuSK), 
IRAK, Tec, and Janus, as well as any other phosphoserine, phosphotyrosine, or 
phosphothreonine molecule, can be detected by substituting these molecules for Erk-1 or 
Erk-2 in the following assay. 

Specifically, assay plates are made by coating the wells of a 96-well ELISA plate 
with 0.1 ml of protein G (1 ug/ml) for 2 hr at room temp. (RT). The plates are then rinsed 
with PBS and blocked with 3% BSA/PBS for 1 hr at RT. The protein G plates are then 
treated with 2 commercial monoclonal antibodies (100 ng/well) against Erk-1 and Erk-2 (1 
hr at RT) (Santa Cruz Biotechnology). (To detect other molecules, this step can easily be 
modified by substituting a monoclonal antibody detecting any of the above described 
molecules.) After 3-5 rinses with PBS, the plates are stored at 4 degrees C until use. 

A431 cells are seeded at 20,000/well in a 96-well Loprodyne filterplate and 
cultured overnight in growth medium. The cells are then starved for 48 hr in basal medium 
(DMEM) and then treated with EGF (6 ng/well) or 50 ul of the supernatants obtained in 
Example 11 for 5-20 minutes. The cells are then solubilized and extracts filtered directly 
into the assay plate. 

After incubation with the extract for 1 hr at RT, the wells are again rinsed. As a 
positive control, a commercial preparation of MAP kinase (10 ng/well) is used in place of 
A431 extract. Plates are then treated with a commercial polyclonal (rabbit) antibody 
(1 ug/ml) which specifically recognizes the phosphorylated epitope of the Erk-1 and Erk-2 
kinases (1 hr at RT). This antibody is biotinylated by standard procedures. The bound 
polyclonal antibody is then quantitated by successive incubations with 
Europium-streptavidin and Europium fluorescence enhancing reagent in the Wallac DELFIA 
instrument (time-resolved fluorescence). An increased fluorescent signal over background 
indicates phosphorylation. 

Example 21: Method of Determining Alterations in a Gene Corresponding to a 
Polynucleotide 

RNA is isolated from entire families or individual patients presenting with a 
phenotype of interest (such as a disease). cDNA is then generated from these 
RNA samples using protocols known in the art. (See, Sambrook.) The cDNA is then used 
as a template for PCR, employing primers surrounding regions of interest in SEQ ID 
NO:X. Suggested PCR conditions consist of 35 cycles at 95 degrees C for 30 seconds; 
60-120 seconds at 52-58 degrees C; and 60-120 seconds at 70 degrees C, using buffer 
solutions described in Sidransky et al., Science 252:706 (1991). 
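
As a planning aid, the suggested cycling conditions above can be converted into a total hold time. The following sketch is illustrative only: it counts the programmed hold times and assumes no ramp time, initial denaturation, or final extension, none of which are specified in the text; the function name is hypothetical. 

```python
def cycling_time_minutes(cycles: int, step_seconds: tuple) -> float:
    """Total programmed hold time in minutes for `cycles` repeats of the given steps.

    Assumption: ramping between temperatures is ignored.
    """
    return cycles * sum(step_seconds) / 60


# 35 cycles of: 30 s denaturation, 60-120 s annealing, 60-120 s extension.
shortest = cycling_time_minutes(35, (30, 60, 60))
longest = cycling_time_minutes(35, (30, 120, 120))

if __name__ == "__main__":
    print(f"hold time: {shortest} to {longest} minutes")
```

Under these assumptions a run takes between about one and a half and just over two and a half hours of hold time, before instrument ramping is added. 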

PCR products are then sequenced using primers labeled at their 5' end with T4 
polynucleotide kinase, employing SequiTherm Polymerase (Epicentre Technologies). 
The intron-exon borders of selected exons are also determined and genomic PCR products 
analyzed to confirm the results. PCR products harboring suspected mutations are then 
cloned and sequenced to validate the results of the direct sequencing. 

PCR products are cloned into T-tailed vectors as described in Holton et al., Nucleic 
Acids Research, 19:1156 (1991) and sequenced with T7 polymerase (United States 
Biochemical). Affected individuals are identified by mutations not present in unaffected 
individuals. 

Genomic rearrangements are also observed as a method of determining alterations 
in a gene corresponding to a polynucleotide. Genomic clones isolated according to 
Example 2 are nick-translated with digoxigenin-deoxyuridine 5'-triphosphate (Boehringer 
Mannheim), and FISH performed as described in Johnson et al., Methods Cell Biol. 
35:73-99 (1991). Hybridization with the labeled probe is carried out using a vast excess of 
human cot-1 DNA for specific hybridization to the corresponding genomic locus. 

Chromosomes are counterstained with 4',6-diamidino-2-phenylindole and propidium 
iodide, producing a combination of C- and R-bands. Aligned images for precise mapping 
are obtained using a triple-band filter set (Chroma Technology, Brattleboro, VT) in 
combination with a cooled charge-coupled device camera (Photometrics, Tucson, AZ) and 
variable excitation wavelength filters. (Johnson et al., Genet. Anal. Tech. Appl., 8:75 
(1991).) Image collection, analysis and chromosomal fractional length measurements are 
performed using the ISee Graphical Program System (Inovision Corporation, Durham, 
NC). Chromosome alterations of the genomic region hybridized by the probe are 
identified as insertions, deletions, and translocations. These alterations are used as a 
diagnostic marker for an associated disease. 

Example 22: Method of Detecting Abnormal Levels of a Polypeptide in a Biological 
Sample 

A polypeptide of the present invention can be detected in a biological sample, and 
if an increased or decreased level of the polypeptide is detected, this polypeptide is a 
marker for a particular phenotype. Methods of detection are numerous, and thus, it is 
understood that one skilled in the art can modify the following assay to fit their particular 
needs. 

For example, antibody-sandwich ELISAs are used to detect polypeptides in a 
sample, preferably a biological sample. Wells of a microtiter plate are coated with specific 
antibodies, at a final concentration of 0.2 to 10 ug/ml. The antibodies are either 
monoclonal or polyclonal and are produced by the method described in Example 10. The 
wells are blocked so that non-specific binding of the polypeptide to the well is reduced. 

The coated wells are then incubated for > 2 hours at RT with a sample containing 
the polypeptide. Preferably, serial dilutions of the sample should be used to validate 
results. The plates are then washed three times with deionized or distilled water to remove 
unbound polypeptide. 

Next, 50 ul of specific antibody-alkaline phosphatase conjugate, at a concentration 
of 25-400 ng, is added and incubated for 2 hours at room temperature. The plates are again 
washed three times with deionized or distilled water to remove unbound conjugate. 

Add 75 ul of 4-methylumbelliferyl phosphate (MUP) or p-nitrophenyl phosphate 
(NPP) substrate solution to each well and incubate 1 hour at room temperature. Measure 
the reaction with a microtiter plate reader. Prepare a standard curve, using serial dilutions of 
a control sample, and plot polypeptide concentration on the X-axis (log scale) and 
fluorescence or absorbance on the Y-axis (linear scale). Interpolate the concentration of 
the polypeptide in the sample using the standard curve. 
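
The interpolation step can be sketched as follows. This is a minimal illustration, not a prescribed analysis: it assumes the signal rises monotonically with concentration and interpolates linearly between neighbouring standards on a log10 concentration axis. The function name and the standard values are hypothetical. 

```python
import math


def interpolate_concentration(standards, signal):
    """Estimate concentration from a plate-reader signal.

    standards: list of (concentration, signal) pairs from the serial dilutions.
    Assumption: signal increases with concentration; interpolation is linear
    in log10(concentration) between the two bracketing standards.
    """
    points = sorted((s, math.log10(c)) for c, s in standards)
    for (s0, x0), (s1, x1) in zip(points, points[1:]):
        if s0 <= signal <= s1:
            fraction = (signal - s0) / (s1 - s0)
            return 10 ** (x0 + fraction * (x1 - x0))
    raise ValueError("signal is outside the range of the standard curve")


if __name__ == "__main__":
    # Hypothetical standards: (concentration in ng/ml, absorbance).
    curve = [(1, 0.1), (10, 0.5), (100, 0.9)]
    print(interpolate_concentration(curve, 0.7))
```

Signals outside the range of the standards raise an error instead of being extrapolated, which is consistent with using serial dilutions of the sample to bring readings onto the curve. 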

Example 23: Formulation 

The invention also provides methods of treatment and/or prevention of diseases or 
disorders (such as, for example, any one or more of the diseases or disorders disclosed 
herein) by administration to a subject of an effective amount of a Therapeutic. By 
Therapeutic is meant polynucleotides or polypeptides of the invention (including 
fragments and variants), agonists or antagonists thereof, and/or antibodies thereto, in 
combination with a pharmaceutically acceptable carrier type (e.g., a sterile carrier). 

The Therapeutic will be formulated and dosed in a fashion consistent with good 
medical practice, taking into account the clinical condition of the individual patient 
(especially the side effects of treatment with the Therapeutic alone), the site of delivery, 
the method of administration, the scheduling of administration, and other factors known to 
practitioners. The "effective amount" for purposes herein is thus determined by such 
considerations. 

As a general proposition, the total pharmaceutically effective amount of the 
Therapeutic administered parenterally per dose will be in the range of about 1 ug/kg/day to 
10 mg/kg/day of patient body weight, although, as noted above, this will be subject to 
therapeutic discretion. More preferably, this dose is at least 0.01 mg/kg/day, and most 
preferably for humans between about 0.01 and 1 mg/kg/day for the hormone. If given 
continuously, the Therapeutic is typically administered at a dose rate of about 1 ug/kg/hour 
to about 50 ug/kg/hour, either by 1-4 injections per day or by continuous subcutaneous 
infusions, for example, using a mini-pump. An intravenous bag solution may also be 
employed. The length of treatment needed to observe changes and the interval following 
treatment for responses to occur appear to vary depending on the desired effect. 
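
The per-kilogram ranges above can be put on a common footing with simple unit arithmetic. The following sketch is illustrative only: the 70 kg body weight is a hypothetical input, the function names are not from the text, and the calculation is a unit conversion rather than a dosing recommendation. 

```python
def daily_range_ug(weight_kg: int, low_ug_per_kg: int = 1, high_ug_per_kg: int = 10_000):
    """Total daily amount (ug) spanned by the 1 ug/kg/day to 10 mg/kg/day range."""
    return weight_kg * low_ug_per_kg, weight_kg * high_ug_per_kg


def hourly_to_daily_ug_per_kg(rate_ug_per_kg_hour: int) -> int:
    """Convert a continuous rate in ug/kg/hour to ug/kg/day."""
    return rate_ug_per_kg_hour * 24


if __name__ == "__main__":
    # Hypothetical 70 kg body weight.
    print(daily_range_ug(70))
    # Continuous rates of 1 and 50 ug/kg/hour expressed per day.
    print(hourly_to_daily_ug_per_kg(1), hourly_to_daily_ug_per_kg(50))
```

Expressed per day, the continuous rates of 1 to 50 ug/kg/hour correspond to 24 to 1200 ug/kg/day, which falls inside the broader per-dose range. 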

Therapeutics can be administered orally, rectally, parenterally, intracisternally, 
intravaginally, intraperitoneally, topically (as by powders, ointments, gels, drops or 
transdermal patch), buccally, or as an oral or nasal spray. "Pharmaceutically acceptable 
carrier" refers to a non-toxic solid, semisolid or liquid filler, diluent, encapsulating material 
or formulation auxiliary of any type. The term "parenteral" as used herein refers to modes of 
administration which include intravenous, intramuscular, intraperitoneal, intrasternal, 
subcutaneous and intraarticular injection and infusion. 

Therapeutics of the invention are also suitably administered by sustained-release 
systems. Suitable examples of sustained-release Therapeutics include suitable polymeric 
materials (such as, for example, semi-permeable polymer matrices in the form of shaped 
articles, e.g., films, or microcapsules), suitable hydrophobic materials (for example as an 
emulsion in an acceptable oil) or ion exchange resins, and sparingly soluble derivatives 
(such as, for example, a sparingly soluble salt). 

Sustained-release matrices include polylactides (U.S. Pat. No. 3,773,919, EP 
58,481), copolymers of L-glutamic acid and gamma-ethyl-L-glutamate (Sidman et al., 
Biopolymers 22:547-556 (1983)), poly(2-hydroxyethyl methacrylate) (Langer et al., J. 
Biomed. Mater. Res. 15:167-277 (1981), and Langer, Chem. Tech. 12:98-105 (1982)), 
ethylene vinyl acetate (Langer et al., Id.) or poly-D-(-)-3-hydroxybutyric acid (EP 
133,988). 

Sustained-release Therapeutics also include liposomally entrapped Therapeutics of 
the invention (see generally, Langer, Science 249:1527-1533 (1990); Treat et al., in 
Liposomes in the Therapy of Infectious Disease and Cancer, Lopez-Berestein and Fidler 
(eds.), Liss, New York, pp. 317-327 and 353-365 (1989)). Liposomes containing the 
Therapeutic are prepared by methods known per se: DE 3,218,121; Epstein et al., Proc. 
Natl. Acad. Sci. (USA) 82:3688-3692 (1985); Hwang et al., Proc. Natl. Acad. Sci. (USA) 
77:4030-4034 (1980); EP 52,322; EP 36,676; EP 88,046; EP 143,949; EP 142,641; 
Japanese Pat. Appl. 83-118008; U.S. Pat. Nos. 4,485,045 and 4,544,545; and EP 102,324. 
Ordinarily, the liposomes are of the small (about 200-800 Angstroms) unilamellar type in 
which the lipid content is greater than about 30 mol. percent cholesterol, the selected 
proportion being adjusted for the optimal Therapeutic. 

In yet an additional embodiment, the Therapeutics of the invention are delivered 
by way of a pump (see Langer, supra; Sefton, CRC Crit. Ref. Biomed. Eng. 14:201 (1987); 
Buchwald et al., Surgery 88:507 (1980); Saudek et al., N. Engl. J. Med. 321:574 (1989)). 

Other controlled release systems are discussed in the review by Langer (Science 
249:1527-1533 (1990)). 

For parenteral administration, in one embodiment, the Therapeutic is formulated 
generally by mixing it at the desired degree of purity, in a unit dosage injectable form 
(solution, suspension, or emulsion), with a pharmaceutically acceptable carrier, i.e., one 
that is non-toxic to recipients at the dosages and concentrations employed and is 
compatible with other ingredients of the formulation. For example, the formulation 
preferably does not include oxidizing agents and other compounds that are known to be 
deleterious to the Therapeutic. 

Generally, the formulations are prepared by contacting the Therapeutic uniformly 
and intimately with liquid carriers or finely divided solid carriers or both. Then, if 
necessary, the product is shaped into the desired formulation. Preferably the carrier is a 
parenteral carrier, more preferably a solution that is isotonic with the blood of the 
recipient. Examples of such carrier vehicles include water, saline, Ringer's solution, and 
dextrose solution. Non-aqueous vehicles such as fixed oils and ethyl oleate are also useful 
herein, as well as liposomes. 

The carrier suitably contains minor amounts of additives such as substances that 
enhance isotonicity and chemical stability. Such materials are non-toxic to recipients at 
the dosages and concentrations employed, and include buffers such as phosphate, citrate, 
succinate, acetic acid, and other organic acids or their salts; antioxidants such as ascorbic 
acid; low molecular weight (less than about ten residues) polypeptides, e.g., polyarginine 
or tripeptides; proteins, such as serum albumin, gelatin, or immunoglobulins; hydrophilic 
polymers such as polyvinylpyrrolidone; amino acids, such as glycine, glutamic acid, 
aspartic acid, or arginine; monosaccharides, disaccharides, and other carbohydrates 
including cellulose or its derivatives, glucose, mannose, or dextrins; chelating agents such as 
EDTA; sugar alcohols such as mannitol or sorbitol; counterions such as sodium; and/or 
nonionic surfactants such as polysorbates, poloxamers, or PEG. 

The Therapeutic is typically formulated in such vehicles at a concentration of about 
0.1 mg/ml to 100 mg/ml, preferably 1-10 mg/ml, at a pH of about 3 to 8. It will be 
understood that the use of certain of the foregoing excipients, carriers, or stabilizers will 
result in the formation of polypeptide salts. 

Any pharmaceutical used for therapeutic administration can be sterile. Sterility is 
readily accomplished by filtration through sterile filtration membranes (e.g., 0.2 micron 
membranes). Therapeutics generally are placed into a container having a sterile access 
port, for example, an intravenous solution bag or vial having a stopper pierceable by a 
hypodermic injection needle. 

Therapeutics ordinarily will be stored in unit or multi-dose containers, for example, 
sealed ampoules or vials, as an aqueous solution or as a lyophilized formulation for 
reconstitution. As an example of a lyophilized formulation, 10-ml vials are filled with 5 
ml of sterile-filtered 1% (w/v) aqueous Therapeutic solution, and the resulting mixture is 
lyophilized. The infusion solution is prepared by reconstituting the lyophilized 
Therapeutic using bacteriostatic Water-for-Injection. 
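
The amount of Therapeutic in each lyophilized vial follows from the fill volume and the weight/volume percentage. The following sketch shows the arithmetic, using the identity that 1% (w/v) is 1 g per 100 ml, i.e. 10 mg/ml; the function name is hypothetical. 

```python
def solute_mg(volume_ml: float, percent_w_v: float) -> float:
    """Mass of solute (mg) in `volume_ml` of a `percent_w_v` % (w/v) solution.

    1% (w/v) = 1 g per 100 ml = 10 mg/ml.
    """
    return volume_ml * percent_w_v * 10


if __name__ == "__main__":
    # 5 ml of a 1% (w/v) solution per vial, as in the example above.
    print(solute_mg(5, 1), "mg per vial")
```

The fill solution at 1% (w/v) corresponds to 10 mg/ml, which lies within the 0.1 to 100 mg/ml formulation range stated earlier in this Example. 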

The invention also provides a pharmaceutical pack or kit comprising one or more 
containers filled with one or more of the ingredients of the Therapeutics of the invention. 
Associated with such container(s) can be a notice in the form prescribed by a governmental 
agency regulating the manufacture, use or sale of pharmaceuticals or biological products, 
which notice reflects approval by the agency of manufacture, use or sale for human 
administration. In addition, the Therapeutics may be employed in conjunction with other 
therapeutic compounds. 

The Therapeutics of the invention may be administered alone or in combination 
with adjuvants. Adjuvants that may be administered with the Therapeutics of the invention 
include, but are not limited to, alum, alum plus deoxycholate (ImmunoAg), MTP-PE 
(Biocine Corp.), QS-21 (Genentech, Inc.), BCG, and MPL. In a specific embodiment, 
Therapeutics of the invention are administered in combination with alum. In another 
specific embodiment, Therapeutics of the invention are administered in combination with 
QS-21. Further adjuvants that may be administered with the Therapeutics of the invention 
include, but are not limited to, Monophosphoryl lipid immunomodulator, AdjuVax 100a, 
QS-21, QS-18, CRL1005, Aluminum salts, MF-59, and Virosomal adjuvant technology. 
Vaccines that may be administered with the Therapeutics of the invention include, but are 
not limited to, vaccines directed toward protection against MMR (measles, mumps, 
rubella), polio, varicella, tetanus/diphtheria, hepatitis A, hepatitis B, haemophilus influenzae 
B, whooping cough, pneumonia, influenza, Lyme Disease, rotavirus, cholera, yellow 
fever, Japanese encephalitis, poliomyelitis, rabies, typhoid fever, and pertussis. 
Combinations may be administered either concomitantly, e.g., as an admixture, separately 
but simultaneously or concurrently; or sequentially. This includes presentations in which 
the combined agents are administered together as a therapeutic mixture, and also 
procedures in which the combined agents are administered separately but simultaneously, 
e.g., as through separate intravenous lines into the same individual. Administration "in 
combination" further includes the separate administration of one of the compounds or 
agents given first, followed by the second. 

The Therapeutics of the invention may be administered alone or in combination 
with other therapeutic agents. Therapeutic agents that may be administered in combination 
with the Therapeutics of the invention include, but are not limited to, other members of the 
TNF family, chemotherapeutic agents, antibiotics, steroidal and non-steroidal anti- 
inflammatories, conventional immunotherapeutic agents, cytokines and/or growth factors. 
Combinations may be administered either concomitantly, e.g., as an admixture, separately 
but simultaneously or concurrently; or sequentially. This includes presentations in which 
the combined agents are administered together as a therapeutic mixture, and also 
procedures in which the combined agents are administered separately but simultaneously, 
e.g., as through separate intravenous lines into the same individual. Administration "in 
combination" further includes the separate administration of one of the compounds or 
agents given first, followed by the second. 

In one embodiment, the Therapeutics of the invention are administered in combination 
with members of the TNF family. TNF, TNF-related or TNF-like molecules that may be 
administered with the Therapeutics of the invention include, but are not limited to, soluble 
forms of TNF-alpha, lymphotoxin-alpha (LT-alpha, also known as TNF-beta), LT-beta 
(found in complex heterotrimer LT-alpha2-beta), OPGL, FasL, CD27L, CD30L, CD40L, 
4-1BBL, DcR3, OX40L, TNF-gamma (International Publication No. WO 96/14328), 
AIM-I (International Publication No. WO 97/33899), endokine-alpha (International 
Publication No. WO 98/07880), TR6 (International Publication No. WO 98/30694), OPG, 
and neutrokine-alpha (International Publication No. WO 98/18921), OX40, and nerve 
growth factor (NGF), and soluble forms of Fas, CD30, CD27, CD40 and 4-1BB, TR2 
(International Publication No. WO 96/34095), DR3 (International Publication No. WO 
97/33904), DR4 (International Publication No. WO 98/32856), TR5 (International 
Publication No. WO 98/30693), TR6 (International Publication No. WO 98/30694), TR7 
(International Publication No. WO 98/41629), TRANK, TR9 (International Publication 
No. WO 98/56892), TR10 (International Publication No. WO 98/54202), 312C2 
(International Publication No. WO 98/06842), and TR12, and soluble forms of CD154, 
CD70, and CD153. 

In certain embodiments, Therapeutics of the invention are administered in 
combination with antiretroviral agents, nucleoside reverse transcriptase inhibitors, 
non-nucleoside reverse transcriptase inhibitors, and/or protease inhibitors. Nucleoside reverse 
transcriptase inhibitors that may be administered in combination with the Therapeutics of 
the invention, include, but are not limited to, RETROVIR™ (zidovudine/AZT), VIDEX™ 
(didanosine/ddI), HIVID™ (zalcitabine/ddC), ZERIT™ (stavudine/d4T), EPIVIR™ 
(lamivudine/3TC), and COMBIVIR™ (zidovudine/lamivudine). Non-nucleoside reverse 
transcriptase inhibitors that may be administered in combination with the Therapeutics of 
the invention, include, but are not limited to, VIRAMUNE™ (nevirapine), 
RESCRIPTOR™ (delavirdine), and SUSTIVA™ (efavirenz). Protease inhibitors that may 
be administered in combination with the Therapeutics of the invention, include, but are not 
limited to, CRIXIVAN™ (indinavir), NORVIR™ (ritonavir), INVIRASE™ (saquinavir), 
and VIRACEPT™ (nelfinavir). In a specific embodiment, antiretroviral agents, 
nucleoside reverse transcriptase inhibitors, non-nucleoside reverse transcriptase inhibitors, 
and/or protease inhibitors may be used in any combination with Therapeutics of the 
invention to treat AIDS and/or to prevent or treat HIV infection. 

In other embodiments, Therapeutics of the invention may be administered in 
combination with anti-opportunistic infection agents. An ti -opportunistic agents that may 
5 be administered in combination with the Therapeutics of the invention, include, but are not 
limited to, TRIMETHOPRIM- SULFAMETHOXAZOLE™ , DAPSONE™, 
PENTAMIDINE™ , ATO V AQUONE™ , ISONIAZID™ , RIFAMPIN™, 
PYR AZINAMIDE™ , ETHAMBUTOL™, RIFABUTIN™, CLARITHROMYCIN™, 
AZITHROMYCIN™, GANCICLOVIR™, FO SCAR NET™ t CIDOFOVIR™ , 

10 FLUCONAZOLE™, ITRACONAZOLE™, KETOCONAZOLE™, ACYCLOVIR™, 
FAMCICOLVIR™, PYRIMETHAMINE™, LEUCOVORIN™ , NEUPOGEN™ 
(filgrastim/G-CSF), and LEUKINE™ (sargramostim/GM-CSF). In a specific 
embodiment, Therapeutics of the invention are used in any combination with 
TRIMETHOPRIM-SULFAMETHOXAZOLE™, DAPSONE™, PENTAMIDINE™,
and/or ATOVAQUONE™ to prophylactically treat or prevent an opportunistic
Pneumocystis carinii pneumonia infection. In another specific embodiment, Therapeutics
of the invention are used in any combination with ISONIAZID™, RIFAMPIN™,
PYRAZINAMIDE™, and/or ETHAMBUTOL™ to prophylactically treat or prevent an
opportunistic Mycobacterium avium complex infection. In another specific embodiment,
Therapeutics of the invention are used in any combination with RIFABUTIN™,
CLARITHROMYCIN™, and/or AZITHROMYCIN™ to prophylactically treat or prevent
an opportunistic Mycobacterium tuberculosis infection. In another specific embodiment,
Therapeutics of the invention are used in any combination with GANCICLOVIR™,
FOSCARNET™, and/or CIDOFOVIR™ to prophylactically treat or prevent an
opportunistic cytomegalovirus infection. In another specific embodiment, Therapeutics of
the invention are used in any combination with FLUCONAZOLE™,
ITRACONAZOLE™, and/or KETOCONAZOLE™ to prophylactically treat or prevent an
opportunistic fungal infection. In another specific embodiment, Therapeutics of the
invention are used in any combination with ACYCLOVIR™ and/or FAMCICLOVIR™ to
prophylactically treat or prevent an opportunistic herpes simplex virus type I and/or type II 
infection. In another specific embodiment, Therapeutics of the invention are used in any 
combination with PYRIMETHAMINE™ and/or LEUCOVORIN™ to prophylactically 
treat or prevent an opportunistic Toxoplasma gondii infection. In another specific
embodiment, Therapeutics of the invention are used in any combination with 
LEUCOVORIN™ and/or NEUPOGEN™ to prophylactically treat or prevent an 
opportunistic bacterial infection. 

In a further embodiment, the Therapeutics of the invention are administered in 
combination with an antiviral agent. Antiviral agents that may be administered with the
Therapeutics of the invention include, but are not limited to, acyclovir, ribavirin,
amantadine, and rimantadine.

In a further embodiment, the Therapeutics of the invention are administered in 
combination with an antibiotic agent. Antibiotic agents that may be administered with the 
Therapeutics of the invention include, but are not limited to, amoxicillin, beta-lactamases,
aminoglycosides, beta-lactam (glycopeptide), clindamycin,
chloramphenicol, cephalosporins, ciprofloxacin, erythromycin,
fluoroquinolones, macrolides, metronidazole, penicillins, quinolones, rifampin,
streptomycin, sulfonamide, tetracyclines, trimethoprim, trimethoprim-sulfamethoxazole,
and vancomycin.

Conventional nonspecific immunosuppressive agents that may be administered in
combination with the Therapeutics of the invention include, but are not limited to,
steroids, cyclosporine, cyclosporine analogs, cyclophosphamide, methylprednisone,
prednisone, azathioprine, FK-506, 15-deoxyspergualin, and other immunosuppressive
agents that act by suppressing the function of responding T cells.

In specific embodiments, Therapeutics of the invention are administered in 
combination with immunosuppressants. Immunosuppressant preparations that may be
administered with the Therapeutics of the invention include, but are not limited to,
ORTHOCLONE™ (OKT3), SANDIMMUNE™/NEORAL™/SANGCYA™ (cyclosporin),
PROGRAF™ (tacrolimus), CELLCEPT™ (mycophenolate), azathioprine,
glucocorticosteroids, and RAPAMUNE™ (sirolimus). In a specific embodiment,
immunosuppressants may be used to prevent rejection of organ or bone marrow
transplantation.

In an additional embodiment, Therapeutics of the invention are administered alone 
or in combination with one or more intravenous immune globulin preparations. 
Intravenous immune globulin preparations that may be administered with the Therapeutics 
of the invention include, but are not limited to, GAMMAR™, IVEEGAM™,
SANDOGLOBULIN™, GAMMAGARD S/D™, and GAMIMUNE™. In a specific
embodiment, Therapeutics of the invention are administered in combination with 
intravenous immune globulin preparations in transplantation therapy (e.g., bone marrow 
transplant). 

In an additional embodiment, the Therapeutics of the invention are administered 
alone or in combination with an anti-inflammatory agent. Anti-inflammatory agents that
may be administered with the Therapeutics of the invention include, but are not limited to, 
glucocorticoids and the nonsteroidal anti-inflammatories, aminoarylcarboxylic acid 
derivatives, arylacetic acid derivatives, arylbutyric acid derivatives, arylcarboxylic acids, 
arylpropionic acid derivatives, pyrazoles, pyrazolones, salicylic acid derivatives, 
thiazinecarboxamides, ε-acetamidocaproic acid, S-adenosylmethionine, 3-amino-4-
hydroxybutyric acid, amixetrine, bendazac, benzydamine, bucolome, difenpiramide,
ditazol, emorfazone, guaiazulene, nabumetone, nimesulide, orgotein, oxaceprol, 
paranyline, perisoxal, pifoxime, proquazone, proxazole, and tenidap. 

In another embodiment, compositions of the invention are administered in
combination with a chemotherapeutic agent. Chemotherapeutic agents that may be
administered with the Therapeutics of the invention include, but are not limited to, 
antibiotic derivatives (e.g., doxorubicin, bleomycin, daunorubicin, and dactinomycin); 
antiestrogens (e.g., tamoxifen); antimetabolites (e.g., fluorouracil, 5-FU, methotrexate,
floxuridine, interferon alpha-2b, glutamic acid, plicamycin, mercaptopurine, and 6-
thioguanine); cytotoxic agents (e.g., carmustine, BCNU, lomustine, CCNU, cytosine 
arabinoside, cyclophosphamide, estramustine, hydroxyurea, procarbazine, mitomycin, 
busulfan, cis-platin, and vincristine sulfate); hormones (e.g., medroxyprogesterone, 
estramustine phosphate sodium, ethinyl estradiol, estradiol, megestrol acetate, 
methyltestosterone, diethylstilbestrol diphosphate, chlorotrianisene, and testolactone);
nitrogen mustard derivatives (e.g., melphalan, chlorambucil, mechlorethamine (nitrogen
mustard) and thiotepa); steroids and combinations (e.g., betamethasone sodium
phosphate); and others (e.g., dacarbazine, asparaginase, mitotane, vincristine sulfate,
vinblastine sulfate, and etoposide). 

In a specific embodiment, Therapeutics of the invention are administered in 
combination with CHOP (cyclophosphamide, doxorubicin, vincristine, and prednisone) or 
any combination of the components of CHOP. In another embodiment, Therapeutics of 
the invention are administered in combination with Rituximab. In a further embodiment, 
Therapeutics of the invention are administered with Rituximab and CHOP, or Rituximab
and any combination of the components of CHOP. 

In an additional embodiment, the Therapeutics of the invention are administered in 
combination with cytokines. Cytokines that may be administered with the Therapeutics of 
the invention include, but are not limited to, IL2, IL3, IL4, IL5, IL6, IL7, IL10, IL12, 
IL13, IL15, anti-CD40, CD40L, IFN-gamma and TNF-alpha. In another embodiment, 
Therapeutics of the invention may be administered with any interleukin, including, but not 
limited to, IL-1alpha, IL-1beta, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, IL-11,
IL-12, IL-13, IL-14, IL-15, IL-16, IL-17, IL-18, IL-19, IL-20, and IL-21.

In an additional embodiment, the Therapeutics of the invention are administered in 
combination with angiogenic proteins. Angiogenic proteins that may be administered with 
the Therapeutics of the invention include, but are not limited to, Glioma Derived Growth 
Factor (GDGF), as disclosed in European Patent Number EP-399816; Platelet Derived 
Growth Factor-A (PDGF-A), as disclosed in European Patent Number EP-682110; Platelet
Derived Growth Factor-B (PDGF-B), as disclosed in European Patent Number EP-282317;
Placental Growth Factor (PlGF), as disclosed in International Publication Number WO
92/06194; Placental Growth Factor-2 (PlGF-2), as disclosed in Hauser et al., Growth
Factors, 4:259-268 (1993); Vascular Endothelial Growth Factor (VEGF), as disclosed in
International Publication Number WO 90/13649; Vascular Endothelial Growth Factor-A
(VEGF-A), as disclosed in European Patent Number EP-506477; Vascular Endothelial
Growth Factor-2 (VEGF-2), as disclosed in International Publication Number WO
96/39515; Vascular Endothelial Growth Factor B (VEGF-3); Vascular Endothelial Growth
Factor B-186 (VEGF-B186), as disclosed in International Publication Number WO
96/26736; Vascular Endothelial Growth Factor-D (VEGF-D), as disclosed in International
Publication Number WO 98/02543; Vascular Endothelial Growth Factor-D (VEGF-D), as
disclosed in International Publication Number WO 98/07832; and Vascular Endothelial
Growth Factor-E (VEGF-E), as disclosed in German Patent Number DE19639601. The
above mentioned references are incorporated herein by reference.

In an additional embodiment, the Therapeutics of the invention are administered in
combination with hematopoietic growth factors. Hematopoietic growth factors that may
be administered with the Therapeutics of the invention include, but are not limited to, 
LEUKINE™ (SARGRAMOSTIM™) and NEUPOGEN™ (FILGRASTIM™). 

In an additional embodiment, the Therapeutics of the invention are administered in
combination with Fibroblast Growth Factors. Fibroblast Growth Factors that may be
administered with the Therapeutics of the invention include, but are not limited to, FGF-1, 
FGF-2, FGF-3, FGF-4, FGF-5, FGF-6, FGF-7, FGF-8, FGF-9, FGF-10, FGF-11, FGF-12, 
FGF-13, FGF-14, and FGF-15. 

In additional embodiments, the Therapeutics of the invention are administered in 
combination with other therapeutic or prophylactic regimens, such as, for example,
radiation therapy. 

Example 24: Method of Treating Decreased Levels of the Polypeptide

The present invention relates to a method for treating an individual in need of an
increased level of a polypeptide of the invention in the body comprising administering to 
such an individual a composition comprising a therapeutically effective amount of an 
agonist of the invention (including polypeptides of the invention). Moreover, it will be 
appreciated that conditions caused by a decrease in the standard or normal expression level 
of a secreted protein in an individual can be treated by administering the polypeptide of the 
present invention, preferably in the secreted form. Thus, the invention also provides a 
method of treatment of an individual in need of an increased level of the polypeptide 
comprising administering to such an individual a Therapeutic comprising an amount of the 
polypeptide to increase the activity level of the polypeptide in such an individual. 

For example, a patient with decreased levels of a polypeptide receives a daily dose
of 0.1-100 µg/kg of the polypeptide for six consecutive days. Preferably, the polypeptide is in
the secreted form. The exact details of the dosing scheme, based on administration and 
formulation, are provided in Example 23. 
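
The arithmetic behind such a dosing scheme can be sketched as follows. This is an illustrative calculation only, not a clinical tool: the function names and the 70 kg body weight used in the usage lines are assumptions introduced here, while the 0.1-100 µg/kg daily range and the six-day course come from the example above.

```python
def daily_dose_range_ug(weight_kg, low_ug_per_kg=0.1, high_ug_per_kg=100.0):
    """Return (min, max) daily polypeptide dose in micrograms for a body weight."""
    if weight_kg <= 0:
        raise ValueError("weight_kg must be positive")
    return weight_kg * low_ug_per_kg, weight_kg * high_ug_per_kg


def course_total_range_ug(weight_kg, days=6):
    """Return (min, max) cumulative dose in micrograms over consecutive daily doses."""
    low, high = daily_dose_range_ug(weight_kg)
    return low * days, high * days


# Hypothetical 70 kg patient (assumed weight, not taken from the text).
daily_low, daily_high = daily_dose_range_ug(70.0)
total_low, total_high = course_total_range_ug(70.0)
print(f"daily: {daily_low:.1f}-{daily_high:.1f} ug; six-day course: {total_low:.1f}-{total_high:.1f} ug")
```

The wide span between the lower and upper bounds is a property of the stated range itself; the exact dose within it depends on the administration and formulation details referred to in Example 23.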

Example 25: Method of Treating Increased Levels of the Polypeptide

The present invention also relates to a method of treating an individual in need of a 
decreased level of a polypeptide of the invention in the body comprising administering to 
such an individual a composition comprising a therapeutically effective amount of an 
antagonist of the invention (including polypeptides and antibodies of the invention). 

In one example, antisense technology is used to inhibit production of a polypeptide 
of the present invention. This technology is one example of a method of decreasing levels 
of a polypeptide, preferably a secreted form, due to a variety of etiologies, such as cancer. 
For example, a patient diagnosed with abnormally increased levels of a polypeptide is 
administered antisense polynucleotides intravenously at 0.5, 1.0, 1.5, 2.0 and 3.0 mg/kg/day
for 21 days. This treatment is repeated after a 7-day rest period if the treatment was
well tolerated. The formulation of the antisense polynucleotide is provided in Example 23. 
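
The schedule just described (21 days of dosing, a 7-day rest, then a repeat) can be laid out as a short sketch. The text does not say whether the five dose levels are separate cohorts or an escalation, so the code below simply treats each level independently; the function names and the two-cycle default are assumptions made for illustration, and the tolerability check that gates the repeat is not modeled.

```python
DOSE_LEVELS_MG_PER_KG_DAY = (0.5, 1.0, 1.5, 2.0, 3.0)  # dose levels from the example
TREATMENT_DAYS = 21
REST_DAYS = 7


def cycle_totals_mg(weight_kg):
    """Map each dose level to the total antisense mass (mg) given in one 21-day cycle."""
    return {
        level: level * weight_kg * TREATMENT_DAYS
        for level in DOSE_LEVELS_MG_PER_KG_DAY
    }


def is_dosing_day(day_index, cycles=2):
    """True if the zero-based study day falls in a treatment block, not a rest block.

    Assumes `cycles` treatment blocks separated by rest periods (illustrative only).
    """
    period = TREATMENT_DAYS + REST_DAYS
    last_day = cycles * period - REST_DAYS  # no rest block after the final cycle
    if day_index < 0 or day_index >= last_day:
        return False
    return day_index % period < TREATMENT_DAYS


# Hypothetical 70 kg patient (assumed weight).
for level, total in cycle_totals_mg(70.0).items():
    print(f"{level} mg/kg/day -> {total:.0f} mg per cycle")
```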

Example 26: Method of Treatment Using Gene Therapy - Ex Vivo

One method of gene therapy transplants fibroblasts, which are capable of 
expressing a polypeptide, onto a patient. Generally, fibroblasts are obtained from a subject 
by skin biopsy. The resulting tissue is placed in tissue-culture medium and separated into 
small pieces. Small chunks of the tissue are placed on a wet surface of a tissue culture
flask; approximately ten pieces are placed in each flask. The flask is turned upside down,
closed tight and left at room temperature overnight. After 24 hours at room temperature,
the flask is inverted, the chunks of tissue remain fixed to the bottom of the flask, and
fresh media (e.g., Ham's F12 media, with 10% FBS, penicillin and streptomycin) is added.
The flasks are then incubated at 37°C for approximately one week.

At this time, fresh media is added and subsequently changed every several days. 
After an additional two weeks in culture, a monolayer of fibroblasts emerges. The
monolayer is trypsinized and scaled into larger flasks. 

pMV-7 (Kirschmeier, P.T. et al., DNA, 7:219-25 (1988)), flanked by the long 
terminal repeats of the Moloney murine sarcoma virus, is digested with EcoRI and HindIII
and subsequently treated with calf intestinal phosphatase. The linear vector is fractionated 
on agarose gel and purified, using glass beads. 

The cDNA encoding a polypeptide of the present invention can be amplified using
PCR primers which correspond to the 5' and 3' end sequences, respectively, as set forth in
Example 1, the primers having appropriate restriction sites and initiation/stop
codons, if necessary. Preferably, the 5' primer contains an EcoRI site and the 3' primer
includes a HindIII site. Equal quantities of the Moloney murine sarcoma virus linear
backbone and the amplified EcoRI and HindIII fragment are added together, in the
presence of T4 DNA ligase. The resulting mixture is maintained under conditions 
appropriate for ligation of the two fragments. The ligation mixture is then used to 
transform bacteria HB101, which are then plated onto agar containing kanamycin for the
purpose of confirming that the vector has the gene of interest properly inserted. 
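
The primer design described above, an EcoRI site on the 5' primer and a HindIII site on the 3' primer, can be sketched in a few lines. The recognition sequences (GAATTC and AAGCTT) are standard; the 18-nucleotide annealing length, the two-base 5' clamp, the function names, and the toy cDNA are all assumptions made for illustration and are not taken from the text.

```python
ECORI_SITE = "GAATTC"    # EcoRI recognition sequence
HINDIII_SITE = "AAGCTT"  # HindIII recognition sequence
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def design_primers(cdna, anneal_len=18, clamp="GC"):
    """Return (forward, reverse) primers carrying EcoRI and HindIII sites.

    Raises ValueError if the cDNA already contains either site, since the
    insert would then be cut internally during the digest. Both sites are
    palindromic, so scanning one strand is sufficient.
    """
    cdna = cdna.upper()
    if len(cdna) < 2 * anneal_len:
        raise ValueError("cDNA is too short for the requested annealing length")
    for name, site in (("EcoRI", ECORI_SITE), ("HindIII", HINDIII_SITE)):
        if site in cdna:
            raise ValueError(f"cDNA contains an internal {name} site")
    forward = clamp + ECORI_SITE + cdna[:anneal_len]
    reverse = clamp + HINDIII_SITE + reverse_complement(cdna[-anneal_len:])
    return forward, reverse


# Toy open reading frame (hypothetical sequence, for illustration only).
toy_cdna = "ATG" + "GCC" * 12 + "TAA"
fwd, rev = design_primers(toy_cdna)
print(fwd)
print(rev)
```

Using two different enzymes fixes the orientation of the insert in the EcoRI/HindIII-digested vector, which is why the sketch rejects inserts that carry either site internally.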

The amphotropic pA317 or GP+am12 packaging cells are grown in tissue culture to
confluent density in Dulbecco's Modified Eagles Medium (DMEM) with 10% calf serum
(CS), penicillin and streptomycin. The MSV vector containing the gene is then added to
the media and the packaging cells transduced with the vector. The packaging cells now 
produce infectious viral particles containing the gene (the packaging cells are now referred 
to as producer cells). 

Fresh media is added to the transduced producer cells, and subsequently, the media 
is harvested from a 10 cm plate of confluent producer cells. The spent media, containing 
the infectious viral particles, is filtered through a millipore filter to remove detached 
producer cells and this media is then used to infect fibroblast cells. Media is removed from 
a sub-confluent plate of fibroblasts and quickly replaced with the media from the producer 
cells. This media is removed and replaced with fresh media. If the titer of virus is high, 
then virtually all fibroblasts will be infected and no selection is required. If the titer is very 
low, then it is necessary to use a retroviral vector that has a selectable marker, such as neo 
or his. Once the fibroblasts have been efficiently infected, the fibroblasts are analyzed to 
determine whether protein is produced. 

The engineered fibroblasts are then transplanted onto the host, either alone or after 
having been grown to confluence on cytodex 3 microcarrier beads. 

Example 27: Gene Therapy Using Endogenous Genes Corresponding To
Polynucleotides of the Invention 

Another method of gene therapy according to the present invention involves 
operably associating the endogenous polynucleotide sequence of the invention with a 
promoter via homologous recombination as described, for example, in U.S. Patent NO: 
5,641,670, issued June 24, 1997; International Publication NO: WO 96/29411, published 
September 26, 1996; International Publication NO: WO 94/12650, published August 4, 
1994; Koller et al., Proc. Natl. Acad. Sci. USA, 86:8932-8935 (1989); and Zijlstra et al.,
Nature, 342:435-438 (1989). This method involves the activation of a gene which is
present in the target cells, but which is not expressed in the cells, or is expressed at a lower
level than desired. 

Polynucleotide constructs are made which contain a promoter and targeting 
sequences, which are homologous to the 5' non-coding sequence of the endogenous
polynucleotide sequence, flanking the promoter. The targeting sequence will be
sufficiently near the 5' end of the polynucleotide sequence so the promoter will be operably
linked to the endogenous sequence upon homologous recombination. The promoter and 
the targeting sequences can be amplified using PCR. Preferably, the amplified promoter 
contains distinct restriction enzyme sites on the 5' and 3' ends. Preferably, the 3' end of the 
first targeting sequence contains the same restriction enzyme site as the 5' end of the 
amplified promoter and the 5' end of the second targeting sequence contains the same 
restriction site as the 3' end of the amplified promoter. 

The amplified promoter and the amplified targeting sequences are digested with the 
appropriate restriction enzymes and subsequently treated with calf intestinal phosphatase. 
The digested promoter and digested targeting sequences are added together in the presence 
of T4 DNA ligase. The resulting mixture is maintained under conditions appropriate for 
ligation of the two fragments. The construct is size fractionated on an agarose gel then 
purified by phenol extraction and ethanol precipitation. 

In this Example, the polynucleotide constructs are administered as naked 
polynucleotides via electroporation. However, the polynucleotide constructs may also be 
administered with transfection-facilitating agents, such as liposomes, viral sequences, viral
particles, precipitating agents, etc. Such methods of delivery are known in the art. 

Once the cells are transfected, homologous recombination will take place which 
results in the promoter being operably linked to the endogenous polynucleotide sequence. 
This results in the expression of polynucleotide corresponding to the polynucleotide in the 
cell. Expression may be detected by immunological staining, or any other method known 
in the art.

Fibroblasts are obtained from a subject by skin biopsy. The resulting tissue is
placed in DMEM + 10% fetal calf serum. Exponentially growing or early stationary phase
fibroblasts are trypsinized and rinsed from the plastic surface with nutrient medium. An
aliquot of the cell suspension is removed for counting, and the remaining cells are
subjected to centrifugation. The supernatant is aspirated and the pellet is resuspended in 5
ml of electroporation buffer (20 mM HEPES pH 7.3, 137 mM NaCl, 5 mM KCl, 0.7 mM
Na2HPO4, 6 mM dextrose). The cells are recentrifuged, the supernatant aspirated, and the
cells resuspended in electroporation buffer containing 1 mg/ml acetylated bovine serum
albumin. The final cell suspension contains approximately 3 x 10^6 cells/ml. Electroporation
should be performed immediately following resuspension.
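
As a worked illustration of the buffer recipe above, the following sketch converts the stated millimolar concentrations into masses for a chosen volume and computes the number of cells in a given volume of the final suspension. The molar masses are standard values for the free-acid or anhydrous forms, which is an assumption about the reagents actually used; the function names are hypothetical, and pH adjustment to 7.3 is not modeled.

```python
# Concentrations (mM) are taken from the buffer recipe in the text.
BUFFER_MM = {
    "HEPES": 20.0,
    "NaCl": 137.0,
    "KCl": 5.0,
    "Na2HPO4": 0.7,
    "dextrose": 6.0,
}

# Molar masses in g/mol (assumed reagent forms: HEPES free acid,
# anhydrous Na2HPO4, anhydrous D-glucose).
MOLAR_MASS = {
    "HEPES": 238.30,
    "NaCl": 58.44,
    "KCl": 74.55,
    "Na2HPO4": 141.96,
    "dextrose": 180.16,
}


def grams_needed(volume_l):
    """Return grams of each component required for `volume_l` litres of buffer."""
    if volume_l <= 0:
        raise ValueError("volume_l must be positive")
    return {
        name: mm / 1000.0 * volume_l * MOLAR_MASS[name]
        for name, mm in BUFFER_MM.items()
    }


def cells_in_volume(volume_ml, density_per_ml=3e6):
    """Return the number of cells in `volume_ml` at the stated suspension density."""
    return volume_ml * density_per_ml


for name, grams in grams_needed(0.5).items():  # 500 ml batch (assumed batch size)
    print(f"{name}: {grams:.3f} g")
print(f"cells in 0.5 ml: {cells_in_volume(0.5):.2e}")
```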

Plasmid DNA is prepared according to standard techniques. For example, to 
construct a plasmid for targeting to the locus corresponding to the polynucleotide of the 
invention, plasmid pUC18 (MBI Fermentas, Amherst, NY) is digested with HindIII. The
CMV promoter is amplified by PCR with an XbaI site on the 5' end and a BamHI site on
the 3' end. Two non-coding sequences are amplified via PCR: one non-coding sequence
(fragment 1) is amplified with a HindIII site at the 5' end and an XbaI site at the 3' end; the
other non-coding sequence (fragment 2) is amplified with a BamHI site at the 5' end and a
HindIII site at the 3' end. The CMV promoter and the fragments (1 and 2) are digested with
the appropriate enzymes (CMV promoter - XbaI and BamHI; fragment 1 - XbaI; fragment
2 - BamHI) and ligated together. The resulting ligation product is digested with HindIII,
and ligated with the HindIII-digested pUC18 plasmid.
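
The logic of this three-piece assembly is that each junction is formed by a shared restriction site, and the outer ends match the site used to open the vector. A minimal sketch of that compatibility check is given below; the data layout and function name are hypothetical and only the site assignments come from the text.

```python
# Each entry: (name, site at 5' end, site at 3' end), as assigned in the text.
FRAGMENTS = [
    ("fragment 1", "HindIII", "XbaI"),
    ("CMV promoter", "XbaI", "BamHI"),
    ("fragment 2", "BamHI", "HindIII"),
]


def check_assembly(fragments, vector_site="HindIII"):
    """Return True if neighbouring ends share a site and the outer ends fit the vector."""
    if not fragments:
        return False
    # Every 3' site must match the 5' site of the fragment that follows it.
    for (_, _, right_site), (_, left_site, _) in zip(fragments, fragments[1:]):
        if right_site != left_site:
            return False
    # The assembled insert must present the vector's site at both outer ends.
    return fragments[0][1] == vector_site and fragments[-1][2] == vector_site


print(check_assembly(FRAGMENTS))
```

Because both outer ends carry the same HindIII site, the assembled insert can ligate into the vector in either orientation, so clones would normally be screened to confirm the arrangement.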

Plasmid DNA is added to a sterile cuvette with a 0.4 cm electrode gap (Bio-Rad). 
The final DNA concentration is generally at least 120 µg/ml. 0.5 ml of the cell suspension
(containing approximately 1.5 x 10^6 cells) is then added to the cuvette, and the cell
suspension and DNA solutions are gently mixed. Electroporation is performed with a
Gene-Pulser apparatus (Bio-Rad). Capacitance and voltage are set at 960 µF and 250-300
V, respectively. As voltage increases, cell survival decreases, but the percentage of
surviving cells that stably incorporate the introduced DNA into their genome increases
dramatically. Given these parameters, a pulse time of approximately 14-20 msec should be
observed. 
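
The electroporation settings above can be related to one another with two simple formulas: field strength is the voltage divided by the electrode gap, and, for an exponential-decay pulse, the time constant equals resistance times capacitance. The sketch below assumes that the quoted "pulse time" is that time constant, which the text does not state explicitly; the function names are hypothetical.

```python
def field_strength_v_per_cm(voltage_v, gap_cm=0.4):
    """Nominal field strength across the cuvette (V/cm) for the 0.4 cm gap in the text."""
    if gap_cm <= 0:
        raise ValueError("gap_cm must be positive")
    return voltage_v / gap_cm


def implied_resistance_ohm(time_constant_ms, capacitance_uf=960.0):
    """Sample resistance implied by tau = R * C (assumes an exponential-decay pulse)."""
    if capacitance_uf <= 0:
        raise ValueError("capacitance_uf must be positive")
    return (time_constant_ms * 1e-3) / (capacitance_uf * 1e-6)


for volts in (250.0, 300.0):
    print(f"{volts:.0f} V -> {field_strength_v_per_cm(volts):.0f} V/cm")
for tau_ms in (14.0, 20.0):
    print(f"{tau_ms:.0f} ms -> {implied_resistance_ohm(tau_ms):.1f} ohm")
```

Under that assumption, a time constant well outside the 14-20 msec window would suggest that the sample resistance, and hence the buffer composition or volume, differs from what the protocol expects.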

Electroporated cells are maintained at room temperature for approximately 5 min, 
and the contents of the cuvette are then gently removed with a sterile transfer pipette. The 
cells are added directly to 10 ml of prewarmed nutrient media (DMEM with 15% calf 
serum) in a 10 cm dish and incubated at 37°C. The following day, the media is
aspirated and replaced with 10 ml of fresh media and incubated for a further 16-24 hours. 

The engineered fibroblasts are then injected into the host, either alone or after 
having been grown to confluence on cytodex 3 microcarrier beads. The fibroblasts now 
produce the protein product. The fibroblasts can then be introduced into a patient as 
described above. 


Example 28: Method of Treatment Using Gene Therapy - In Vivo

Another aspect of the present invention is using in vivo gene therapy methods to 
treat disorders, diseases and conditions. The gene therapy method relates to the 
introduction of naked nucleic acid (DNA, RNA, and antisense DNA or RNA) sequences 
into an animal to increase or decrease the expression of the polypeptide. The 
polynucleotide of the present invention may be operatively linked to a promoter or any 
other genetic elements necessary for the expression of the polypeptide by the target tissue. 
Such gene therapy and delivery techniques and methods are known in the art, see, for 
example, WO90/11092, WO98/11779; U.S. Patent NO. 5693622, 5705151, 5580859;
Tabata et al., Cardiovasc. Res. 35(3):470-479 (1997); Chao et al., Pharmacol. Res.
35(6):517-522 (1997); Wolff, Neuromuscul. Disord. 7(5):314-318 (1997); Schwartz et al.,
Gene Ther. 3(5):405-411 (1996); Tsurumi et al., Circulation 94(12):3281-3290 (1996) 
(incorporated herein by reference). 

The polynucleotide constructs may be delivered by any method that delivers 
injectable materials to the cells of an animal, such as, injection into the interstitial space of 
tissues (heart, muscle, skin, lung, liver, intestine and the like). The polynucleotide
constructs can be delivered in a pharmaceutically acceptable liquid or aqueous carrier.

The term "naked" polynucleotide, DNA or RNA, refers to sequences that are free 
from any delivery vehicle that acts to assist, promote, or facilitate entry into the cell, 
including viral sequences, viral particles, liposome formulations, lipofectin or precipitating 
agents and the like. However, the polynucleotides of the present invention may also be 
delivered in liposome formulations (such as those taught in Felgner P.L. et al. (1995) Ann.
NY Acad. Sci. 772:126-139 and Abdallah B. et al. (1995) Biol. Cell 85(1):1-7) which can
be prepared by methods well known to those skilled in the art. 

The polynucleotide vector constructs used in the gene therapy method are 
preferably constructs that will not integrate into the host genome nor will they contain 
sequences that allow for replication. Any strong promoter known to those skilled in the art 
can be used for driving the expression of DNA. Unlike other gene therapy techniques,
one major advantage of introducing naked nucleic acid sequences into target cells is the 
transitory nature of the polynucleotide synthesis in the cells. Studies have shown that non- 
replicating DNA sequences can be introduced into cells to provide production of the 
desired polypeptide for periods of up to six months. 

The polynucleotide construct can be delivered to the interstitial space of tissues 
within an animal, including that of muscle, skin, brain, lung, liver, spleen, bone marrow,
thymus, heart, lymph, blood, bone, cartilage, pancreas, kidney, gall bladder, stomach, 
intestine, testis, ovary, uterus, rectum, nervous system, eye, gland, and connective tissue. 
Interstitial space of the tissues comprises the intercellular fluid, mucopolysaccharide 
matrix among the reticular fibers of organ tissues, elastic fibers in the walls of vessels or 
chambers, collagen fibers of fibrous tissues, or that same matrix within connective tissue 
ensheathing muscle cells or in the lacunae of bone. It is similarly the space occupied by 
the plasma of the circulation and the lymph fluid of the lymphatic channels. Delivery to 
the interstitial space of muscle tissue is preferred for the reasons discussed below. They 
may be conveniently delivered by injection into the tissues comprising these cells. They 
are preferably delivered to and expressed in persistent, non-dividing cells which are
differentiated, although delivery and expression may be achieved in non-differentiated or
less completely differentiated cells, such as, for example, stem cells of blood or skin 
fibroblasts. In vivo muscle cells are particularly competent in their ability to take up and 
express polynucleotides. 

For the naked polynucleotide injection, an effective dosage amount of DNA or 
RNA will be in the range of from about 0.05 µg/kg body weight to about 50 mg/kg body
weight. Preferably the dosage will be from about 0.005 mg/kg to about 20 mg/kg and 
more preferably from about 0.05 mg/kg to about 5 mg/kg. Of course, as the artisan of 
ordinary skill will appreciate, this dosage will vary according to the tissue site of injection. 
The appropriate and effective dosage of nucleic acid sequence can readily be determined 
by those of ordinary skill in the art and may depend on the condition being treated and the 
route of administration. The preferred route of administration is by the parenteral route of 
injection into the interstitial space of tissues. However, other parenteral routes may also be 
used, such as, inhalation of an aerosol formulation particularly for delivery to lungs or 
bronchial tissues, throat or mucous membranes of the nose. In addition, naked 
polynucleotide constructs can be delivered to arteries during angioplasty by the catheter 
used in the procedure. 
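
The nested dosage ranges above lend themselves to a small lookup. The sketch below encodes only the "preferred" (0.005-20 mg/kg) and "more preferred" (0.05-5 mg/kg) ranges, converts them to absolute amounts for a given body weight, and reports the narrowest range that contains a candidate dose. The names and the 25 g mouse weight in the usage lines are assumptions for illustration; the text notes that the actual dose varies with the injection site, condition, and route.

```python
# (low, high) bounds in mg per kg body weight, from the paragraph above.
DOSE_TIERS_MG_PER_KG = {
    "preferred": (0.005, 20.0),
    "more_preferred": (0.05, 5.0),
}


def dose_window_mg(weight_kg, tier="more_preferred"):
    """Return the (min, max) absolute dose in mg for a body weight and tier."""
    if weight_kg <= 0:
        raise ValueError("weight_kg must be positive")
    low, high = DOSE_TIERS_MG_PER_KG[tier]
    return low * weight_kg, high * weight_kg


def narrowest_tier(dose_mg_per_kg):
    """Return the most specific tier containing the dose, or None if outside both."""
    for tier in ("more_preferred", "preferred"):
        low, high = DOSE_TIERS_MG_PER_KG[tier]
        if low <= dose_mg_per_kg <= high:
            return tier
    return None


# Hypothetical 25 g mouse (assumed weight, not taken from the text).
low_mg, high_mg = dose_window_mg(0.025)
print(f"more preferred window: {low_mg * 1000:.2f}-{high_mg * 1000:.0f} ug")
print(narrowest_tier(10.0))
```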

The dose response effects of injected polynucleotide in muscle in vivo are
determined as follows. Suitable template DNA for production of mRNA coding for 
polypeptide of the present invention is prepared in accordance with a standard recombinant 
DNA methodology. The template DNA, which may be either circular or linear, is either 
used as naked DNA or complexed with liposomes. The quadriceps muscles of mice are 
then injected with various amounts of the template DNA. 

Five to six week old female and male Balb/C mice are anesthetized by 
intraperitoneal injection with 0.3 ml of 2.5% Avertin. A 1.5 cm incision is made on the 
anterior thigh, and the quadriceps muscle is directly visualized. The template DNA is 
injected in 0.1 ml of carrier in a 1 cc syringe through a 27 gauge needle over one minute, 
approximately 0.5 cm from the distal insertion site of the muscle into the knee and about
0.2 cm deep. A suture is placed over the injection site for future localization, and the skin
is closed with stainless steel clips. 

After an appropriate incubation time (e.g., 7 days) muscle extracts are prepared by 
excising the entire quadriceps. Every fifth 15 µm cross-section of the individual
quadriceps muscles is histochemically stained for protein expression. A time course for
protein expression may be done in a similar fashion except that quadriceps from different 
mice are harvested at different times. Persistence of DNA in muscle following injection 
may be determined by Southern blot analysis after preparing total cellular DNA and HIRT 
supernatants from injected and control mice. The results of the above experimentation in 
mice can be used to extrapolate proper dosages and other treatment parameters in humans
and other animals using naked DNA. 

Example 29: Transgenic Animals. 

The polypeptides of the invention can also be expressed in transgenic animals.
Animals of any species, including, but not limited to, mice, rats, rabbits, hamsters, guinea
pigs, pigs, micro-pigs, goats, sheep, cows and non-human primates, e.g., baboons, 
monkeys, and chimpanzees may be used to generate transgenic animals. In a specific 
embodiment, techniques described herein or otherwise known in the art, are used to 
express polypeptides of the invention in humans, as part of a gene therapy protocol. 

Any technique known in the art may be used to introduce the transgene (i.e.,
polynucleotides of the invention) into animals to produce the founder lines of transgenic
animals. Such techniques include, but are not limited to, pronuclear microinjection 
(Paterson et al., Appl. Microbiol. Biotechnol. 40:691-698 (1994); Carver et al., 
Biotechnology (NY) 11:1263-1270 (1993); Wright et al., Biotechnology (NY) 9:830-834
(1991); and Hoppe et al., U.S. Pat. No. 4,873,191 (1989)); retrovirus mediated gene
transfer into germ lines (Van der Putten et al., Proc. Natl. Acad. Sci., USA 82:6148-6152 
(1985)), blastocysts or embryos; gene targeting in embryonic stem cells (Thompson et al., 
Cell 56:313-321 (1989)); electroporation of cells or embryos (Lo, Mol. Cell. Biol.
3:1803-1814 (1983)); introduction of the polynucleotides of the invention using a gene gun
(see, e.g., Ulmer et al., Science 259:1745 (1993)); introducing nucleic acid constructs into
embryonic pluripotent stem cells and transferring the stem cells back into the blastocyst;
and sperm-mediated gene transfer (Lavitrano et al., Cell 57:717-723 (1989)); etc. For a
review of such techniques, see Gordon, "Transgenic Animals," Intl. Rev. Cytol.
115:171-229 (1989), which is incorporated by reference herein in its entirety.

Any technique known in the art may be used to produce transgenic clones 
containing polynucleotides of the invention, for example, nuclear transfer into enucleated 
oocytes of nuclei from cultured embryonic, fetal, or adult cells induced to quiescence
(Campbell et al., Nature 380:64-66 (1996); Wilmut et al., Nature 385:810-813 (1997)).

The present invention provides for transgenic animals that carry the transgene in all 
their cells, as well as animals which carry the transgene in some, but not all their cells, i.e., 
mosaic or chimeric animals. The transgene may be integrated as a single transgene or as
multiple copies such as in concatamers, e.g., head-to-head tandems or head-to-tail
tandems. The transgene may also be selectively introduced into and activated in a
particular cell type by following, for example, the teaching of Lasko et al. (Lasko et al., 
Proc. Natl. Acad. Sci. USA 89:6232-6236 (1992)). The regulatory sequences required for 
such a cell -type specific activation will depend upon the particular cell type of interest, and 
will be apparent to those of skill in the art. When it is desired that the polynucleotide 

20 transgene be integrated into the chromosomal site of the endogenous gene, gene targeting 
is preferred. Briefly, when such a technique is to be utilized, vectors containing some 
nucleotide sequences homologous to the endogenous gene are designed for the purpose of 
integrating, via homologous recombination with chromosomal sequences, into and 
disrupting the function of the nucleotide sequence of the endogenous gene. The transgene 

25 may also be selectively introduced into a particular cell type, thus inactivating the 
endogenous gene in only that cell type, by following, for example, the teaching of Gu et al. 
(Gu et al., Science 265:103-106 (1994)). The regulatory sequences required for such a 
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cell-type specific inactivation will depend upon the particular cell type of interest, and will 
be apparent to those of skill in the art. 

Once transgenic animals have been generated, the expression of the recombinant 
gene may be assayed utilizing standard techniques. Initial screening may be accomplished 
by Southern blot analysis or PCR techniques to analyze animal tissues to verify that 
integration of the transgene has taken place. The level of mRNA expression of the 
transgene in the tissues of the transgenic animals may also be assessed using techniques 
which include, but are not limited to, Northern blot analysis of tissue samples obtained 
from the animal, in situ hybridization analysis, and reverse transcriptase-PCR (RT-PCR). 
Samples of transgenic gene-expressing tissue may also be evaluated 
immunocytochemically or immunohistochemically using antibodies specific for the 
transgene product. 

Once the founder animals are produced, they may be bred, inbred, outbred, or 
crossbred to produce colonies of the particular animal. Examples of such breeding 
strategies include, but are not limited to: outbreeding of founder animals with more than 
one integration site in order to establish separate lines; inbreeding of separate lines in order 
to produce compound transgenics that express the transgene at higher levels because of the 
effects of additive expression of each transgene; crossing of heterozygous transgenic 
animals to produce animals homozygous for a given integration site in order to both 
augment expression and eliminate the need for screening of animals by DNA analysis; 
crossing of separate homozygous lines to produce compound heterozygous or homozygous 
lines; and breeding to place the transgene on a distinct background that is appropriate for 
an experimental model of interest. 

Transgenic animals of the invention have uses which include, but are not limited to, 
animal model systems useful in elaborating the biological function of polypeptides of the 
present invention, studying conditions and/or disorders associated with aberrant 
expression, and in screening for compounds effective in ameliorating such conditions 
and/or disorders. 

Example 30: Knock-Out Animals. 

Endogenous gene expression can also be reduced by inactivating or "knocking out" 
the gene and/or its promoter using targeted homologous recombination. (E.g., see 
Smithies et al., Nature 317:230-234 (1985); Thomas & Capecchi, Cell 51:503-512 (1987); 
Thompson et al., Cell 56:313-321 (1989); each of which is incorporated by reference herein 
in its entirety). For example, a mutant, non-functional polynucleotide of the invention (or 
a completely unrelated DNA sequence) flanked by DNA homologous to the endogenous 
polynucleotide sequence (either the coding regions or regulatory regions of the gene) can 
be used, with or without a selectable marker and/or a negative selectable marker, to 
transfect cells that express polypeptides of the invention in vivo. In another embodiment, 
techniques known in the art are used to generate knockouts in cells that contain, but do not 
express the gene of interest. Insertion of the DNA construct, via targeted homologous 
recombination, results in inactivation of the targeted gene. Such approaches are 
particularly suited in research and agricultural fields where modifications to embryonic 
stem cells can be used to generate animal offspring with an inactive targeted gene (e.g., see 
Thomas & Capecchi 1987 and Thompson 1989, supra). However, this approach can be 
routinely adapted for use in humans provided the recombinant DNA constructs are directly 
administered or targeted to the required site in vivo using appropriate viral vectors that will 
be apparent to those of skill in the art. 

In further embodiments of the invention, cells that are genetically engineered to 
express the polypeptides of the invention, or alternatively, that are genetically engineered 
not to express the polypeptides of the invention (e.g., knockouts) are administered to a 
patient in vivo. Such cells may be obtained from the patient (i.e., animal, including 
human) or an MHC compatible donor and can include, but are not limited to, fibroblasts, 
bone marrow cells, blood cells (e.g., lymphocytes), adipocytes, muscle cells, endothelial 
cells, etc. The cells are genetically engineered in vitro using recombinant DNA techniques 
to introduce the coding sequence of polypeptides of the invention into the cells, or 
alternatively, to disrupt the coding sequence and/or endogenous regulatory sequence 
associated with the polypeptides of the invention, e.g., by transduction (using viral vectors, 
and preferably vectors that integrate the transgene into the cell genome) or transfection 
procedures, including, but not limited to, the use of plasmids, cosmids, YACs, naked 
DNA, electroporation, liposomes, etc. The coding sequence of the polypeptides of the 
invention can be placed under the control of a strong constitutive or inducible promoter or 
promoter/enhancer to achieve expression, and preferably secretion, of the polypeptides of 
the invention. The engineered cells which express and preferably secrete the polypeptides 
of the invention can be introduced into the patient systemically, e.g., in the circulation, or 
intraperitoneally. 

Alternatively, the cells can be incorporated into a matrix and implanted in the body, 
e.g., genetically engineered fibroblasts can be implanted as part of a skin graft; genetically 
engineered endothelial cells can be implanted as part of a lymphatic or vascular graft. 
(See, for example, Anderson et al. U.S. Patent No. 5,399,349; and Mulligan & Wilson, 
U.S. Patent No. 5,460,959 each of which is incorporated by reference herein in its 
entirety). 

When the cells to be administered are non-autologous or non-MHC compatible 
cells, they can be administered using well known techniques which prevent the 
development of a host immune response against the introduced cells. For example, the 
cells may be introduced in an encapsulated form which, while allowing for an exchange of 
components with the immediate extracellular environment, does not allow the introduced 
cells to be recognized by the host immune system. 

Transgenic and "knock-out" animals of the invention have uses which include, but 
are not limited to, animal model systems useful in elaborating the biological function of 
polypeptides of the present invention, studying conditions and/or disorders associated with 
aberrant expression, and in screening for compounds effective in ameliorating such 
conditions and/or disorders. 

Example 31: Isolation of Antibody Fragments Directed Against Polypeptides of the 
Invention From a Library of scFvs. 

Naturally occurring V-genes isolated from human PBLs are constructed into a 
large library of antibody fragments which contain reactivities against a polypeptide 
having the amino acid sequence of SEQ ID NO: Y to which the donor may or may not 
have been exposed (see e.g., U.S. Patent 5,885,793 incorporated herein in its entirety by 
reference). 

Rescue of the library. 

A library of scFvs is constructed from the RNA of human PBLs as described in 
WO92/01047. To rescue phage displaying antibody fragments, approximately 10^9 E. 
coli harboring the phagemid are used to inoculate 50 ml of 2x TY containing 1% 
glucose and 100 micrograms/ml of ampicillin (2xTY-AMP-GLU) and grown to an O.D. 
of 0.8 with shaking. Five ml of this culture is used to inoculate 50 ml of 2xTY-AMP-GLU; 
2 x 10^8 TU of delta gene 3 helper phage (M13 delta gene III, see WO92/01047) are 
added, and the culture is incubated at 37°C for 45 minutes without shaking and then at 
37°C for 45 minutes with shaking. The culture is centrifuged at 4000 r.p.m. for 10 min. 
and the pellet is resuspended in 2 liters of 2x TY containing 100 micrograms/ml 
ampicillin and 50 micrograms/ml kanamycin and grown overnight. Phage are prepared 
as described in WO92/01047. 

M13 delta gene III is prepared as follows: M13 delta gene III helper phage does 
not encode gene III protein, hence the phage(mid) displaying antibody fragments have a 
greater avidity of binding to antigen. Infectious M13 delta gene III particles are made by 
growing the helper phage in cells harboring a pUC19 derivative supplying the wild type 
gene III protein during phage morphogenesis. The culture is incubated for 1 hour at 
37°C without shaking and then for a further hour at 37°C with shaking. Cells are spun 
down (IEC-Centra 8, 4000 revs/min for 10 min), resuspended in 300 ml 2x TY broth 
containing 100 micrograms ampicillin/ml and 25 micrograms kanamycin/ml (2x TY-AMP-KAN) 
and grown overnight, shaking at 37°C. Phage particles are purified and 
concentrated from the culture medium by two PEG-precipitations (Sambrook et al., 
1990), resuspended in 2 ml PBS and passed through a 0.45 micrometer filter (Minisart 
NML; Sartorius) to give a final concentration of approximately 10^13 transducing 
units/ml (ampicillin-resistant clones). 

Panning the Library. 

Immunotubes (Nunc) are coated overnight in PBS with 4 ml of either 100 
micrograms/ml or 10 micrograms/ml of a polypeptide of the present invention. Tubes 
are blocked with 2% Marvel-PBS for 2 hours at 37°C and then washed 3 times in PBS. 
Approximately 10^13 TU of phage is applied to the tube and incubated for 30 minutes at 
room temperature, tumbling on an over-and-under turntable, and then left to stand for 
another 1.5 hours. Tubes are washed 10 times with PBS 0.1% Tween-20 and 10 times 
with PBS. Phage are eluted by adding 1 ml of 100 mM triethylamine and rotating for 15 
minutes on an over-and-under turntable, after which the solution is immediately 
neutralized with 0.5 ml of 1.0 M Tris-HCl, pH 7.4. Phage are then used to infect 10 ml 
of mid-log E. coli TG1 by incubating eluted phage with bacteria for 30 minutes at 37°C. 
The E. coli are then plated on TYE plates containing 1% glucose and 100 
micrograms/ml ampicillin. The resulting bacterial library is then rescued with delta 
gene 3 helper phage as described above to prepare phage for a subsequent round of 
selection. This process is then repeated for a total of 4 rounds of affinity purification, 
with tube-washing increased to 20 times with PBS, 0.1% Tween-20 and 20 times with 
PBS for rounds 3 and 4. 

Characterization of Binders. 

Eluted phage from the third and fourth rounds of selection are used to infect E. coli 
HB2151 and soluble scFv is produced (Marks, et al., 1991) from single colonies for 
assay. ELISAs are performed with microtiter plates coated with 10 picograms/ml of 
the polypeptide of the present invention in 50 mM bicarbonate pH 9.6. Clones positive in 
ELISA are further characterized by PCR fingerprinting (see e.g., WO92/01047) and then 
by sequencing. 

Example 32: Assays Detecting Stimulation or Inhibition of B cell Proliferation and 
Differentiation 

Generation of functional humoral immune responses requires both soluble and cognate 
signaling between B-lineage cells and their microenvironment. Signals may impart a positive 
stimulus that allows a B-lineage cell to continue its programmed development, or a negative 
stimulus that instructs the cell to arrest its current developmental pathway. To date, numerous 
stimulatory and inhibitory signals have been found to influence B cell responsiveness including 
IL-2, IL-4, IL-5, IL-6, IL-7, IL-10, IL-13, IL-14 and IL-15. Interestingly, these signals are by 
themselves weak effectors but can, in combination with various co-stimulatory proteins, induce 
activation, proliferation, differentiation, homing, tolerance and death among B cell populations. 

One of the best studied classes of B-cell co-stimulatory proteins is the TNF-superfamily. 
Within this family CD40, CD27, and CD30 along with their respective ligands 
CD154, CD70, and CD153 have been found to regulate a variety of immune responses. Assays 
which allow for the detection and/or observation of the proliferation and differentiation of these 
B-cell populations and their precursors are valuable tools in determining the effects various 
proteins may have on these B-cell populations in terms of proliferation and differentiation. 
Listed below are two assays designed to allow for the detection of the differentiation, 
proliferation, or inhibition of B-cell populations and their precursors. 

In Vitro Assay- Purified polypeptides of the invention, or truncated forms thereof, 
are assessed for their ability to induce activation, proliferation, differentiation or inhibition 
and/or death in B-cell populations and their precursors. The activity of the polypeptides of 
the invention on purified human tonsillar B cells, measured qualitatively over the dose 
range from 0.1 to 10,000 ng/mL, is assessed in a standard B-lymphocyte co-stimulation 
assay in which purified tonsillar B cells are cultured in the presence of either 
formalin-fixed Staphylococcus aureus Cowan I (SAC) or immobilized anti-human IgM antibody as 
the priming agent. Second signals such as IL-2 and IL-15 synergize with SAC and IgM 
crosslinking to elicit B cell proliferation as measured by tritiated-thymidine incorporation. 
Novel synergizing agents can be readily identified using this assay. The assay involves 
isolating human tonsillar B cells by magnetic bead (MACS) depletion of CD3-positive 
cells. The resulting cell population is greater than 95% B cells as assessed by expression 
of CD45R(B220). 

Various dilutions of each sample are placed into individual wells of a 96-well plate 
to which are added 10^5 B-cells suspended in culture medium (RPMI 1640 containing 10% 
FBS, 5 x 10^-5 M 2ME, 100 U/ml penicillin, 10 ug/ml streptomycin, and 10^-5 dilution of 
SAC) in a total volume of 150 ul. Proliferation or inhibition is quantitated by a 20h pulse 
(1 uCi/well) with 3H-thymidine (6.7 Ci/mM) beginning 72h post factor addition. The 
positive and negative controls are IL-2 and medium, respectively. 
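
The readout of the in vitro assay above can be summarized as a stimulation index for each dose. The following Python fragment is a minimal sketch and not part of the described protocol: the function names, the tenfold spacing of doses across the stated 0.1 to 10,000 ng/mL range, and the use of the medium-only wells as the baseline are illustrative assumptions.

```python
from statistics import mean


def tenfold_doses(low_ng_ml=0.1, high_ng_ml=10_000.0):
    """Tenfold steps spanning the stated dose range (the spacing is an assumption)."""
    doses = []
    dose = low_ng_ml
    while dose <= high_ng_ml * 1.0001:  # small tolerance for floating-point drift
        doses.append(round(dose, 6))
        dose *= 10
    return doses


def stimulation_index(sample_cpm, medium_cpm):
    """Mean 3H-thymidine cpm of treated wells divided by mean cpm of medium-only wells."""
    baseline = mean(medium_cpm)
    if baseline <= 0:
        raise ValueError("medium control counts must be positive")
    return mean(sample_cpm) / baseline
```

An index above 1 would suggest stimulation relative to medium alone and an index below 1 would suggest inhibition; any threshold for calling a response positive would have to be set by the experimenter.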

In Vivo Assay- BALB/c mice are injected (i.p.) twice per day with buffer only, or 
2 mg/Kg of a polypeptide of the invention, or truncated forms thereof. Mice receive this 
treatment for 4 consecutive days, at which time they are sacrificed and various tissues and 
serum collected for analyses. Comparison of H&E sections from normal spleens and 
spleens from mice treated with polypeptides of the invention identifies the results of the 
activity of the polypeptides on spleen cells, such as the diffusion of peri-arterial lymphatic 
sheaths, and/or significant increases in the nucleated cellularity of the red pulp regions, 
which may indicate the activation of the differentiation and proliferation of B-cell populations. 
Immunohistochemical studies using a B cell marker, anti-CD45R(B220), are used to 
determine whether any physiological changes to splenic cells, such as splenic 
disorganization, are due to increased B-cell representation within loosely defined B-cell 
zones that infiltrate established T-cell regions. 

Flow cytometric analyses of the spleens from mice treated with polypeptide are used 
to indicate whether the polypeptide specifically increases the proportion of ThB+, 
CD45R(B220)dull B cells over that which is observed in control mice. 

Likewise, a predicted consequence of increased mature B-cell representation in 
vivo is a relative increase in serum Ig titers. Accordingly, serum IgM and IgA levels are 
compared between buffer and polypeptide-treated mice. 

The studies described in this example tested activity of a polypeptide of the 
invention. However, one skilled in the art could easily modify the exemplified studies to 
test the activity of polynucleotides of the invention (e.g., gene therapy), agonists, and/or 
antagonists of polynucleotides or polypeptides of the invention. 

Example 33: T Cell Proliferation Assay 

A CD3-induced proliferation assay is performed on PBMCs and is measured by the 
uptake of 3H-thymidine. The assay is performed as follows. Ninety-six well plates are 
coated with 100 ul/well of mAb to CD3 (HIT3a, Pharmingen) or isotype-matched control 
mAb (B33.1) overnight at 4 degrees C (1 ug/ml in 0.05 M bicarbonate buffer, pH 9.5), then 
washed three times with PBS. PBMC are isolated by F/H gradient centrifugation from 
human peripheral blood and added to quadruplicate wells (5 x 10^4/well) of mAb-coated 
plates in RPMI containing 10% FCS and P/S in the presence of varying concentrations of 
polypeptides of the invention (total volume 200 ul). Relevant protein buffer and medium 
alone are controls. After 48 hr. culture at 37 degrees C, plates are spun for 2 min. at 1000 
rpm and 100 ul of supernatant is removed and stored at -20 degrees C for measurement of 
IL-2 (or other cytokines) if an effect on proliferation is observed. Wells are supplemented 
with 100 ul of medium containing 0.5 uCi of 3H-thymidine and cultured at 37 degrees C 
for 18-24 hr. Wells are harvested and incorporation of 3H-thymidine is used as a measure of 
proliferation. Anti-CD3 alone is the positive control for proliferation. IL-2 (100 U/ml) is 
also used as a control which enhances proliferation. Control antibody which does not 
induce proliferation of T cells is used as the negative control for the effects of 
polypeptides of the invention. 
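
The quadruplicate wells and controls described above lend themselves to a simple summary calculation. The following Python fragment is a sketch only: the function names and the choice to express the treated response as a percentage of the anti-CD3-alone response, after subtracting the isotype-control background, are assumptions made for illustration and are not prescribed by the assay.

```python
from statistics import mean, stdev


def summarize_wells(cpm_values):
    """Return (mean, sample standard deviation) for replicate wells."""
    return mean(cpm_values), stdev(cpm_values)


def percent_of_anti_cd3(treated_cpm, anti_cd3_cpm, isotype_cpm):
    """Treated response as a percentage of the anti-CD3-alone response.

    The isotype-matched control wells are treated as background (an assumption).
    """
    background = mean(isotype_cpm)
    window = mean(anti_cd3_cpm) - background
    if window <= 0:
        raise ValueError("anti-CD3 response must exceed isotype background")
    return 100.0 * (mean(treated_cpm) - background) / window
```

Values above 100 would be consistent with enhancement of CD3-induced proliferation, and values below 100 with inhibition.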

The studies described in this example tested activity of polypeptides of the 
invention. However, one skilled in the art could easily modify the exemplified studies to 
test the activity of polynucleotides of the invention (e.g., gene therapy), agonists, and/or 
antagonists of polynucleotides or polypeptides of the invention. 

Example 34: Effect of Polypeptides of the Invention on the Expression of MHC Class 
II, Costimulatory and Adhesion Molecules and Cell Differentiation of Monocytes and 
Monocyte-Derived Human Dendritic Cells 

Dendritic cells are generated by the expansion of proliferating precursors found in 
the peripheral blood: adherent PBMC or elutriated monocytic fractions are cultured for 7- 
10 days with GM-CSF (50 ng/ml) and IL-4 (20 ng/ml). These dendritic cells have the 
characteristic phenotype of immature cells (expression of CD1, CD80, CD86, CD40 and 
MHC class II antigens). Treatment with activating factors, such as TNF-alpha, causes a rapid 
change in surface phenotype (increased expression of MHC class I and II, costimulatory 
and adhesion molecules, downregulation of FcγRII, upregulation of CD83). These 
changes correlate with increased antigen-presenting capacity and with functional 
maturation of the dendritic cells. 

FACS analysis of surface antigens is performed as follows. Cells are treated 1-3 
days with increasing concentrations of polypeptides of the invention or LPS (positive 
control), washed with PBS containing 1% BSA and 0.02 mM sodium azide, and then 
incubated with a 1:20 dilution of appropriate FITC- or PE-labeled monoclonal antibodies for 
30 minutes at 4 degrees C. After an additional wash, the labeled cells are analyzed by flow 
cytometry on a FACScan (Becton Dickinson). 

Effect on the production of cytokines. Cytokines generated by dendritic cells, in 
particular IL-12, are important in the initiation of T-cell dependent immune responses. 
IL-12 strongly influences the development of Th1 helper T-cell immune response, and induces 
cytotoxic T and NK cell function. An ELISA is used to measure the IL-12 release as 
follows. Dendritic cells (10^6/ml) are treated with increasing concentrations of polypeptides 
of the invention for 24 hours. LPS (100 ng/ml) is added to the cell culture as positive 
control. Supernatants from the cell cultures are then collected and analyzed for IL-12 
content using a commercial ELISA kit (e.g., R&D Systems (Minneapolis, MN)). The 
standard protocols provided with the kits are used. 

Effect on the expression of MHC Class II, costimulatory and adhesion molecules. 
Three major families of cell surface antigens can be identified on monocytes: adhesion 
molecules, molecules involved in antigen presentation, and Fc receptors. Modulation of the 
expression of MHC class II antigens and other costimulatory molecules, such as B7 and 
ICAM-1, may result in changes in the antigen presenting capacity of monocytes and ability 
to induce T cell activation. Increased expression of Fc receptors may correlate with 
improved monocyte cytotoxic activity, cytokine release and phagocytosis. 

FACS analysis is used to examine the surface antigens as follows. Monocytes are 
treated 1-5 days with increasing concentrations of polypeptides of the invention or LPS 
(positive control), washed with PBS containing 1% BSA and 0.02 mM sodium azide, and 
then incubated with 1:20 dilution of appropriate FITC- or PE-labeled monoclonal 
antibodies for 30 minutes at 4 degrees C. After an additional wash, the labeled cells are 
analyzed by flow cytometry on a FACScan (Becton Dickinson). 

Monocyte activation and/or increased survival. Assays for molecules that activate 
(or alternatively, inactivate) monocytes and/or increase monocyte survival (or 
alternatively, decrease monocyte survival) are known in the art and may routinely be 
applied to determine whether a molecule of the invention functions as an inhibitor or 
activator of monocytes. Polypeptides, agonists, or antagonists of the invention can be 
screened using the three assays described below. For each of these assays, peripheral 
blood mononuclear cells (PBMC) are purified from single donor leukopacks (American 
Red Cross, Baltimore, MD) by centrifugation through a Histopaque gradient (Sigma). 
Monocytes are isolated from PBMC by counterflow centrifugal elutriation. 

Monocyte Survival Assay. Human peripheral blood monocytes progressively lose 
viability when cultured in the absence of serum or other stimuli. Their death results from 
an internally regulated process (apoptosis). Addition to the culture of activating factors, such 
as TNF-alpha, dramatically improves cell survival and prevents DNA fragmentation. 
Propidium iodide (PI) staining is used to measure apoptosis as follows. Monocytes are 
cultured for 48 hours in polypropylene tubes in serum-free medium (positive control), in 
the presence of 100 ng/ml TNF-alpha (negative control), and in the presence of varying 
concentrations of the compound to be tested. Cells are suspended at a concentration of 2 x 
10^6/ml in PBS containing PI at a final concentration of 5 ug/ml, and then incubated at room 
temperature for 5 minutes before FACScan analysis. PI uptake has been demonstrated to 
correlate with DNA fragmentation in this experimental paradigm. 
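
One way the PI readout above could be put on a common scale is sketched below in Python. This is an illustration under stated assumptions, not the method of the assay: the function names are hypothetical, the event counts are assumed to come from the flow cytometer's gated PI-positive and total populations, and the 0-to-1 "protection" scale anchored on the serum-free and TNF-alpha controls is an invented convenience.

```python
def percent_pi_positive(pi_positive_events, total_events):
    """Percentage of acquired events that took up propidium iodide."""
    if total_events <= 0:
        raise ValueError("total_events must be positive")
    return 100.0 * pi_positive_events / total_events


def relative_protection(test_pct, serum_free_pct, tnf_pct):
    """Position of a test condition between the two controls.

    0.0 means as much PI uptake as serum-free medium (apoptosis control);
    1.0 means as little PI uptake as the TNF-alpha control.
    """
    span = serum_free_pct - tnf_pct
    if span <= 0:
        raise ValueError("serum-free control must show more PI uptake than TNF-alpha")
    return (serum_free_pct - test_pct) / span
```

Under this convention, a compound giving a value near 1 would behave like TNF-alpha in preserving monocyte viability.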

Effect on cytokine release. An important function of monocytes/macrophages is 
their regulatory activity on other cellular populations of the immune system through the 
release of cytokines after stimulation. An ELISA to measure cytokine release is performed 
as follows. Human monocytes are incubated at a density of 5 x 10^3 cells/ml with increasing 
concentrations of a polypeptide of the invention and under the same conditions, but in 
the absence of the polypeptide. For IL-12 production, the cells are primed overnight with 
IFN (100 U/ml) in the presence of a polypeptide of the invention. LPS (10 ng/ml) is then 
added. Conditioned media are collected after 24h and kept frozen until use. Measurement 
of TNF-alpha, IL-10, MCP-1 and IL-8 is then performed using a commercially available 
ELISA kit (e.g., R&D Systems (Minneapolis, MN)) and applying the standard protocols 
provided with the kit. 

Oxidative burst. Purified monocytes are plated in a 96-well plate at 1-2 x 10^5 cells/well. 
Increasing concentrations of polypeptides of the invention are added to the wells in a total 
volume of 0.2 ml culture medium (RPMI 1640 + 10% FCS, glutamine and antibiotics). 
After 3 days incubation, the plates are centrifuged and the medium is removed from the 
wells. To the macrophage monolayers, 0.2 ml per well of phenol red solution (140 mM 
NaCl, 10 mM potassium phosphate buffer pH 7.0, 5.5 mM dextrose, 0.56 mM phenol red 
and 19 U/ml of HRPO) is added, together with the stimulant (200 nM PMA). The plates 
are incubated at 37°C for 2 hours and the reaction is stopped by adding 20 ul 1N NaOH 
per well. The absorbance is read at 610 nm. To calculate the amount of H2O2 produced by 
the macrophages, a standard curve of an H2O2 solution of known molarity is generated for 
each experiment. 
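
The standard-curve step above can be illustrated with an ordinary least-squares line. The Python fragment below is a minimal sketch: it assumes that absorbance at 610 nm is linear in H2O2 over the range of the standards, which would need to be confirmed for each experiment, and the function names and units are hypothetical.

```python
def fit_line(x, y):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(x)
    if n < 2 or n != len(y):
        raise ValueError("need at least two paired standards")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    if sxx == 0:
        raise ValueError("standards must span more than one concentration")
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def h2o2_from_absorbance(a610, slope, intercept):
    """Invert the standard curve to estimate H2O2 in the units of the standards."""
    if slope == 0:
        raise ValueError("standard curve has zero slope")
    return (a610 - intercept) / slope
```

In use, the known H2O2 amounts of the standards and their absorbances would be passed to `fit_line`, and each sample well's absorbance would then be converted with `h2o2_from_absorbance`. Readings outside the range of the standards should be treated with caution.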

The studies described in this example tested activity of a polypeptide of the 
invention. However, one skilled in the art could easily modify the exemplified studies to 
test the activity of polypeptides, polynucleotides (e.g., gene therapy), agonists, and/or 
antagonists of the invention. 

Example 35: Biological Effects of Polypeptides of the Invention 

Astrocyte and Neuronal Assays - 

Recombinant polypeptides of the invention, expressed in Escherichia coli and 
purified as described above, can be tested for activity in promoting the survival, neurite 
outgrowth, or phenotypic differentiation of cortical neuronal cells and for inducing the 
proliferation of glial fibrillary acidic protein immunopositive cells, astrocytes. The 
selection of cortical cells for the bioassay is based on the prevalent expression of FGF-1 
and FGF-2 in cortical structures and on the previously reported enhancement of cortical 
neuronal survival resulting from FGF-2 treatment. A thymidine incorporation assay, for 
example, can be used to elucidate a polypeptide of the invention's activity on these cells. 

Moreover, previous reports describing the biological effects of FGF-2 (basic FGF) 
on cortical or hippocampal neurons in vitro have demonstrated increases in both neuron 
survival and neurite outgrowth (Walicke et al., "Fibroblast growth factor promotes survival 
of dissociated hippocampal neurons and enhances neurite extension." Proc. Natl. Acad. 
Sci. USA 83:3012-3016 (1986), assay herein incorporated by reference in its entirety). 
However, reports from experiments done on PC-12 cells suggest that these two responses 
are not necessarily synonymous and may depend on not only which FGF is being tested 
but also on which receptor(s) are expressed on the target cells. Using the primary cortical 
neuronal culture paradigm, the ability of a polypeptide of the invention to induce neurite 
outgrowth can be compared to the response achieved with FGF-2 using, for example, a 
thymidine incorporation assay. 


Fibroblast and endothelial cell assays - 

Human lung fibroblasts are obtained from Clonetics (San Diego, CA) and 
maintained in growth media from Clonetics. Dermal microvascular endothelial cells are 
obtained from Cell Applications (San Diego, CA). For proliferation assays, the human 
lung fibroblasts and dermal microvascular endothelial cells can be cultured at 5,000 
cells/well in a 96-well plate for one day in growth medium. The cells are then incubated 
for one day in 0.1% BSA basal medium. After replacing the medium with fresh 0.1% BSA 
medium, the cells are incubated with the test proteins for 3 days. Alamar Blue (Alamar 
Biosciences, Sacramento, CA) is added to each well to a final concentration of 10%. The 
cells are incubated for 4 hr. Cell viability is measured by reading in a CytoFluor 
fluorescence reader. For the PGE2 assays, the human lung fibroblasts are cultured at 5,000 
cells/well in a 96-well plate for one day. After a medium change to 0.1% BSA basal 
medium, the cells are incubated with FGF-2 or polypeptides of the invention with or 
without IL-1 alpha for 24 hours. The supernatants are collected and assayed for PGE2 by EIA 
kit (Cayman, Ann Arbor, MI). For the IL-6 assays, the human lung fibroblasts are cultured 
at 5,000 cells/well in a 96-well plate for one day. After a medium change to 0.1% BSA 
basal medium, the cells are incubated with FGF-2 or polypeptides of the invention with or 
without IL-1 alpha for 24 hours. The supernatants are collected and assayed for IL-6 by 
ELISA kit (Endogen, Cambridge, MA). 

Human lung fibroblasts are cultured with FGF-2 or polypeptides of the invention 
for 3 days in basal medium before the addition of Alamar Blue to assess effects on growth 
of the fibroblasts. FGF-2 should show a stimulation at 10 - 2500 ng/ml which can be used 
to compare stimulation with polypeptides of the invention. 

containing hormonal supplements (N1). The cultures are fixed with paraformaldehyde 
after 8 days in vitro and are processed for immunohistochemical staining of tyrosine 
hydroxylase, a specific marker for dopaminergic neurons. Dissociated cell cultures are 
prepared from embryonic rats. The culture medium is changed every third day and the 
factors are also added at that time. 

Since the dopaminergic neurons are isolated from animals at gestation day 14, a 
developmental time which is past the stage when the dopaminergic precursor cells are 
proliferating, an increase in the number of tyrosine hydroxylase immunopositive neurons 
would represent an increase in the number of dopaminergic neurons surviving in vitro. 
Therefore, if a polypeptide of the invention acts to prolong the survival of dopaminergic 
neurons, it would suggest that the polypeptide may be involved in Parkinson's Disease. 

The studies described in this example tested activity of a polypeptide of the 
invention. However, one skilled in the art could easily modify the exemplified studies to 
test the activity of polynucleotides (e.g., gene therapy), agonists, and/or antagonists of the 
invention. 

Example 36: The Effect of Polypeptides of the Invention on the Growth of Vascular 
Endothelial Cells 

On day 1, human umbilical vein endothelial cells (HUVEC) are seeded at 2-5 x 10^4 
cells/35 mm dish density in M199 medium containing 4% fetal bovine serum (FBS), 16 
units/ml heparin, and 50 units/ml endothelial cell growth supplements (ECGS, 
Biotechnique, Inc.). On day 2, the medium is replaced with M199 containing 10% FBS, 8 
units/ml heparin. A polypeptide having the amino acid sequence of SEQ ID NO:Y, and 
positive controls, such as VEGF and basic FGF (bFGF), are added at varying 
concentrations. On days 4 and 6, the medium is replaced. On day 8, cell number is 
determined with a Coulter Counter. 

An increase in the number of HUVEC cells indicates that the polypeptide of the 
invention may stimulate proliferation of vascular endothelial cells. 

The studies described in this example tested activity of a polypeptide of the 
invention. However, one skilled in the art could easily modify the exemplified studies to 
test the activity of polynucleotides (e.g., gene therapy), agonists, and/or antagonists of the 
invention. 

Example 37: Stimulatory Effect of Polypeptides of the Invention on the Proliferation 
of Vascular Endothelial Cells 

For evaluation of mitogenic activity of growth factors, the colorimetric MTS 
(3-(4,5-dimethylthiazol-2-yl)-5-(3-carboxymethoxyphenyl)-2-(4-sulfophenyl)-2H-tetrazolium) 
assay with the electron coupling reagent PMS (phenazine methosulfate) is 
performed (CellTiter 96 AQ, Promega). Cells are seeded in a 96-well plate (5,000 
cells/well) in 0.1 mL serum-supplemented medium and are allowed to attach overnight. 
After serum-starvation for 12 hours in 0.5% FBS, conditions (bFGF, VEGF165 or a 
polypeptide of the invention in 0.5% FBS) with or without heparin (8 U/ml) are added to 
wells for 48 hours. 20 mg of MTS/PMS mixture (1:0.05) are added per well and allowed 
to incubate for 1 hour at 37°C before measuring the absorbance at 490 nm in an ELISA 
plate reader. Background absorbance from control wells (some media, no cells) is 
subtracted, and seven wells are run in parallel for each condition. See, Leak et al., 
In Vitro Cell. Dev. Biol. 30A:512-518 (1994). 
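
The background subtraction and replicate averaging used in this assay can be sketched as follows. This is only an illustrative sketch: the function name, the list-based data layout, and all absorbance values are hypothetical and are not taken from the assay kit or from the cited reference.

```python
from statistics import mean, stdev

def background_corrected(condition_wells, blank_wells):
    """Subtract the mean blank absorbance (medium, no cells) from each
    replicate well and return the mean and standard deviation."""
    blank = mean(blank_wells)
    corrected = [a - blank for a in condition_wells]
    return mean(corrected), stdev(corrected)

# Hypothetical A490 readings: seven replicate wells for one condition
wells = [0.82, 0.79, 0.85, 0.80, 0.83, 0.81, 0.84]
# Hypothetical control wells containing medium but no cells
blanks = [0.10, 0.12, 0.11]

avg, sd = background_corrected(wells, blanks)
print(f"corrected A490 = {avg:.3f} +/- {sd:.3f}")
```

The same function could be applied to each condition (bFGF, VEGF165, or test polypeptide, with or without heparin) so that the corrected means can be compared directly.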

The studies described in this example tested activity of a polypeptide of the 
invention. However, one skilled in the art could easily modify the exemplified studies to 
test the activity of polynucleotides (e.g., gene therapy), agonists, and/or antagonists of the 
invention. 

Example 38: Inhibition of PDGF-induced Vascular Smooth Muscle Cell Proliferation 
Stimulatory Effect 

HAoSMC proliferation can be measured, for example, by BrdUrd incorporation. 
Briefly, subconfluent, quiescent cells grown on the 4-chamber slides are transfected with 
CRP or FITC-labeled AT2-3LP. Then, the cells are pulsed with 10% calf serum and 6 
mg/ml BrdUrd. After 24 h, immunocytochemistry is performed by using BrdUrd Staining 
Kit (Zymed Laboratories). In brief, the cells are incubated with the biotinylated mouse 
anti-BrdUrd antibody at 4 degrees C for 2 h after being exposed to denaturing solution and 
then incubated with the streptavidin-peroxidase and diaminobenzidine. After 
counterstaining with hematoxylin, the cells are mounted for microscopic examination, and 
the BrdUrd-positive cells are counted. The BrdUrd index is calculated as a percent of the 
BrdUrd-positive cells to the total cell number. In addition, the simultaneous detection of 
the BrdUrd staining (nucleus) and the FITC uptake (cytoplasm) is performed for individual 
cells by the concomitant use of bright field illumination and dark field-UV fluorescent 
illumination. See, Hayashida et al., J. Biol. Chem. 271(36):21985-21992 (1996). 
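
The BrdUrd index described above is a simple percentage. A minimal sketch, with a hypothetical function name and hypothetical cell counts, might look like this:

```python
def brdu_index(positive_cells, total_cells):
    """Percent of BrdUrd-positive cells relative to the total cell number."""
    if total_cells <= 0:
        raise ValueError("total_cells must be positive")
    if not 0 <= positive_cells <= total_cells:
        raise ValueError("positive_cells must lie between 0 and total_cells")
    return 100.0 * positive_cells / total_cells

# Hypothetical counts from one chamber of a slide
index = brdu_index(positive_cells=45, total_cells=300)
print(f"BrdUrd index = {index:.1f}%")
```

An inhibitory polypeptide would be expected to lower this index relative to the PDGF-stimulated control.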

The studies described in this example tested activity of a polypeptide of the 
invention. However, one skilled in the art could easily modify the exemplified studies to 
test the activity of polynucleotides (e.g., gene therapy), agonists, and/or antagonists of the 
invention. 

Example 39: Stimulation of Endothelial Migration 

This example will be used to explore the possibility that a polypeptide of the 
invention may stimulate lymphatic endothelial cell migration. 

Endothelial cell migration assays are performed using a 48 well microchemotaxis 
chamber (Neuroprobe Inc., Cabin John, MD; Falk, W., et al., J. Immunological Methods 
1980;33:239-247). Polyvinylpyrrolidone-free polycarbonate filters with a pore size of 8 
um (Nucleopore Corp. Cambridge, MA) are coated with 0.1% gelatin for at least 6 hours at 
room temperature and dried under sterile air. Test substances are diluted to appropriate 
concentrations in M199 supplemented with 0.25% bovine serum albumin (BSA), and 25 ul 
of the final dilution is placed in the lower chamber of the modified Boyden apparatus. 
Subconfluent, early passage (2-6) HUVEC or BMEC cultures are washed and trypsinized 
for the minimum time required to achieve cell detachment. After placing the filter between 
lower and upper chamber, 2.5 x 10^5 cells suspended in 50 ul M199 containing 1% FBS are 
seeded in the upper compartment. The apparatus is then incubated for 5 hours at 37°C in a 
humidified chamber with 5% CO2 to allow cell migration. After the incubation period, the 
filter is removed and the upper side of the filter with the non-migrated cells is scraped with 
a rubber policeman. The filters are fixed with methanol and stained with a Giemsa 
solution (Diff-Quick, Baxter, McGraw Park, IL). Migration is quantified by counting cells 
of three random high-power fields (40x) in each well, and all groups are performed in 
quadruplicate. 
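
The quantification step (three random high-power fields per well, four wells per group) can be summarized as in the sketch below. The nested-list layout, the function name, and the counts are hypothetical; averaging fields within a well before averaging wells is an assumption about how the counts are pooled.

```python
from statistics import mean

def migration_score(wells):
    """wells: one list per well, each holding cell counts from three
    high-power fields. Returns the group mean and the per-well means."""
    per_well = [mean(fields) for fields in wells]
    return mean(per_well), per_well

# Hypothetical counts for one test group run in quadruplicate
group = [
    [30, 34, 32],
    [28, 31, 31],
    [35, 33, 34],
    [29, 30, 31],
]
group_mean, well_means = migration_score(group)
print(f"migrated cells per field: {group_mean:.1f}")
```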

The studies described in this example tested activity of a polypeptide of the 
invention. However, one skilled in the art could easily modify the exemplified studies to 
test the activity of polynucleotides (e.g., gene therapy), agonists, and/or antagonists of the 
invention. 

Example 40: Stimulation of Nitric Oxide Production by Endothelial Cells 

Nitric oxide released by the vascular endothelium is believed to be a mediator of 
vascular endothelium relaxation. Thus, activity of a polypeptide of the invention can be 
assayed by determining nitric oxide production by endothelial cells in response to the 
polypeptide. 

Nitric oxide is measured in 96-well plates of confluent microvascular endothelial 
cells after 24 hours starvation and a subsequent 4 hr exposure to various levels of a positive 
control (such as VEGF-1) and the polypeptide of the invention. Nitric oxide in the 
medium is determined by use of the Griess reagent to measure total nitrite after reduction 
of nitric oxide-derived nitrate by nitrate reductase. The effect of the polypeptide of the 
invention on nitric oxide release is examined on HUVEC. 

Briefly, NO release from cultured HUVEC monolayer is measured with a NO- 
specific polarographic electrode connected to a NO meter (Iso-NO, World Precision 
Instruments Inc.) (1049). Calibration of the NO elements is performed according to the 
following equation: 

2 KNO2 + 2 KI + 2 H2SO4 → 2 NO + I2 + 2 H2O + 2 K2SO4 

The standard calibration curve is obtained by adding graded concentrations of 
KNO2 (0, 5, 10, 25, 50, 100, 250, and 500 nmol/L) into the calibration solution containing 
KI and H2SO4. The specificity of the Iso-NO electrode to NO is previously determined by 
measurement of NO from authentic NO gas (1050). The culture medium is removed and 
HUVECs are washed twice with Dulbecco's phosphate buffered saline. The cells are then 
bathed in 5 ml of filtered Krebs-Henseleit solution in 6-well plates, and the cell plates are 
kept on a slide warmer (Lab Line Instruments Inc.) to maintain the temperature at 37°C. 
The NO sensor probe is inserted vertically into the wells, keeping the tip of the electrode 2 
mm under the surface of the solution, before addition of the different conditions. 
S-nitroso-N-acetylpenicillamine (SNAP) is used as a positive control. The amount of 
released NO is expressed as picomoles per 1 x 10^6 endothelial cells. All values reported 
are means of four to six measurements in each group (number of cell culture wells). 
See, Leak et al., Biochem. and Biophys. Res. Comm. 217:96-105 (1995). 
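
Because the calibration reaction yields one mole of NO per mole of KNO2, the graded KNO2 standards can be treated as known NO concentrations and fitted with a straight line. The sketch below illustrates one way to do this; the electrode readings, the linear response, the function names, and the normalization by bath volume and cell count are all assumptions made for illustration.

```python
def fit_line(x, y):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def signal_to_no_nmol_per_l(signal, slope, intercept):
    """Invert the calibration line to get NO concentration in nmol/L."""
    return (signal - intercept) / slope

def pmol_per_million_cells(conc_nmol_per_l, volume_ml, cell_count):
    """Convert a bath concentration to pmol NO per 1 x 10^6 cells."""
    pmol = conc_nmol_per_l * volume_ml  # (nmol/L) x mL = pmol
    return pmol / (cell_count / 1e6)

# KNO2 standards from the text (nmol/L); readings are hypothetical
standards = [0, 5, 10, 25, 50, 100, 250, 500]
readings = [1.0 + 2.0 * c for c in standards]

slope, intercept = fit_line(standards, readings)
conc = signal_to_no_nmol_per_l(101.0, slope, intercept)
released = pmol_per_million_cells(conc, volume_ml=5.0, cell_count=2e6)
print(f"{conc:.1f} nmol/L, {released:.1f} pmol per 10^6 cells")
```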

The studies described in this example tested activity of polypeptides of the 
invention. However, one skilled in the art could easily modify the exemplified studies to 
test the activity of polynucleotides (e.g., gene therapy), agonists, and/or antagonists of the 
invention. 

Example 41: Effect of Polypeptides of the Invention on Cord Formation in 
Angiogenesis 

Another step in angiogenesis is cord formation, marked by differentiation of 
endothelial cells. This bioassay measures the ability of microvascular endothelial cells to 
form capillary-like structures (hollow structures) when cultured in vitro. 

CADMEC (microvascular endothelial cells) are purchased from Cell Applications, 
Inc. as proliferating (passage 2) cells and are cultured in Cell Applications' CADMEC 
Growth Medium and used at passage 5. For the in vitro angiogenesis assay, the wells of a 
48-well cell culture plate are coated with Cell Applications' Attachment Factor Medium 
(200 ml/well) for 30 min. at 37°C. CADMEC are seeded onto the coated wells at 7,500 
cells/well and cultured overnight in Growth Medium. The Growth Medium is then 
replaced with 300 mg Cell Applications' Chord Formation Medium containing control 
buffer or a polypeptide of the invention (0.1 to 100 ng/ml) and the cells are cultured for an 
additional 48 hr. The numbers and lengths of the capillary-like chords are quantitated 
through use of the Boeckeler VIA-170 video image analyzer. All assays are done in 
triplicate. 

Commercial (R&D) VEGF (50 ng/ml) is used as a positive control. b-estradiol (1 
ng/ml) is used as a negative control. The appropriate buffer (without protein) is also 
utilized as a control. 

The studies described in this example tested activity of a polypeptide of the 
invention. However, one skilled in the art could easily modify the exemplified studies to 
test the activity of polynucleotides (e.g., gene therapy), agonists, and/or antagonists of the 
invention. 

Example 42: Angiogenic Effect on Chick Chorioallantoic Membrane 

Chick chorioallantoic membrane (CAM) is a well-established system to examine 
angiogenesis. Blood vessel formation on CAM is easily visible and quantifiable. The 
ability of polypeptides of the invention to stimulate angiogenesis in CAM can be 
examined. 

Fertilized eggs of the White Leghorn chick (Gallus gallus) and the Japanese quail 
(Coturnix coturnix) are incubated at 37.8 °C and 80% humidity. Differentiated CAM of 
16-day-old chick and 13-day-old quail embryos is studied with the following methods. 

On Day 4 of development, a window is made into the egg shell of chick eggs. The 
embryos are checked for normal development and the eggs sealed with cellotape. They are 
further incubated until Day 13. Thermanox coverslips (Nunc, Naperville, IL) are cut into 
disks of about 5 mm in diameter. Sterile and salt-free growth factors are dissolved in 
distilled water and about 3.3 mg/ 5 ml are pipetted on the disks. After air-drying, the 
inverted disks are applied on CAM. After 3 days, the specimens are fixed in 3% 
glutaraldehyde and 2% formaldehyde and rinsed in 0.12 M sodium cacodylate buffer. 
They are photographed with a stereo microscope [Wild M8] and embedded for semi- and 
ultrathin sectioning as described above. Controls are performed with carrier disks alone. 

The studies described in this example tested activity of a polypeptide of the 
invention. However, one skilled in the art could easily modify the exemplified studies to 
test the activity of polynucleotides (e.g., gene therapy), agonists, and/or antagonists of the 
invention. 

Example 43: Angiogenesis Assay Using a Matrigel Implant in Mouse 

In vivo angiogenesis assay of a polypeptide of the invention measures the ability of 
an existing capillary network to form new vessels in an implanted capsule of murine 
extracellular matrix material (Matrigel). The protein is mixed with the liquid Matrigel at 4 
degrees C and the mixture is then injected subcutaneously in mice where it solidifies. After 
7 days, the solid "plug" of Matrigel is removed and examined for the presence of new 
blood vessels. Matrigel is purchased from Becton Dickinson Labware/Collaborative 
Biomedical Products. 

When thawed at 4 degrees C the Matrigel material is a liquid. The Matrigel is 
mixed with a polypeptide of the invention at 150 ng/ml at 4 degrees C and drawn into cold 
3 ml syringes. Female C57BL/6 mice approximately 8 weeks old are injected with the 
mixture of Matrigel and experimental protein at 2 sites at the midventral aspect of the 
abdomen (0.5 ml/site). After 7 days, the mice are sacrificed by cervical dislocation, the 
Matrigel plugs are removed and cleaned (i.e., all clinging membranes and fibrous tissue are 
removed). Replicate whole plugs are fixed in neutral buffered 10% formaldehyde, 
embedded in paraffin and used to produce sections for histological examination after 
staining with Masson's Trichrome. Cross sections from 3 different regions of each plug 
are processed. Selected sections are stained for the presence of vWF. The positive control 
for this assay is bovine basic FGF (150 ng/ml). Matrigel alone is used to determine basal 
levels of angiogenesis. 

The studies described in this example tested activity of a polypeptide of the 
invention. However, one skilled in the art could easily modify the exemplified studies to 
test the activity of polynucleotides (e.g., gene therapy), agonists, and/or antagonists of the 
invention. 


Example 44: Rescue of Ischemia in Rabbit Lower Limb Model 

To study the in vivo effects of polynucleotides and polypeptides of the invention on 
ischemia, a rabbit hindlimb ischemia model is created by surgical removal of one femoral 
artery as described previously (Takeshita et al., Am. J. Pathol. 147:1649-1660 (1995)). 
The excision of the femoral artery results in retrograde propagation of thrombus and 
occlusion of the external iliac artery. Consequently, blood flow to the ischemic limb is 
dependent upon collateral vessels originating from the internal iliac artery (Takeshita et al., 
Am. J. Pathol. 147:1649-1660 (1995)). An interval of 10 days is allowed for post-operative 
recovery of rabbits and development of endogenous collateral vessels. At 10 days 
post-operatively (day 0), after performing a baseline angiogram, the internal iliac artery of the 
ischemic limb is transfected with 500 mg naked expression plasmid containing a 
polynucleotide of the invention by arterial gene transfer technology using a 
hydrogel-coated balloon catheter as described (Riessen et al., Hum. Gene Ther. 4:749-758 (1993); 
Leclerc et al., J. Clin. Invest. 90:936-944 (1992)). When a polypeptide of the invention is 
used in the treatment, a single bolus of 500 mg polypeptide of the invention or control is 
delivered into the internal iliac artery of the ischemic limb over a period of 1 min. through 
an infusion catheter. On day 30, various parameters are measured in these rabbits: (a) BP 
ratio - The blood pressure ratio of systolic pressure of the ischemic limb to that of normal 
limb; (b) Blood Flow and Flow Reserve - Resting FL: the blood flow during undilated 
condition and Max FL: the blood flow during fully dilated condition (also an indirect 
measure of the blood vessel amount) and Flow Reserve is reflected by the ratio of max FL: 
resting FL; (c) Angiographic Score - This is measured by the angiogram of collateral 
vessels. A score is determined by the percentage of circles in an overlying grid that are 
crossed by opacified arteries, divided by the total number in the rabbit thigh; (d) Capillary 
density - The number of collateral capillaries determined in light microscopic sections 
taken from hindlimbs. 
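
The ratio-based parameters (a) through (c) can be written out as in the sketch below. The function names and the numerical inputs are hypothetical, and expressing the angiographic score as the fraction of grid circles crossed by opacified arteries is an assumption about how the score is normalized.

```python
def bp_ratio(systolic_ischemic, systolic_normal):
    """(a) Systolic pressure of the ischemic limb relative to the normal limb."""
    return systolic_ischemic / systolic_normal

def flow_reserve(max_flow, resting_flow):
    """(b) Ratio of fully dilated (max FL) to undilated (resting FL) blood flow."""
    return max_flow / resting_flow

def angiographic_score(circles_crossed, total_circles):
    """(c) Fraction of grid circles crossed by opacified arteries."""
    return circles_crossed / total_circles

# Hypothetical day-30 measurements for one rabbit
print(f"BP ratio:           {bp_ratio(60.0, 100.0):.2f}")
print(f"Flow reserve:       {flow_reserve(30.0, 15.0):.2f}")
print(f"Angiographic score: {angiographic_score(24, 60):.2f}")
```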

The studies described in this example tested activity of polynucleotides and 
polypeptides of the invention. However, one skilled in the art could easily modify the 
exemplified studies to test the agonists and/or antagonists of the invention. 

Example 45: Effect of Polypeptides of the Invention on Vasodilation 

Since dilation of vascular endothelium is important in reducing blood pressure, the 
ability of polypeptides of the invention to affect the blood pressure in spontaneously 
hypertensive rats (SHR) is examined. Increasing doses (0, 10, 30, 100, 300, and 900 
mg/kg) of the polypeptides of the invention are administered to 13-14 week old 
spontaneously hypertensive rats (SHR). Data are expressed as the mean +/- SEM. 
Statistical analyses are performed with a paired t-test and statistical significance is defined 
as p<0.05 vs. the response to buffer alone. 
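
The summary statistics named above (mean +/- SEM and a paired t-test against the buffer response) can be computed as in the following sketch. The blood pressure values and function names are hypothetical, and only the t statistic and degrees of freedom are computed here; obtaining the p value would require a Student t distribution table or a statistics package.

```python
from math import sqrt
from statistics import mean, stdev

def mean_sem(values):
    """Mean and standard error of the mean."""
    return mean(values), stdev(values) / sqrt(len(values))

def paired_t(treated, control):
    """Paired t statistic and degrees of freedom for matched measurements."""
    diffs = [t - c for t, c in zip(treated, control)]
    m, sem = mean_sem(diffs)
    return m / sem, len(diffs) - 1

# Hypothetical systolic pressures (mmHg) in the same five rats
buffer_only = [180, 185, 178, 190, 182]
with_dose = [170, 173, 170, 176, 176]

m, sem = mean_sem(with_dose)
t_stat, df = paired_t(with_dose, buffer_only)
print(f"treated: {m:.1f} +/- {sem:.1f} mmHg; t = {t_stat:.2f}, df = {df}")
```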

The studies described in this example tested activity of a polypeptide of the 
invention. However, one skilled in the art could easily modify the exemplified studies to 
test the activity of polynucleotides (e.g., gene therapy), agonists, and/or antagonists of the 
invention. 

Example 46: Rat Ischemic Skin Flap Model 

The evaluation parameters include skin blood flow, skin temperature, and factor 
VIII immunohistochemistry or endothelial alkaline phosphatase reaction. Expression of 
polypeptides of the invention, during the skin ischemia, is studied using in situ 
hybridization. 

The study in this model is divided into three parts as follows: 

a) Ischemic skin 

b) Ischemic skin wounds 

c) Normal wounds 

The experimental protocol includes: 

a) Raising a 3x4 cm, single pedicle full-thickness random skin flap (myocutaneous 
flap over the lower back of the animal). 

b) An excisional wounding (4-6 mm in diameter) in the ischemic skin (skin-flap). 

c) Topical treatment with a polypeptide of the invention of the excisional wounds 
(day 0, 1, 2, 3, 4 post-wounding) at the following various dosage ranges: 1 mg to 100 mg. 

d) Harvesting the wound tissues at day 3, 5, 7, 10, 14 and 21 post-wounding for 
histological, immunohistochemical, and in situ studies. 

The studies described in this example tested activity of a polypeptide of the 
invention. However, one skilled in the art could easily modify the exemplified studies to 
test the activity of polynucleotides (e.g., gene therapy), agonists, and/or antagonists of the 
invention. 

Example 47: Peripheral Arterial Disease Model 

Angiogenic therapy using a polypeptide of the invention is a novel therapeutic 
strategy to obtain restoration of blood flow around the ischemia in case of peripheral 
arterial diseases. The experimental protocol includes: 

a) One side of the femoral artery is ligated to create ischemic muscle of 
the hind limb, the other side of hindlimb serves as a control. 

b) A polypeptide of the invention, in a dosage range of 20 mg - 500 mg, is delivered 
intravenously and/or intramuscularly 3 times (perhaps more) per week for 2-3 weeks. 

c) The ischemic muscle tissue is collected after ligation of the femoral 
artery at 1, 2, and 3 weeks for the analysis of expression of a polypeptide of the invention 
and histology. Biopsy is also performed on the other side of normal muscle of the 
contralateral hindlimb. 

The studies described in this example tested activity of a polypeptide of the 
invention. However, one skilled in the art could easily modify the exemplified studies to 
test the activity of polynucleotides (e.g., gene therapy), agonists, and/or antagonists of the 
invention. 

Example 48: Ischemic Myocardial Disease Model 

A polypeptide of the invention is evaluated as a potent mitogen capable of 
stimulating the development of collateral vessels, and restructuring new vessels after 
coronary artery occlusion. Alteration of expression of the polypeptide is investigated in 
situ. The experimental protocol includes: 

a) The heart is exposed through a left-side thoracotomy in the rat. Immediately, the 
left coronary artery is occluded with a thin suture (6-0) and the thorax is closed. 

b) A polypeptide of the invention, in a dosage range of 20 mg - 500 mg, is delivered 
intravenously and/or intramuscularly 3 times (perhaps more) per week for 2-4 weeks. 

c) Thirty days after the surgery, the heart is removed and cross-sectioned 
for morphometric and in situ analyses. 

The studies described in this example tested activity of a polypeptide of the 
invention. However, one skilled in the art could easily modify the exemplified studies to 
test the activity of polynucleotides (e.g., gene therapy), agonists, and/or antagonists of the 
invention. 

Example 49: Rat Corneal Wound Healing Model 

This animal model shows the effect of a polypeptide of the invention on 
neovascularization. The experimental protocol includes: 

a) Making a 1-1.5 mm long incision from the center of cornea into the stromal 
layer. 

b) Inserting a spatula below the lip of the incision facing the outer corner of 
the eye. 

c) Making a pocket (its base is 1-1.5 mm from the edge of the eye). 

d) Positioning a pellet, containing 50 ng - 5 ug of a polypeptide of the invention, 
within the pocket. 

e) Treatment with a polypeptide of the invention can also be applied topically 
to the corneal wounds in a dosage range of 20 mg - 500 mg (daily treatment for five days). 

The studies described in this example tested activity of a polypeptide of the 
invention. However, one skilled in the art could easily modify the exemplified studies to 
test the activity of polynucleotides (e.g., gene therapy), agonists, and/or antagonists of the 
invention. 

Example 50: Diabetic Mouse and Glucocorticoid-Impaired Wound Healing Models 

A. Diabetic db+/db+ Mouse Model. 

To demonstrate that a polypeptide of the invention accelerates the healing process, 
the genetically diabetic mouse model of wound healing is used. The full thickness wound 
healing model in the db+/db+ mouse is a well characterized, clinically relevant and 
reproducible model of impaired wound healing. Healing of the diabetic wound is 
dependent on formation of granulation tissue and re-epithelialization rather than 
contraction (Gartner, M.H. et al., J. Surg. Res. 52:389 (1992); Greenhalgh, D.G. et al., Am. 
J. Pathol. 136:1235 (1990)). 

The diabetic animals have many of the characteristic features observed in Type II 
diabetes mellitus. Homozygous (db+/db+) mice are obese in comparison to their normal 
heterozygous (db+/+m) littermates. Mutant diabetic (db+/db+) mice have a single 
autosomal recessive mutation on chromosome 4 (db+) (Coleman et al., Proc. Natl. Acad. 
Sci. USA 77:283-293 (1982)). Animals show polyphagia, polydipsia and polyuria. Mutant 
diabetic mice (db+/db+) have elevated blood glucose, increased or normal insulin levels, 
and suppressed cell-mediated immunity (Mandel et al., J. Immunol. 120:1375 (1978); 
Debray-Sachs, M. et al., Clin. Exp. Immunol. 51(1):1-7 (1983); Leiter et al., Am. J. of 
Pathol. 114:46-55 (1985)). Peripheral neuropathy, myocardial complications, and 
microvascular lesions, basement membrane thickening and glomerular filtration 
abnormalities have been described in these animals (Norido, F. et al., Exp. Neurol. 
83(2):221-232 (1984); Robertson et al., Diabetes 29(1):60-61 (1980); Giacomelli et al., 
Lab Invest. 40(4):460-473 (1979); Coleman, D.L., Diabetes 31 (Suppl):1-6 (1982)). These 
homozygous diabetic mice develop hyperglycemia that is resistant to insulin analogous to 
human type II diabetes (Mandel et al., J. Immunol. 120:1375-1377 (1978)). 

The characteristics observed in these animals suggest that healing in this model 
may be similar to the healing observed in human diabetes (Greenhalgh et al., Am. J. of 
Pathol. 136:1235-1246 (1990)). 

Genetically diabetic female C57BL/KsJ (db+/db+) mice and their non-diabetic 
(db+/+m) heterozygous littermates are used in this study (Jackson Laboratories). The 
animals are purchased at 6 weeks of age and are 8 weeks old at the beginning of the study. 
Animals are individually housed and receive food and water ad libitum. All 
manipulations are performed using aseptic techniques. The experiments are conducted 
according to the rules and guidelines of Human Genome Sciences, Inc. Institutional 
Animal Care and Use Committee and the Guidelines for the Care and Use of Laboratory 
Animals. 

Wounding protocol is performed according to previously reported methods 
(Tsuboi, R. and Rifkin, D.B., J. Exp. Med. 172:245-251 (1990)). Briefly, on the day of 
wounding, animals are anesthetized with an intraperitoneal injection of Avertin (0.01 
mg/mL), 2,2,2-tribromoethanol and 2-methyl-2-butanol dissolved in deionized water. The 
dorsal region of the animal is shaved and the skin washed with 70% ethanol solution and 
iodine. The surgical area is dried with sterile gauze prior to wounding. An 8 mm full- 
thickness wound is then created using a Keyes tissue punch. Immediately following 
wounding, the surrounding skin is gently stretched to eliminate wound expansion. The 
wounds are left open for the duration of the experiment. Application of the treatment is 
given topically for 5 consecutive days commencing on the day of wounding. Prior to 
treatment, wounds are gently cleansed with sterile saline and gauze sponges. 

Wounds are visually examined and photographed at a fixed distance at the day of 
surgery and at two day intervals thereafter. Wound closure is determined by daily 
measurement on days 1-5 and on day 8. Wounds are measured horizontally and vertically 
using a calibrated Jameson caliper. Wounds are considered healed if granulation tissue is 
no longer visible and the wound is covered by a continuous epithelium. 

A polypeptide of the invention is administered at a range of different doses, 
from 4mg to 500mg per wound per day for 8 days in vehicle. Vehicle control groups 
receive 50mL of vehicle solution. 

Animals are euthanized on day 8 with an intraperitoneal injection of sodium 
pentobarbital (300mg/kg). The wounds and surrounding skin are then harvested for 
histology and immunohistochemistry. Tissue specimens are placed in 10% neutral 
buffered formalin in tissue cassettes between biopsy sponges for further processing. 

Three groups of 10 animals each (5 diabetic and 5 non-diabetic controls) are 
evaluated: 1) Vehicle placebo control, 2) untreated group, and 3) treated group. 

Wound closure is analyzed by measuring the area in the vertical and horizontal axes 
and obtaining the total square area of the wound. Contraction is then estimated by 
establishing the differences between the initial wound area (day 0) and that of post 
treatment (day 8). The wound area on day 1 is 64 mm^2, the corresponding size of the 
dermal punch. Calculations are made using the following formula: 

([Open area on day 8] - [Open area on day 1]) / [Open area on day 1] 
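
As a worked illustration of this calculation, the sketch below computes the wound area from the horizontal and vertical caliper measurements and then the relative change in open area between day 1 and day 8. The function names and the day 8 measurements are hypothetical; taking the area as the product of the two measurements is an assumption consistent with the 64 mm^2 value stated for an 8 mm punch.

```python
def wound_area(horizontal_mm, vertical_mm):
    """Square area (mm^2) from horizontal and vertical caliper measurements."""
    return horizontal_mm * vertical_mm

def relative_area_change(area_day8, area_day1):
    """(open area day 8 - open area day 1) / open area day 1.
    Negative values indicate that the wound has become smaller."""
    return (area_day8 - area_day1) / area_day1

day1 = wound_area(8.0, 8.0)  # 64 mm^2, matching the 8 mm punch
day8 = wound_area(4.0, 4.0)  # hypothetical day 8 measurement

change = relative_area_change(day8, day1)
print(f"relative change = {change:.2f} ({-100 * change:.0f}% closure)")
```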

Specimens are fixed in 10% buffered formalin and paraffin embedded blocks are 
sectioned perpendicular to the wound surface (5mm) and cut using a Reichert-Jung 
microtome. Routine hematoxylin-eosin (H&E) staining is performed on cross-sections of 
bisected wounds. Histologic examination of the wounds is used to assess whether the 
healing process and the morphologic appearance of the repaired skin are altered by 
treatment with a polypeptide of the invention. This assessment includes verification of the 
presence of cell accumulation, inflammatory cells, capillaries, fibroblasts, 
re-epithelialization and epidermal maturity (Greenhalgh, D.G. et al., Am. J. Pathol. 136:1235 
(1990)). A calibrated lens micrometer is used by a blinded observer. 

Tissue sections are also stained immunohistochemically with a polyclonal rabbit 
anti-human keratin antibody using ABC Elite detection system. Human skin is used as a 
positive tissue control while non-immune IgG is used as a negative control. Keratinocyte 
growth is determined by evaluating the extent of reepithelialization of the wound using a 
calibrated lens micrometer. 

Proliferating cell nuclear antigen/cyclin (PCNA) in skin specimens is demonstrated 
by using anti-PCNA antibody (1:50) with an ABC Elite detection system. Human colon 
cancer can serve as a positive tissue control and human brain tissue can be used as a 
negative tissue control. Each specimen includes a section with omission of the primary 
antibody and substitution with non-immune mouse IgG. Ranking of these sections is based 
on the extent of proliferation on a scale of 0-8, the lower side of the scale reflecting slight 
proliferation to the higher side reflecting intense proliferation. 

Experimental data are analyzed using an unpaired t test. A p value of < 0.05 is 
considered significant. 


B. Steroid Impaired Rat Model 

The inhibition of wound healing by steroids has been well documented in various 
in vitro and in vivo systems (Wahl, Glucocorticoids and Wound healing. In: 
Anti-Inflammatory Steroid Action: Basic and Clinical Aspects. 280-302 (1989); Wahl et al., J. 
Immunol. 115:476-481 (1975); Werb et al., J. Exp. Med. 147:1684-1694 (1978)). 
Glucocorticoids retard wound healing by inhibiting angiogenesis, decreasing vascular 
permeability (Ebert et al., Ann. Intern. Med. 37:701-705 (1952)), fibroblast proliferation, 
and collagen synthesis (Beck et al., Growth Factors 5:295-304 (1991); Haynes et al., J. 
Clin. Invest. 61:703-797 (1978)) and producing a transient reduction of circulating 
monocytes (Haynes et al., J. Clin. Invest. 61:703-797 (1978); Wahl, "Glucocorticoids and 
wound healing", In: Antiinflammatory Steroid Action: Basic and Clinical Aspects, 
Academic Press, New York, pp. 280-302 (1989)). The systemic administration of steroids 
to impair wound healing is a well-established phenomenon in rats (Beck et al., Growth 
Factors 5:295-304 (1991); Haynes et al., J. Clin. Invest. 61:703-797 (1978); Wahl, 
"Glucocorticoids and wound healing", In: Antiinflammatory Steroid Action: Basic and 
Clinical Aspects, Academic Press, New York, pp. 280-302 (1989); Pierce et al., Proc. 
Natl. Acad. Sci. USA 86:2229-2233 (1989)). 

To demonstrate that a polypeptide of the invention can accelerate the healing 
process, the effects of multiple topical applications of the polypeptide on full thickness 
excisional skin wounds in rats in which healing has been impaired by the systemic 
administration of methylprednisolone are assessed. 

Young adult male Sprague Dawley rats weighing 250-300 g (Charles River 
Laboratories) are used in this example. The animals are purchased at 8 weeks of age and 
are 9 weeks old at the beginning of the study. The healing response of rats is impaired by 
the systemic administration of methylprednisolone (17mg/kg/rat intramuscularly) at the 
time of wounding. Animals are individually housed and receive food and water ad 
libitum. All manipulations are performed using aseptic techniques. This study is 
conducted according to the rules and guidelines of Human Genome Sciences, Inc. 
Institutional Animal Care and Use Committee and the Guidelines for the Care and Use of 
Laboratory Animals. 

The wounding protocol is followed according to section A, above. On the day of 
wounding, animals are anesthetized with an intramuscular injection of ketamine (50 
mg/kg) and xylazine (5 mg/kg). The dorsal region of the animal is shaved and the skin 
washed with 70% ethanol and iodine solutions. The surgical area is dried with sterile 
gauze prior to wounding. An 8 mm full-thickness wound is created using a Keyes tissue 
punch. The wounds are left open for the duration of the experiment. Applications of the 
testing materials are given topically once a day for 7 consecutive days commencing on the 
day of wounding and subsequent to methylprednisolone administration. Prior to treatment, 
wounds are gently cleansed with sterile saline and gauze sponges. 

Wounds are visually examined and photographed at a fixed distance on the day of
wounding and at the end of treatment. Wound closure is determined by daily measurement 
on days 1-5 and on day 8. Wounds are measured horizontally and vertically using a 
calibrated Jameson caliper. Wounds are considered healed if granulation tissue is no longer 
visible and the wound is covered by a continuous epithelium. 

The polypeptide of the invention is administered at a range of different doses,
from 4mg to 500mg per wound per day for 8 days in vehicle. Vehicle control groups
receive 50mL of vehicle solution.

Animals are euthanized on day 8 with an intraperitoneal injection of sodium 
pentobarbital (300mg/kg). The wounds and surrounding skin are then harvested for 
histology. Tissue specimens are placed in 10% neutral buffered formalin in tissue cassettes 
between biopsy sponges for further processing. 

Four groups of 10 animals each (5 with methylprednisolone and 5 without
glucocorticoid) are evaluated: 1) untreated group; 2) vehicle placebo control; 3) treated
groups.

Wound closure is analyzed by measuring the wound along the vertical and horizontal axes
and obtaining the total area of the wound. Closure is then estimated by establishing the
differences between the initial wound area (day 0) and that of post treatment (day 8). The
wound area on day 1 is 64 mm², the corresponding size of the dermal punch. Calculations
are made using the following formula:

([Open area on day 8] - [Open area on day 1]) / [Open area on day 1]
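
The calculation above can be illustrated with a short sketch. It assumes, consistent with the 64 mm² figure given for an 8 mm punch, that the wound area is approximated as the product of the horizontal and vertical caliper readings. The function names and the day-8 readings are hypothetical and serve only as an illustration.

```python
def wound_area(horizontal_mm, vertical_mm):
    """Approximate wound area (mm^2) as horizontal x vertical extent (assumption)."""
    return horizontal_mm * vertical_mm


def fractional_change(area_day1, area_day8):
    """(area on day 8 - area on day 1) / area on day 1.

    A negative value means the open wound area has shrunk.
    """
    if area_day1 <= 0:
        raise ValueError("day-1 area must be positive")
    return (area_day8 - area_day1) / area_day1


initial = wound_area(8.0, 8.0)   # 8 mm punch: 64 mm^2, as stated in the text
final = wound_area(4.0, 4.0)     # hypothetical day-8 caliper readings
change = fractional_change(initial, final)
print(f"{change:.2%}")           # -75.00%
```

With this sign convention, a fully closed wound gives -100% and an unchanged wound gives 0%.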

Specimens are fixed in 10% buffered formalin and paraffin embedded blocks are 
sectioned perpendicular to the wound surface (5mm) and cut using an Olympus 
microtome. Routine hematoxylin-eosin (H&E) staining is performed on cross-sections of
bisected wounds. Histologic examination of the wounds allows assessment of whether the
healing process and the morphologic appearance of the repaired skin are improved by
treatment with a polypeptide of the invention. A calibrated lens micrometer is used by a 
blinded observer to determine the distance of the wound gap. 

Experimental data are analyzed using an unpaired t test. A p value of < 0.05 is
considered significant. 
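
As an illustration of the analysis described above, the following sketch computes an unpaired (two-sample, pooled-variance) Student's t statistic and a two-sided p value using only the Python standard library. The equal-variance form, the numerical integration of the t density, and the example data are assumptions made for this sketch; the text does not specify which variant of the test or which software is used.

```python
import math
from statistics import mean, variance


def t_pdf(x, df):
    """Probability density of Student's t distribution with df degrees of freedom."""
    c = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return c * (1 + x * x / df) ** (-(df + 1) / 2)


def unpaired_t_test(a, b, steps=2000):
    """Pooled-variance unpaired t test; returns (t, df, two-sided p).

    Each group needs at least two values and the pooled variance must be non-zero.
    """
    na, nb = len(a), len(b)
    df = na + nb - 2
    pooled = ((na - 1) * variance(a) + (nb - 1) * variance(b)) / df
    t = (mean(a) - mean(b)) / math.sqrt(pooled * (1 / na + 1 / nb))
    # Simpson's rule for the probability mass between 0 and |t| (steps is even).
    upper = abs(t)
    h = upper / steps
    total = t_pdf(0.0, df) + t_pdf(upper, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    central = total * h / 3
    p = max(0.0, 1 - 2 * central)
    return t, df, p


# Hypothetical wound-gap measurements for two groups of five animals.
vehicle = [1.0, 2.0, 3.0, 4.0, 5.0]
treated = [2.0, 3.0, 4.0, 5.0, 6.0]
t, df, p = unpaired_t_test(vehicle, treated)
print(f"t = {t:.3f}, df = {df}, p = {p:.3f}, significant = {p < 0.05}")
```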

The studies described in this example tested activity of a polypeptide of the 
invention. However, one skilled in the art could easily modify the exemplified studies to
test the activity of polynucleotides (e.g., gene therapy), agonists, and/or antagonists of the 
invention. 

Example 51: Lymphedema Animal Model

The purpose of this experimental approach is to create an appropriate and
consistent lymphedema model for testing the therapeutic effects of a polypeptide of the 
invention in lymphangiogenesis and re-establishment of the lymphatic circulatory system 
in the rat hind limb. Effectiveness is measured by swelling volume of the affected limb, 
quantification of the amount of lymphatic vasculature, total blood plasma protein, and 
histopathology. Acute lymphedema is observed for 7-10 days. Perhaps more importantly, 
the chronic progress of the edema is followed for up to 3-4 weeks. 

Prior to beginning surgery, a blood sample is drawn for protein concentration
analysis. Male rats weighing approximately 350 g are dosed with Pentobarbital.
Subsequently, the right legs are shaved from knee to hip. The shaved area is swabbed with
gauze soaked in 70% EtOH. Blood is drawn for serum total protein testing.
Circumference and volumetric measurements are made prior to injecting dye into paws
after marking 2 measurement levels (0.5 cm above heel, at mid-point of dorsal paw). The
intradermal dorsum of both right and left paws is injected with 0.05 ml of 1% Evans
Blue. Circumference and volumetric measurements are then made following injection of
dye into paws.

Using the knee joint as a landmark, a mid-leg inguinal incision is made 
circumferentially allowing the femoral vessels to be located. Forceps and hemostats are 
used to dissect and separate the skin flaps. After locating the femoral vessels, the 
lymphatic vessel that runs alongside and underneath the vessel(s) is located. The main
lymphatic vessels in this area are then electrically coagulated or suture ligated.

Using a microscope, muscles in back of the leg (near the semitendinosus and
adductors) are bluntly dissected. The popliteal lymph node is then located. The 2
proximal and 2 distal lymphatic vessels and distal blood supply of the popliteal node are
then ligated by suturing. The popliteal lymph node, and any accompanying adipose
tissue, is then removed by cutting connective tissues. 

Care is taken to control any mild bleeding resulting from this procedure. After 
lymphatics are occluded, the skin flaps are sealed by using liquid skin (Vetbond) (AJ 
Buck). The separated skin edges are sealed to the underlying muscle tissue while leaving a 
gap of ~0.5 cm around the leg. Skin also may be anchored by suturing to underlying
muscle when necessary. 

To avoid infection, animals are housed individually with mesh (no bedding).
Recovering animals are checked daily through the optimal edematous peak, which
typically occurs by day 5-7. The plateau edematous peak is then observed. To
evaluate the intensity of the lymphedema, the circumference and volumes of 2 designated
places on each paw are measured before operation and daily for 7 days. The effect of plasma
proteins on lymphedema is determined and whether protein analysis is a useful testing
parameter is also investigated. The weights of both control and edematous limbs are
evaluated at 2 places. Analysis is performed in a blind manner.

Circumference Measurements: Under brief gas anesthetic to prevent limb 
movement, a cloth tape is used to measure limb circumference. Measurements are done at 
the ankle bone and dorsal paw by 2 different people, and those 2 readings are then averaged.
Readings are taken from both control and edematous limbs. 

Volumetric Measurements: On the day of surgery, animals are anesthetized with
Pentobarbital and are tested prior to surgery. For daily volumetrics, animals are placed under
brief halothane anesthetic (rapid immobilization and quick recovery), and both legs are shaved
and equally marked using waterproof marker. Legs are first dipped in water, then
dipped into the instrument to each marked level and measured by Buxco edema
software (Chen/Victor). Data are recorded by one person, while the other dips the limb
to the marked area.

Blood-plasma protein measurements: Blood is drawn, spun, and serum separated 
prior to surgery and then at conclusion for total protein and Ca2+ comparison. 

Limb Weight Comparison: After drawing blood, the animal is prepared for tissue 
collection. The limbs are amputated using a guillotine, then both experimental and control
legs are cut at the ligature and weighed. A second weighing is done as the tibio-calcaneal
joint is disarticulated and the foot is weighed. 

Histological Preparations: The transverse muscle located behind the knee 
(popliteal) area is dissected and arranged in a metal mold, filled with freezeGel, dipped 
into cold methylbutane, and placed into labeled sample bags at -80°C until sectioning. Upon
sectioning, the muscle is observed under fluorescent microscopy for lymphatics.

The studies described in this example tested activity of a polypeptide of the 
invention. However, one skilled in the art could easily modify the exemplified studies to 
test the activity of polynucleotides (e.g., gene therapy), agonists, and/or antagonists of the 
invention. 

Example 52: Suppression of TNF alpha-induced adhesion molecule expression by a
Polypeptide of the Invention

The recruitment of lymphocytes to areas of inflammation and angiogenesis 
involves specific receptor-ligand interactions between cell surface adhesion molecules
(CAMs) on lymphocytes and the vascular endothelium. The adhesion process, in both
normal and pathological settings, follows a multi-step cascade that involves intercellular 
adhesion molecule-1 (ICAM-1), vascular cell adhesion molecule-1 (VCAM-1), and 
endothelial leukocyte adhesion molecule-1 (E-selectin) expression on endothelial cells 
(EC). The expression of these molecules and others on the vascular endothelium 
determines the efficiency with which leukocytes may adhere to the local vasculature and 
extravasate into the local tissue during the development of an inflammatory response. The 
local concentration of cytokines and growth factors participates in the modulation of the
expression of these CAMs. 

Tumor necrosis factor alpha (TNF-α), a potent proinflammatory cytokine, is a
stimulator of all three CAMs on endothelial cells and may be involved in a wide variety of
inflammatory responses, often resulting in a pathological outcome.

The potential of a polypeptide of the invention to mediate a suppression of TNF-α
induced CAM expression can be examined. A modified ELISA assay which uses ECs as a
solid phase absorbent is employed to measure the amount of CAM expression on TNF-α
treated ECs when co-stimulated with a member of the FGF family of proteins.

To perform the experiment, human umbilical vein endothelial cell (HUVEC)
cultures are obtained from pooled cord harvests and maintained in growth medium (EGM-2;
Clonetics, San Diego, CA) supplemented with 10% FCS and 1% penicillin/streptomycin
in a 37°C humidified incubator containing 5% CO2. HUVECs are seeded in 96-well
plates at concentrations of 1 x 10^4 cells/well in EGM medium at 37°C for 18-24 hrs
or until confluent. The monolayers are subsequently washed 3 times with a serum-free
solution of RPMI-1640 supplemented with 100 U/ml penicillin and 100 mg/ml
streptomycin, and treated with a given cytokine and/or growth factor(s) for 24 h at 37°C.
Following incubation, the cells are then evaluated for CAM expression.

Human Umbilical Vein Endothelial cells (HUVECs) are grown in a standard 96
well plate to confluence. Growth medium is removed from the cells and replaced with 90
µl of 199 Medium (10% FBS). Samples for testing and positive or negative controls are
added to the plate in triplicate (in 10 µl volumes). Plates are incubated at 37°C for
either 5 h (selectin and integrin expression) or 24 h (integrin expression only). Plates are
aspirated to remove medium and 100 µl of 0.1% paraformaldehyde-PBS (with Ca++ and
Mg++) is added to each well. Plates are held at 4°C for 30 min.
Fixative is then removed from the wells and wells are washed 1X with
PBS(+Ca,Mg)+0.5% BSA and drained. Do not allow the wells to dry. Add 10 µl of
diluted primary antibody to the test and control wells. Anti-ICAM-1-Biotin, Anti-VCAM-1-Biotin
and Anti-E-selectin-Biotin are used at a concentration of 10 µg/ml (1:10 dilution
of 0.1 mg/ml stock antibody). Cells are incubated at 37°C for 30 min. in a humidified
environment. Wells are washed X3 with PBS(+Ca,Mg)+0.5% BSA.

Then 20 µl of diluted ExtrAvidin-Alkaline Phosphatase (1:5,000 dilution) is added to
each well and incubated at 37°C for 30 min. Wells are washed X3 with
PBS(+Ca,Mg)+0.5% BSA. 1 tablet of p-Nitrophenol Phosphate (pNPP) is dissolved in 5 ml
of glycine buffer (pH 10.4). 100 µl of pNPP substrate in glycine buffer is added to each
test well. Standard wells in triplicate are prepared from the working dilution of the
ExtrAvidin-Alkaline Phosphatase in glycine buffer: 1:5,000 (10^0) > 10^-0.5 > 10^-1 > 10^-1.5.
5 µl of each dilution is added to triplicate wells and the resulting AP content in each well is
5.50 ng, 1.74 ng, 0.55 ng, 0.18 ng. 100 µl of pNPP reagent must then be added to each of
the standard wells. The plate must be incubated at 37°C for 4 h. A volume of 50 µl of 3M
NaOH is added to all wells. The results are quantified on a plate reader at 405 nm. The
background subtraction option is used on blank wells filled with glycine buffer only. The
template is set up to indicate the concentration of AP-conjugate in each standard well
[5.50 ng; 1.74 ng; 0.55 ng; 0.18 ng]. Results are indicated as amount of bound AP-conjugate
in each sample.
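
The standard series described above is a half-log dilution series. The following sketch shows how the stated AP contents follow from the 5.50 ng top standard. The function name is hypothetical, and the sketch assumes that each step is an exact factor of 10^-0.5. Under that assumption the last standard computes to about 0.17 ng, slightly below the 0.18 ng given in the text.

```python
def half_log_series(top_ng, exponents=(0.0, -0.5, -1.0, -1.5)):
    """AP content (ng) for each standard, scaling the top standard by 10**exponent."""
    return [top_ng * 10 ** e for e in exponents]


standards = half_log_series(5.50)
for exponent, ng in zip((0.0, -0.5, -1.0, -1.5), standards):
    print(f"10^{exponent:+.1f}: {ng:.2f} ng")
```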

The studies described in this example tested activity of a polypeptide of the
invention. However, one skilled in the art could easily modify the exemplified studies to
test the activity of polynucleotides (e.g., gene therapy), agonists, and/or antagonists of the 
invention. 
It will be clear that the invention may be practiced otherwise than as particularly
described in the foregoing description and examples. Numerous modifications and 
variations of the present invention are possible in light of the above teachings and, 
therefore, are within the scope of the appended claims. 

The entire disclosure of each document cited (including patents, patent 
applications, journal articles, abstracts, laboratory manuals, books, or other disclosures) in 
the Background of the Invention, Detailed Description, and Examples is hereby 
incorporated herein by reference. Further, the hard copy of the sequence listing submitted 
herewith and the corresponding computer readable form are both incorporated herein by 
reference in their entireties. 

Example 53: Cloning, sequence analysis and chromosomal localization of the novel
human integrin alpha 11 subunit. 

Abstract 

The integrins are a large family of cell adhesion molecules consisting of noncovalently
associated αβ heterodimers. We have cloned and sequenced a novel human integrin α-subunit
cDNA, designated α11. The α11 cDNA encodes a protein with a 22 amino acid
signal peptide, a large 1120 residue extracellular domain that contains an I-domain of 207
residues and is linked by a transmembrane domain to a short cytoplasmic domain of 24
amino acids. The deduced α11 protein shows the typical structural features of integrin α-subunits
and is similar to a distinct group of α-subunits from collagen-binding integrins.
However, it differs from most integrin α-chains by an incompletely preserved
cytoplasmic GFFKR motif. The human ITGA11 gene was located to bands q22.3-23 on
chromosome 15, and its transcripts were found predominantly in bone, cartilage as well as
in cardiac and skeletal muscle. Expression of the 5.5 kilobase α11 mRNA was also
detectable in ovary and small intestine.
Introduction

All vertebrate cells express members of the integrin family of cell adhesion molecules,
which mediate cellular adhesion to other cells and extracellular substratum, cell migration
and participate in important physiologic processes from signal transduction to cell
proliferation and differentiation {Hynes, 92; Springer, 92}. Integrins are structurally
homologous heterodimeric type-I membrane glycoproteins formed by the noncovalent
association of one of eight β-subunits with one of the 17 different α-subunits described to
date, resulting in at least 22 different αβ complexes. Their binding specificities for cellular
and extracellular ligands are determined by both subunits and are dynamically regulated in
a cell-type-specific mode by the cellular environment as well as by the developmental and
activation state of the cell {Diamond and Springer, 94}. In integrin α-subunits, the
aminoterminal region of the large extracellular domain consists of a seven-fold repeated
structure which is predicted to fold into a β-propeller domain {Corbi et al., 1987;
Springer, 1997}. The three or four C-terminal repeats contain putative divalent cation
binding motifs that are thought to be important for ligand binding and subunit association
{Diamond and Springer, 94}. The α1, α2, α10, αD, αE, αL, αM and αX-subunits contain an
approximately 200 amino acid I-domain inserted between the second and third repeat that
is not present in other α-chains {Larson et al., 1989}. Several isolated I-domains have been
shown to independently bind the ligands of the parent integrin heterodimer {Kamata and
Takada, 1994; Randi and Hogg, 1994}. The α3, α5-8, αIIb and αV-subunits are
proteolytically processed at a conserved site into disulphide-linked heavy and light chains,
while the α4-subunit is cleaved at a more aminoterminal site into two fragments that
remain noncovalently associated {Hemler et al., 90}. Additional α-subunit variants are
generated by alternative splicing of primary transcripts {Ziober et al., 93; Delwel et al., 95;
Leung et al., 98}. The extracellular domains of α-integrin subunits are connected by a
single spanning transmembrane domain to short, diverse cytoplasmic domains whose only
conserved feature is a membrane-proximal KXGFF(K/R)R motif {Sastry and Horwitz,
1993}. The cytoplasmic domains have been implicated in the cell-type-specific modulation
of integrin affinity states {Williams et al., 1994}.

Here we report the cDNA cloning, sequence analysis, expression and chromosomal
localization of the human α11-integrin subunit.

Materials and Methods 

Library screening and DNA sequencing. 

A human fetal heart cDNA library in λgt10 (Clontech Laboratories, Inc., Palo Alto, CA,
USA) was screened with 32P-labelled (rediprime, Amersham New Zealand Ltd., Auckland,
New Zealand) probes corresponding to the regions 473 to 749 and 2394 to 3189 of the α11
cDNA using standard procedures. Inserts were subcloned from λgt10 into pUC21 and
sequenced on both strands according to a successive specific primer strategy on an
automated sequencer (Applied Biosystems 373A, The Centre for Gene Technology, School
of Biological Sciences, The University of Auckland).

Northern Blot Analysis and Tissue distribution. 

A 1341bp PCR fragment corresponding to the region 351-1692 of the α11 cDNA was 32P-labelled
(rediprime) and hybridized with human multiple tissue Northern blots (MTN I and
MTN II, Clontech) for 16h at 60°C in ExpressHyb solution (Clontech). Filters were
washed twice with 0.1xSSC/1% SDS at 50°C for 30min, and autoradiographed. Human
DNA from 63 tissue-specific cDNA libraries (Express-Check™, American Type Culture
Collection, Manassas, VA, USA) was amplified using primers KL120
(5'-GCAGGGATGCCACCTGCC) and KL119 (5'-GATGAAGACTGTGGTGTCGAAGG)
according to the manufacturer's instructions. PCR-products were resolved by agarose gel
electrophoresis and transferred to Hybond C+ (Amersham). Filters were hybridized by
standard procedures {Ausubel et al., 98} with a 502bp 32P-labelled (rediprime) probe
fragment obtained from the cloned α11 cDNA with the same oligonucleotides.
Chromosomal assignment.

500ng genomic DNA prepared from a panel of 21 human-rodent somatic cell hybrids or
from human, mouse and hamster cells {Kelsell et al., 95} was amplified with
oligonucleotides KL175 (5'-GGTGCCAGACCTACATGGAC) and KL189
(5'-CGTGCAAATTCAATGCCAAATGCC) in a standard PCR reaction of 30 cycles (94°C
for 1min, 55°C for 1min, 72°C for 2min). All PCR reactions were resolved in a 2%
agarose gel. Southern hybridization was performed as detailed above, except that the probe
fragment was obtained from clone HOHBY69 with oligonucleotides KL175 and KL189.
For fluorescent in situ hybridization, metaphase spreads were prepared from
phytohemagglutinin-stimulated peripheral blood lymphocytes of a 46,XY male donor
using standard cytogenetic procedures. A purified 3.7kb fragment representing the entire
coding region of clone HOHBY69 was labelled with biotin-16-dUTP using the High Prime
labelling kit (Roche Molecular Biochemicals, Auckland, NZ). Conditions for hybridization
and immunofluorescent detection were essentially as described {Morris et al., 93}, except
that Cot-1 suppression was not required, slides were washed to a stringency of
0.1xSSC/60°C after hybridization, and an additional amplification step was needed
because of the small size of the probe. For precise chromosome band localization, DAPI
and FITC images were captured using a Photometrics KAF1400 CCD camera and QUIPS
Smartcapture FISH software version 1.3 (Vysis Inc., Downers Grove, IL, USA). QUIPS
CGH/Karyotyping software (version 3.0.2) assisted karyotype analysis.

Results 

Cloning of a novel human α-integrin subunit cDNA:

A protein homology search {Altschul et al., 90} of the human expressed sequence tag
(EST) databases of Human Genome Sciences, Inc. {Ni et al., 97} and The Institute for
Genomic Research {Kirkness and Kerlavage, 97} identified the clones HRDAF83 and
HOEAM34 as candidate novel integrin α-subunit cDNAs. Clone HRDAF83 was isolated
from a human rhabdomyosarcoma cDNA library and sequenced on both strands. The
1223bp insert contains largely incompletely processed hnRNA and a 277bp region that
showed homology to the aminoterminal half of the α1-integrin I-domain. The 2517bp
insert of clone HOEAM34 was derived from a human osteoblast cDNA library. It is
homologous to the C-terminal part of the human α1-subunit and contains 1324 nucleotides
of 3'-untranslated region. In order to isolate the full-length cDNAs for these integrin α-subunits,
a cDNA library prepared from human fetal heart in λgt10 was screened with the
277bp fragment from clone HRDAF83 homologous to the α1-I-domain. Two clones, λ831
and λ832, were isolated and both strands of their inserts sequenced. Clone λ832 contains
the entire 5' half of a novel α-subunit cDNA, while clone λ831 covers the same region, but
is 358bp and 173bp shorter than λ832 at its 5'- and 3'-ends, respectively. A screening of the
same library with a 795bp fragment from the extreme 5' terminus of clone HOEAM34
identified clone λ342, which contained essentially the same region as clone HOEAM34
but has a 317bp shorter 3'-untranslated region. Rescreening the EST databases with the
sequences derived from the human fetal heart library led to the identification of clone
HOHBY69, which was isolated from an osteoblast cDNA library. Both strands of the 4681bp
insert of clone HOHBY69 were sequenced. The 5'-region of HOHBY69 was identical to
the HRDAF83/λ832/λ831-group, while the 3'-region of HOHBY69 was largely identical
to HOEAM34 and λ342, thereby demonstrating that the two groups of partial cDNAs
represent the 5'- and 3'-portions of the same cDNA. One major difference between
HOHBY69 and HOEAM34/λ342 is the presence of an additional GTA-triplet at position
3088 in HOHBY69. From the overlapping clones, a total of 4986bp of cDNA was
assembled to the composite sequence shown in Figure 2 and has been submitted to
GenBank™ with accession number AF109681. This cDNA encodes a previously
unidentified human integrin α-subunit that was designated α11.
Structure of the human α11-subunit.

The α11 cDNA contains a 5'-untranslated region of 72 nucleotides and a single open
reading frame extending from a predicted translation initiation codon at position +1 to a
TGA termination codon at position 3570. This is followed by 1324 nucleotides of 3'
untranslated region which contains an AATTAAA polyadenylation signal {Wahle and
Keller, 1996} 12 nucleotides upstream of a poly(A) stretch. The deduced amino acid
sequence contains a 22 residue N-terminal region with the characteristics of a cleaved
signal peptide {von Heijne, 83; Nielsen et al., 97}, a large extracellular domain of 1120
amino acids followed by a 23 amino acid hydrophobic stretch that resembles a
transmembrane domain, and a short 24 residue cytoplasmic domain. The molecular weight
of the mature 1167 amino acid α11-subunit is predicted to be 131 kDa, but the addition of
carbohydrate side chains to any of the 15 potential N-glycosylation sequons [NX(S/T)]
within the extracellular domain is likely to increase the molecular weight of the native
protein. An I-domain of 207 amino acids is inserted between the second and third repeat.
Consistent with the structure of a typical I-domain-containing integrin α-subunit, it lacks
a potential dibasic protease cleavage site in the C-terminal region of the extracellular
domain.
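
Potential N-glycosylation sequons of the NX(S/T) form mentioned above can be located with a simple pattern scan. The sketch below is illustrative only: the function name and the toy peptide are hypothetical (the toy peptide is not the α11 sequence), and the exclusion of proline at the X position is a commonly used convention assumed here rather than something stated in the text.

```python
import re

# A lookahead is used so that overlapping sequons are all reported.
# X is taken to be any residue except proline (assumed convention).
SEQUON = re.compile(r"(?=N[^P][ST])")


def find_sequons(protein):
    """Return the 1-based positions of the Asn in each N-X-(S/T) sequon."""
    return [m.start() + 1 for m in SEQUON.finditer(protein.upper())]


toy_peptide = "MKNGSAANPTQNNTS"   # hypothetical sequence for illustration
print(find_sequons(toy_peptide))  # [3, 12, 13]
```

Applied to the deduced extracellular domain, a scan of this kind would be expected to return the 15 candidate sites referred to above, subject to the convention chosen for X.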

The α11-subunit is most closely related to the recently discovered α10-subunit {Camper et
al., 98; Lehnert et al., in preparation} and the α1- and α2-subunits. Overall, the mature
α11-protein is 45% identical to the α10 chain, while the homologies to the α1- and α2-subunits
are 41% and 39%, respectively. Even greater homology exists between the I-domains
of the α10- and α11-subunits which are 60% identical to each other. The high
degree of homology seen in the extracellular domains of the subunits is in contrast to the
low similarity of their cytoplasmic domains. Interestingly, the KXGFF(K/R)R motif that is
absolutely conserved in all other α-subunit cytoplasmic domains is only partially preserved
in both subunits. The sequence in α11 is KLGFFRS, while the α10-subunit contains a
KLGFFAH motif. A graphical comparison of the similarity between all integrin α-subunits
is shown in Fig. 3. Together with the α-subunits from the collagen-binding integrins α1β1,
α2β1 and α10β1, the α11-subunit forms a group distinct from the other I-domain-containing
integrin subunits.
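
The partial preservation of the KXGFF(K/R)R motif can be made explicit by comparing the two sequences quoted above with the consensus, position by position. This is a minimal sketch; the function name is hypothetical, and the fully conforming sequence KVGFFKR is included only as an illustrative positive control.

```python
# Allowed residues at each of the seven motif positions; None means any residue (X).
CONSENSUS = ["K", None, "G", "F", "F", "KR", "R"]


def motif_mismatches(seq):
    """Return the 1-based positions at which seq departs from KXGFF(K/R)R."""
    if len(seq) != len(CONSENSUS):
        raise ValueError("expected a 7-residue motif")
    return [
        i + 1
        for i, (residue, allowed) in enumerate(zip(seq.upper(), CONSENSUS))
        if allowed is not None and residue not in allowed
    ]


for name, motif in [("alpha11", "KLGFFRS"), ("alpha10", "KLGFFAH"), ("control", "KVGFFKR")]:
    print(name, motif, motif_mismatches(motif))
```

Under this comparison the α11 sequence departs from the consensus only at the final position, whereas the α10 sequence departs at the last two positions.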

Tissue distribution and expression of the integrin α11-subunit.

The tissue distribution of the α11 mRNA was assessed by screening multiple human tissue
Northern blots with a probe corresponding to the region 351-1692 of the α11 cDNA. A
single transcript of approximately 5.5kb was found weakly expressed only in ovary and
small intestine. Integrin α11-subunit expression was further analyzed by amplification
and Southern hybridization of a 502bp fragment corresponding to the region 1988-2490 in
the α11 cDNA from tissue-specific human cDNA libraries. α11 cDNA was detected in
five different cDNA libraries prepared from fetal heart (day 57-75), in two fetal brain
libraries, and in a cDNA library from large intestine (not shown). An analysis of the
Human Genome Sciences Database revealed eight different α11-related ESTs in human
osteoblast libraries, three ESTs in a human chondrosarcoma cDNA library and two ESTs in a
human stromal osteoclastoma library.

Chromosomal localization of the integrin α11-subunit.

Genomic DNA from a collection of 21 human-rodent somatic cell hybrids {Kelsell et al.,
95} was amplified by PCR using oligonucleotide primers directed to the region 473 to
749 of the human α11 cDNA. In Southern hybridization, a signal corresponding to a 1.4kb
fragment was detectable only with DNA from a hybrid cell line that contains human
chromosome 15. A fragment of the same size was also amplified from human genomic
DNA, but not from mouse or hamster DNA (Fig. 5C). Cloning and sequencing of the PCR
product from chromosome 15 revealed the presence of a 1154bp intron inserted after
cDNA-position 600, thus resulting in a PCR-product of 1431bp. The ITGA11 gene was
also localized by fluorescent in situ hybridization of metaphase chromosomes with the
entire coding region from clone HOHBY69. All of 20 metaphase cells analyzed showed
fluorescent signal on both chromosomes 15, specifically across bands q22.3-q23. No
additional signals were detected on any other chromosome (Fig. 5A).
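
The size of the genomic PCR product follows directly from the primer positions and the intron length given above. The sketch below reproduces that arithmetic. The function name is hypothetical, and the sketch assumes the amplified cDNA region is counted inclusively (positions 473 to 749, i.e. 277 bp).

```python
def amplicon_length(start, end, introns=()):
    """Length of a PCR product spanning cDNA positions start..end (inclusive).

    introns is a sequence of (cDNA position after which the intron is inserted,
    intron length in bp); only introns lying between the primers are counted.
    """
    cdna_length = end - start + 1
    inserted = sum(length for position, length in introns if start <= position < end)
    return cdna_length + inserted


cdna_product = amplicon_length(473, 749)
genomic_product = amplicon_length(473, 749, introns=[(600, 1154)])
print(cdna_product, genomic_product)  # 277 1431
```

The 1431 bp result is consistent with the approximately 1.4 kb signal seen in the Southern hybridization.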

Discussion: 

We have cloned and sequenced a novel cDNA encoding a protein that shares extensive
structural homology with integrin α-chains. The aminoterminal 22 amino acids of the
deduced protein sequence show the characteristic features of a hydrophobic leader peptide,
including a signal peptidase recognition motif at positions -3 and -1 {von Heijne, 83}.
Proteolytic cleavage of the precursor protein at this position would result in an
aminoterminal sequence for the mature α11-chain of FNMD, which is similar to the
consensus sequence [(F/Y)N(L/V)D] of all other integrin α-subunits {Tuckwell et al.,
94}. The N-terminal half of the large extracellular region of α11 is composed of seven
repeats that each contain FG-GAP-GxxY consensus motifs (FG-GAP repeats). These
repeats can be found in all integrin α-subunits and are predicted to fold into a β-propeller
domain {Springer, 97}. Inserted between the second and third FG-GAP repeats is a 207
amino acid I-domain spanning from glutamine 138 to methionine 344. It contains a divalent
cation coordination motif that has been shown to directly bind Mg2+ ions in the αM subunit
{Michishita et al., 96}. The noncontiguous amino acid side chains involved in the
coordination of magnesium or manganese ions have been identified by mutagenesis
analysis and from crystal structures of the isolated α2, αL and αM-subunit I-domains
{Emsley et al., 97; Qu and Leahy, 95; Lee et al., 95}. All residues required for the
coordination of the divalent cations in these subunits are preserved in the α11-I-domain.
These are the asparagines at positions 148 and 249, the serine residues at position 150 and
152, and the threonine at position 218.

The crystal structure of the α2-subunit has revealed a small C-helix that is not present in
the I-domains of the β2-associated α-subunits. Together with the MIDAS sphere, amino
acid residues from this C-helix and the adjacent turn region have been proposed to make
physical contacts to a collagen triple helix {Emsley et al., 97}. Interestingly, the small C-helix
is structurally conserved in the α-subunits of the collagen-binding integrins α1β1,
α2β1 and α10β1, and is also present in the α11 I-domain (G279YYNR283). In addition,
asparagine 154 and histidine 258 of the α2-I-domain were predicted to contact the collagen
triple helix, and both are preserved in the α1, α10 and α11-I-domains, but not in other
integrin α-subunits. The conservation of structural motifs required for collagen binding
suggests that collagen may be a ligand for the α11 integrin. Each of the repeats 5-7 of the
α11-subunit accommodates the sequence Dx(D/N)xDxxxD. Three or four copies of these
putative divalent cation binding sites are conserved in all integrin α-subunits and their
presence is consistent with the divalent cation requirement for the adhesive function of
integrins {Larson et al., 89; Fujimura and Phillips, 83; Hynes, 92}. The extracellular
domain of the integrin α11-subunit contains 20 cysteine residues. Only the intramolecular
disulfide bonds in the αIIb subunit have been biochemically characterized {Calvete et al.,
89}, but the location of many cysteines is conserved in integrin α-subunits. In the α11-subunit,
the cysteine residues 637 and 646, 652 and 707, 759 and 765, and 859 and 871 are
homologous to the residues that form the four carboxyterminal disulfide bonds in the
heavy chain of αIIb {Calvete et al., 89}. Based on the proposed structure of the integrin α-subunit
propeller domain {Springer et al., 97}, additional disulfide bonds within the α11
subunit can be predicted between cysteine residues 54 and 61, 99 and 117, and between
107 and 137. Two additional cysteine residues are found within a short segment (residues
783 to 798) that is unique to the α11-subunit.

The integrin cytoplasmic domains play central roles in integrin affinity modulation and in
cellular signal transmission. The membrane-proximal sequence KxGFF(K/R)R is strictly
conserved among integrin α-subunit cytoplasmic domains {Williams et al., 94}. Within
this motif, both phenylalanine residues and the last arginine have been implicated in
maintaining the default low affinity state of integrins αLβ2 and αIIbβ3, as their substitution
or deletion resulted in constitutively activated ligand binding {O'Toole et al., 94; Lu and
Springer, 97}. Interestingly, the last arginine residue is replaced by a serine in the α11
cytoplasmic domain and with a histidine in the α10 subunit, suggesting that both integrins
might be in a default "high" affinity state. It will be interesting to analyze whether
substitution of these residues with a conserved arginine will affect their affinity status.
We have isolated all cDNAs from osteoclast, osteoblast, myosarcoma and fetal heart 
libraries. Amongst the HGS EST databases, integrin all transcripts were predominantly 
5 found in libraries prepared from osteoblast, osteoclast and chondrosarcoma cells. A search 
for further Ml '-related sequences in the EST division of the GenBank database revealed two 
clones (accession numbers Z50157 and Z50167) from primary human myoblasts {Genini 
et al, 96}, two clones from human trabecular bone cells (AA852614 and AA852615), as 
well as clones from fibroblast cells (W45078), pancreatic tumor (U53091) and breast tissue 

10 (HI 61 12). In contrast, Northern blot analysis detected al 1 -expression only in ovary and 
small intestine, only fetal heart, fetal brain and large intestine. Of the tissues represented in 
the tissue-specific cDNA-library panel, only fetal heart, fetal brain and large intestine 
showed detectable all-expression. However, bone- and muscle-derived tissues were not 
included in the Northern blot, and cDNA libraries prepared from these tissues were also 

15 not represented in the tissue-specific cDNA panel. 

The ITGA11 gene was localized to chromosome 15, bands q22.3-q23, by FISH and
PCR analysis of human-rodent somatic cell hybrids. This segment is overrepresented in
squamous cell carcinomas {Wolff et al., 1998}, but appears to be only infrequently affected in
other cancers. Genes at this region encode neogenin, a protein ubiquitously
expressed in human tissues {Meyerhardt et al., 97}; tropomyosin 1, expressed in cardiac
and skeletal muscle tissues {Tiso et al., 97}; and the human homologue of the
metalloprotease-disintegrin kuzbanian, which is overexpressed in tumors of
sympathoadrenal origin {Yavari et al., 98}. In addition, the region 15q22.3-q23 is linked to
Bardet-Biedl syndrome 4, a heterogeneous autosomal disorder characterized by obesity
and associated with cardiovascular anomalies {Carmi et al., 95}.


In conclusion, we have cloned and sequenced the cDNA for the novel integrin α11-subunit,
which is closely related to the α-subunits of the collagen-binding integrins α1β1, α2β1 and
α10β1. The high degree of homology of α11 to these subunits suggests that it associates
with the integrin β1-subunit and may function as an additional collagen receptor.

CANADA

The applicant requests that, until either a Canadian patent has been issued on the basis of an
application or the application has been refused, or is abandoned and no longer subject to
reinstatement, or is withdrawn, the Commissioner of Patents only authorizes the furnishing of
a sample of the deposited biological material referred to in the application to an independent
expert nominated by the Commissioner. The applicant must, by a written statement, inform
the International Bureau accordingly before completion of technical preparations for
publication of the international application.

NORWAY 

The applicant hereby requests that, until the application has been laid open to public
inspection (by the Norwegian Patent Office), or has been finally decided upon by the
Norwegian Patent Office without having been laid open to public inspection, the furnishing
of a sample shall only be effected to an expert in the art. The request to this effect shall be
filed by the applicant with
the Norwegian Patent Office not later than at the time when the application is made available 
to the public under Sections 22 and 33(3) of the Norwegian Patents Act. If such a request has 
been filed by the applicant, any request made by a third party for the furnishing of a sample 
shall indicate the expert to be used. That expert may be any person entered on the list of 
recognized experts drawn up by the Norwegian Patent Office or any person approved by the 
applicant in the individual case. 

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a microorganism shall 
only be effected prior to the grant of a patent, or prior to the lapsing, refusal or withdrawal of 
the application, to a person who is a skilled addressee without an interest in the invention 
(Regulation 3.25(3) of the Australian Patents Regulations). 

FINLAND 

The applicant hereby requests that, until the application has been laid open to public 
inspection (by the National Board of Patents and Registration), or has been finally decided
upon by the National Board of Patents and Registration without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an expert in the art. 

UNITED KINGDOM 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only 
be made available to an expert. The request to this effect must be filed by the applicant with 
the International Bureau before the completion of the technical preparations for the 
international publication of the application. 


SUBSTITUTE SHEET (RULE 26) 


WO 00/29435 


PCT/US99/25031 



ATCC Deposit No. 203484 


DENMARK 

The applicant hereby requests that, until the application has been laid open to public 
inspection (by the Danish Patent Office), or has been finally decided upon by the Danish 
Patent Office without having been laid open to public inspection, the furnishing of a sample
shall only be effected to an expert in the art. The request to this effect shall be filed by the
applicant with the Danish Patent Office not later than at the time when the application is made
available to the public under Sections 22 and 33(3) of the Danish Patents Act. If such a
request has been filed by the applicant, any request made by a third party for the furnishing of
a sample shall indicate the expert to be used. That expert may be any person entered on a list
of recognized experts drawn up by the Danish Patent Office or any person approved by the applicant in
the individual case. 

SWEDEN 

The applicant hereby requests that, until the application has been laid open to public 
inspection (by the Swedish Patent Office), or has been finally decided upon by the Swedish 
Patent Office without having been laid open to public inspection, the furnishing of a sample 
shall only be effected to an expert in the art. The request to this effect shall be filed by the 
applicant with the International Bureau before the expiration of 16 months from the priority 
date (preferably on the Form PCT/RO/134 reproduced in annex Z of Volume I of the PCT 
Applicant's Guide). If such a request has been filed by the applicant any request made by a 
third party for the furnishing of a sample shall indicate the expert to be used. That expert 
may be any person entered on a list of recognized experts drawn up by the Swedish Patent 
Office or any person approved by the applicant in the individual case.

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands patent or until the 
date on which the application is refused or withdrawn or lapsed, the microorganism shall be 
made available as provided in 31F(1) of the Patent Rules only by the issue of a sample to
an expert. The request to this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made available to the 
public under Section 22C or Section 25 of the Patents Act of the Kingdom of the Netherlands, 
whichever of the two dates occurs earlier. 

1. An isolated nucleic acid molecule comprising a polynucleotide having a
nucleotide sequence at least 95% identical to a sequence selected from the group consisting 
of: 

(a) a polynucleotide fragment of SEQ ID NO:X or a polynucleotide fragment of 
the cDNA sequence included in ATCC Deposit No:Z, which is hybridizable to SEQ ID 
NO:X; 

(b) a polynucleotide encoding a polypeptide fragment of SEQ ID NO:Y or a 
polypeptide fragment encoded by the cDNA sequence included in ATCC Deposit No:Z, 
which is hybridizable to SEQ ID NO:X; 

(c) a polynucleotide encoding a polypeptide domain of SEQ ID NO:Y or a 
polypeptide domain encoded by the cDNA sequence included in ATCC Deposit No:Z, 
which is hybridizable to SEQ ID NO:X; 

(d) a polynucleotide encoding a polypeptide epitope of SEQ ID NO:Y or a 
polypeptide epitope encoded by the cDNA sequence included in ATCC Deposit No:Z, 
which is hybridizable to SEQ ID NO:X; 

(e) a polynucleotide encoding a polypeptide of SEQ ID NO:Y or the cDNA 
sequence included in ATCC Deposit No:Z, which is hybridizable to SEQ ID NO:X, having 
biological activity; 

(f) a polynucleotide which is a variant of SEQ ID NO:X; 

(g) a polynucleotide which is an allelic variant of SEQ ID NO:X; 

(h) a polynucleotide which encodes a species homologue of the SEQ ID NO:Y; 

(i) a polynucleotide capable of hybridizing under stringent conditions to any one of 
the polynucleotides specified in (a)-(h), wherein said polynucleotide does not hybridize
under stringent conditions to a nucleic acid molecule having a nucleotide sequence of only 
A residues or of only T residues. 
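
As a rough illustration of the "at least 95% identical" criterion recited in claim 1, percent identity can be computed over two sequences that have already been aligned. The sketch below assumes a pre-computed alignment with "-" as the gap character and counts identical, non-gap columns over the full alignment length. This convention, the function names and the example sequences are assumptions made for illustration; the identity measure that governs the claims is the one defined in the specification.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of alignment columns holding the same non-gap residue."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("empty alignment")
    pairs = zip(aligned_a.upper(), aligned_b.upper())
    matches = sum(1 for a, b in pairs if a == b and a != "-")
    return 100.0 * matches / len(aligned_a)


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float = 95.0) -> bool:
    """True when the identity reaches the stated threshold."""
    return percent_identity(aligned_a, aligned_b) >= threshold


# Hypothetical 20-nucleotide reference and two variants of it.
reference = "ACGTACGTACGTACGTACGT"
one_mismatch = "ACGTACGTACGTACGTACGA"
two_mismatches = "TCGTACGTACGTACGTACGA"
print(percent_identity(reference, one_mismatch))
print(meets_threshold(reference, two_mismatches))
```

Other conventions, such as dividing by the length of the reference sequence rather than the alignment length, give different values when gaps are present.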

2. The isolated nucleic acid molecule of claim 1, wherein the polynucleotide 
fragment comprises a nucleotide sequence encoding a secreted protein. 

3. The isolated nucleic acid molecule of claim 1, wherein the polynucleotide 
fragment comprises a nucleotide sequence encoding the sequence identified as SEQ ID 
NO:Y or the polypeptide encoded by the cDNA sequence included in ATCC Deposit
No:Z, which is hybridizable to SEQ ID NO:X. 

4. The isolated nucleic acid molecule of claim 1, wherein the polynucleotide
fragment comprises the entire nucleotide sequence of SEQ ID NO:X or the cDNA 
sequence included in ATCC Deposit No:Z, which is hybridizable to SEQ ID NO:X. 

5. The isolated nucleic acid molecule of claim 2, wherein the nucleotide 
sequence comprises sequential nucleotide deletions from either the C-terminus or the N- 
terminus. 

6. The isolated nucleic acid molecule of claim 3, wherein the nucleotide 
sequence comprises sequential nucleotide deletions from either the C-terminus or the N- 
terminus. 

7. A recombinant vector comprising the isolated nucleic acid molecule of 
claim 1. 

8. A method of making a recombinant host cell comprising the isolated nucleic 
acid molecule of claim 1. 

9. A recombinant host cell produced by the method of claim 8.

10. The recombinant host cell of claim 9 comprising vector sequences. 

11. An isolated polypeptide comprising an amino acid sequence at least 95% 
identical to a sequence selected from the group consisting of: 

(a) a polypeptide fragment of SEQ ID NO:Y or the encoded sequence included in 
ATCC Deposit No:Z; 

(b) a polypeptide fragment of SEQ ID NO:Y or the encoded sequence included in 
ATCC Deposit No:Z, having biological activity; 

(c) a polypeptide domain of SEQ ID NO:Y or the encoded sequence included in 
ATCC Deposit No:Z; 

(d) a polypeptide epitope of SEQ ID NO:Y or the encoded sequence included in
ATCC Deposit No:Z; 

(e) a secreted form of SEQ ID NO:Y or the encoded sequence included in ATCC 
Deposit No:Z; 

(f) a full length protein of SEQ ID NO:Y or the encoded sequence included in 
ATCC Deposit No:Z; 

(g) a variant of SEQ ID NO:Y;

(h) an allelic variant of SEQ ID NO:Y; or

(i) a species homologue of the SEQ ID NO:Y. 

12. The isolated polypeptide of claim 11, wherein the secreted form or the full 
length protein comprises sequential amino acid deletions from either the C-terminus or the 
N-terminus. 

13. An isolated antibody that binds specifically to the isolated polypeptide of
claim 11.

14. A recombinant host cell that expresses the isolated polypeptide of claim 11.

15. A method of making an isolated polypeptide comprising: 

(a) culturing the recombinant host cell of claim 14 under conditions such that said 
polypeptide is expressed; and 

(b) recovering said polypeptide. 

16. The polypeptide produced by claim 15. 

17. A method for preventing, treating, or ameliorating a medical condition, 
comprising administering to a mammalian subject a therapeutically effective amount of the 
polypeptide of claim 11 or the polynucleotide of claim 1.

18. A method of diagnosing a pathological condition or a susceptibility to a 
pathological condition in a subject comprising: 

(a) determining the presence or absence of a mutation in the polynucleotide of
claim 1; and

(b) diagnosing a pathological condition or a susceptibility to a pathological
condition based on the presence or absence of said mutation.

19. A method of diagnosing a pathological condition or a susceptibility to a 
pathological condition in a subject comprising: 

(a) determining the presence or amount of expression of the polypeptide of claim
11 in a biological sample; and

(b) diagnosing a pathological condition or a susceptibility to a pathological 
condition based on the presence or amount of expression of the polypeptide. 

20. A method for identifying a binding partner to the polypeptide of claim 11
comprising:

(a) contacting the polypeptide of claim 11 with a binding partner; and

(b) determining whether the binding partner effects an activity of the polypeptide. 

21. The gene corresponding to the cDNA sequence of SEQ ID NO:Y.

22. A method of identifying an activity in a biological assay, wherein the 
method comprises: 

(a) expressing SEQ ID NO:X in a cell; 

(b) isolating the supernatant; 

(c) detecting an activity in a biological assay; and 

(d) identifying the protein in the supernatant having the activity. 

23. The product produced by the method of claim 20. 

FIG. 1A

1 GCGTCCGGGCCAAAATGCTGAGAACGTCCACTCCTAATCTGTGTGGTGGTCTGCATTGCC 60

61 GGGCCCCCTGGCTCTCTTCTGGCATTCTCTGCCTCTGCCTCATATTCTTGTTAGGCCAGG 120

121 TGGGCTTGCTGCAGGGACACCCCCAGTGCCTGGATTACGGGCCCCCTTTCCAGCCCCCTC 180

181 TGCACCTTGAGTTTTGCTCTGACTATGAGTCCTTCGGCTGCTGTGATCAGCACAAGGACC 240

241 GCCGCATCGCTGCCCGGTACTGGGACATCATGGAATATTTTGATCTGAAGAGACATGAGC 300

301 TGTGTGGAGATTACATTAAAGACATCCTTTGCCAGGAGTGCTCGCCCTACGCAGCCCACT 360

361 CTACGACGCCGAAAACACCCAGACGCCTCTCCGGAATCTCCCGGGCCTCTGCTCTGATTA 420

421 CTGCTCTGCCTTCCATTCTAACTGTCACTCAGCCATTTCCCTGCTGACCAATGACCGCGG 480

481 CCTCCAGGAGTCTCATGGAAGGGACGGTACCCGCTTCTGCCACCTCCTGGACCTTCCTGA 540

541 CAAGGACTATTGCTTCCCTAATGTCCTGAGGAACGACTATCTCAACCGCCACCTGGGCAT 600

1 M 1

601 GGTGGCCCAAGATCCTCAGGGCTGCCTGCAGCTCTGCCTGAGCGAGGTGGCCAACGGGCT 660

2 VAQDPQGCLQLCLSEVANGL 21

661 GAGGAACCCCGTCTCCATGGTCCATGCTGGGGACGGCACCCATCGCTTCTTTGTTGCCGA 720

22 RNPVSMVHAGDGTHRFFVAE 41

721 GCAGGTAGGAGTGGTGTGGGTCTACCTCCCTGATGGGAGTCGCCTGGAGCAACCCTTCCT 780

42 QVGVVWVYLPDGSRLEQPFL 61

781 GGACCTCAAGAACATCGTGTTGACCACCCCATGGATCGGGGATGAGAGAGGCTTCTTGGG 840

62 DLKNIVLTTPWIGDERGFLG 81

841 GTTGGCTTTTCACCCCAAATTCCGCCACAATCGCAAGTTCTATATTTATTATTCGTGCCT 900

82 LAFHPKFRHNRKFYIYYSCL 101

901 GGACAAGAAGAAGGTAGAAAAGATCCGAATTAGTGAGATGAAGGTTTCTCGGGCTGATCC 960

102 DKKKVEKIRISEMKVSRADP 121

961 TAACAAAGCTGACCTGAAATCAGAGAGGGTCATCTTGGAGATTGAAGAACCAGCCTCAAA 1020

122 NKADLKSERVILEIEEPASN 141

1021 CCATAATGGCGGACAACTTCTTTTTGGCCTGGATGGCTATATGTACATATTCACTGGGGA 1080

142 HNGGQLLFGLDGYMYIFTGD 161

1081 CGGGGGACAGGCTGGAGATCCCTTTGGCCTGTTTGGAAATGCTCAGAACAAAAGTTCCCT 1140

162 GGQAGDPFGLFGNAQNKSSL 181
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1141 GCTGGGAAAAGTTTTAAGGATCGATGTGAACAGGGCAGGCTCACATGGCAAGCGGTACCG 12 00 

182 LGKVLRIDVNRAGSHGKRYR 201 

1201 AGTCCCCTCGGACAATCCATTTGTTTCTGAGCCAGGGGCCCACCCCGCCATCTATGCCTA 1260 

202 VPSDNPFVSEPGAHPAI YAY 221 


1261 TGGGATCAGGAACATGTGGCGTTGTGCTGTGGACCGAGGGGACCCCATCACGCGCCAGGG 132 0 

222 GIRNMWRCAVDRGDPITRQG 241 

1321 CCGAGGCCGGATATTCTGTGGGGACGTGGGCCAGAACAGGTTTGAAGAGGTTGACCTCAT 13 8 0 

242 RGRIFCGDVGQNRFEEVDLI 261 

13 81 TTTGAAAGGTGGAAACTATGGCTGGAGAGCAAAGGAAGGGTTTGCATGTTATGAC AAAAA 144 0 

262 LKGGNYGWRAKEGFACYDKK 281 

1441 ACTTTGTCACAATGCCTCTTTGGATGATGTTCTGCCAATCTATGCTTATGGCCATGCAGT 15 00 

282 LCHNAS LDDVL P IYAYGHAV 301 

1501 GGGGAAGTCAGTCACTGGAGGTTATGTCTATCGTGGTTGTGAATCCCCAAATCTCAATGG 15 6 0 

302 GKSVTGGYVYRGCESPNLNG 321 

1561 C C TGTATATCTTTGGAGAC TTCATGAGTGGTCGACTTATGGCTTTGCAGGAAGATAGAAA 162 0 

322 LYIFGDFMSGRLMALQEDRK 341 


1621 AAACAAGAAATGGAAGAAGCAGGATCTTTGCCTGGGCAGCACCACGTCCTGTGCCTTCCC 16 80 
342 NKKWKKQDLCLGSTTSCAFP 361 


1681 AGGGCTGATCAGCACCC ATAGC AAGTTCATC ATCTCCTTTGCTGAAGATGAAGCAGGGGA 1740 
362 GLISTHSKFIISFAEDEAGE 381 


1741 GCTGTATTTCCTGGCGACCTCTTACCCAAGTGCCTATGCACCACGTGGATCTATTTACAA 1800
382 LYFLATSYPSAYAPRGS IYK 401 


18 01 GTTTGTTGACCCCTCAAGGCGAGCACCCCCAGGCAAGTGCAAATACAAGCCAGTGCCCGT 18 60 
402 FVDPSRRAPPGKCKYKPVPV 421 


18 61 GAGAACCAAGAGTAAGCGGATCCCGTTC AGACCACTCGCCAAGACAGTCTTGGACTTGCT 1920 

422 RTKSKRIPFRPLAKTVLDLL 441 

1921 AAAGGAACAATCAGAGAAAGCTGCTAGAAAATCTTCCAGTGCAACCTTAGCTTCTGGCCC 1980 

442 KEQSEKAARKSSSATLASGP 461 


1981 AGCCCAGGGTTTGTCTGAGAAAGGCTCCTCCAAGAAGCTGGCTTCTCCTACAAGCAGCAA 2040 
462 AQGLSEKGSSKKLAS PTSSK 481 


2 0 41 GAATACATTGCGAGGGCCTGGTACAAAGAAGAAAGCCAGAGTGGGGCCCCACGTCCGCCA 2100 
482 NTLRGPGTKKKARVGPHVRQ 501 
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2101 GGGC AAGAGGAGGAAGAGCCTGAAAAGCCACAGTGGCAGGATGAGGCCATCAGCAGAGCA 216 0 

502 GKRRK SLKSHSGRMRPSAEQ 521 

2161 GAAGCGAGCTGGC AGAAGTCTCCCTTGACCTATTGGTCAAGGTGGCCGACAGGGTGACGT 2 22 0 

522 KRAGRSLP* 530 

2221 GAGAGAGGAGAGCCACCTCATCAAATGAAAGTCACTGCTGAATAAAGACC TTAGAAGTCT 2 2 8 0 

2281 GGGAAGCCAGGGTAGAGGTGGGGCAGGGCGGTTTTCCTCTCCC TGGGAAATCTTGCTGTC 23 4 0 

2341 TACTGAATAAATAAATGCACCTTCTCTGTATGCAGTGCTTCTGTGGGAGACCATATCCCA 2 4 00 

2401 GATTGCTGGTGCAC CTGGGTTATGGTAAGCACTATCCATGAGC CTGCTTGGAATCACACT 2460 

24 61 GGATGTCTCCGTTTTGTCTTGTAAATGCCTACAACCTGAGGTAATAAATCAACATTTGCT 252 0 

2521 CAAACTGGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 2580 

2581 AAAAAAAAAAAAAAAAAAAAAAAAAAAAA 2609 
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I- _ HACftB64.aa 
PKPTPWSERVSVRFPGCRTGMHMISVSLRLVFCSFIPKA gi|1234787 
LKM LSF KLLL LAVALGF FEG gb|AAD31172 .1 


HRCaE64.aa 

GVLLG HPQCLDYGPP FKPLVHLEF gi|l234787 

DAKFGERS EGS GARRRRC LNGNPPKRLKRRDRRVMSQLEL gb|AAn31172.1 


110 
1 


120 
L. 


HAQAB64.aa 

CSE-YETFGCCDQDRDHVIAEKYWSIMDYFDLN gi|l234787 

LSGGE ILCGGFYPEVSCCLQSDSPGLG RLENKIFSAT gb|AAD31172 .1 


HACftB64.aa 

NYHICGGYIKDILCQECSPYAAHLYDAEDPHTPLR VI gi|l234787 

MSECSRLLEEI2CAPCSPHSQSLFYTPE-RDVLDGDLAL gb|AHB1172.1 


2 

134 
138 


HftCfiB64.aa 

PGLCFNYCSEFHLKCQNSITLLTEDKQIRESCDKGRDLFC gi|1234787 
PLLCKDYCKEFFYTCRGHI PGLLQTTA DEFC gb|AAD31172 .1 


dHACftB64.aa 
gij 1234787 

FYYARKDAGLCFPDFPRKQVRGPASNYLGQMEDYEKVGGI gb|AAD31172 .1 


HACfiB64.aa 
gi | 1234787 
LQlLg gb|AAD31172.1 



gi | 1234787 
LQS gb|AAD31172.1 




LB gi 1 1234787 
F lQ gb|AAD31172.1 


157 I 
355 I 
367 I 


| HAOAB64.aa 
3 DGRRAS gi |1234787 
1 LEE DTDMCM gb|MDG1172.1 
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mQAB64.aa 
oMBB gi [1234787 
hB gb|ARD31172.1 


I HftCftB64.aa 

a Dig I -Bv TjMMa|MilliMIHa gi|l234787 
T D I H I DJ L tHlBsHsnB kBE S S A R I0GB I H RdQe gb ARD31172a 




■ HRCRB64.aa 

S I V NgS V K P T QEJ pQg KTPTKFPTKVPPTPTKFPTKV gi|l234787 
Sgr? TvgpAQP gb|AMBU72.1 


674 P P T|3 T@ FPTKVPPIPTKVPSKVPPTPTQFPTKV P P T P tQv gijl234787 
604 gb|MD31172.1 


714 B 10 Vfls TPT1AHTK wlaltlJJ'J 8 r. p g if JcBIn n m ^ P t v m gi |i234787 

604 LTSDCgRL C R NEY Y^P TGKCCCSPGW gb|flflD31172.1 


i ptBl 


HflCKB64.aa 

754 PTfJ LPTKVPPITTKVSNKVLLTSPELPTKVPPTPTKLPTN gi|1234787 
630 EEDF gb|AftD31172.1 


794 APPTSILLSPTPIKLPTKISLTLTSVPIKNQLTSAKLLTT gi|l234787 
634 gb | AACQ 1172.1 
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HRCRB64.aa 

TLPISTKRATKLPSTSTSVPSNTSCILTHVQPKMLPTETR gijl234787 
CRIAKC E PACRHGGVCV gb|AAU31172.1 


504 
874 


__ - - IWtM HAOAB64.aa 

VPNKMPPKPTRI PTMSMYITKK PpIBkhIaKKVTD kBPtT gi|l234787 

"PHKCLCKKGYLGPQCEQVDHNvBrVT gb|ASD31172 .1 


_ ____ - - HACftB64.aa 
SPBTTKPPKPPKSKTSVVNQPKKKETKTGVNN K T K mWp k gi|1234787 

"AGILDQ I I DMTS YEF- - gb|AM>31172 .1 


530 
954 


HACAB64.aa 

AKEPKKEKKTIKVKQPVSHYFPPQKPKKQKTKKMQKEGNE gi|l2347S7 
LDLTSYIV gb|AftD31172.1 


530 


HA0AB64 .aa 


994 K s gi 1 1234787 

700 gb|AfiD31172.1 

Decoration 'Decoration #1' : Shade (with solid black) residues that match HA0fiB64.aa exactly. 
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FIG. 4A 


CACCAGCACCCCGCCCAGAGCAGTGCCGCTGCCCAAATCCTCGCAGGCAGCTCATCAACG 
CAATTGCAACTCCGGCTGGAGCCCCGGACCTGCAAGCCTGGGTGTCCGTGGGTCCGTCTG 
C CCAGCCATCTGC TGGTGGCAC C TCTCC CTCCTGC CGCCTCC CTC GGTGAACCC C ACCTT 


GCAGAAGTGCAGCTCGCCCGGAGCAGCCCAGGAGCTCAGCATGCGTCCCCCAGGCTTCAG 

M R P P G F R 

GAACTTCTTGCTGCrGGCGTCCTCCCTTCTCTTTGCTGGGTTGTCAGCTGTTCCTCAAAG 
NFLLLASSLLFAGLSA V P Q S 


CTTCTCGCCATCTCTGAGGAGCTGGCCGGGCGCCGCCTGCAGGCTGTCCCGGGCCGAGTC 
FSPSLRSWPGAACRLSRAES 


360 
47 


361 
48 


GGAGCGACGCTGCCGCGCACCTGGGCAG^CC^CGGGG^CCGCGCTGTGCCACGGCCGGGG 
ERRCRAP '" 


421 
68 


480 
87 


481 
88 


CC ^^ GTGAGTGCCATGAGTGGGTGTGC g^ A P CTA CGACGGGAGCACCTGTGCAGGCCA 
1JUUUJLB E C H E W V C 


540 
107 


541 
108 


600 
127 


601 
128 


GTA£CCAACTAACTGTGACTTGACAAAGAAGAAAAGTAACCAAA^ 
"** T N C D L T K K K S N Q 


660 
147 


661 
148 


720 
167 


721 
168 


AGA?GGAAGTGGACTTGTGTATGGTAAATT"' , ['GTGACTGTGACGA'rAGAGAATGCATAGA 
KFCECDDREC WMi 


780 
187 


781 

18 


840 
207 


841 
208 


C ^ G ^£ G - G ?^ GGCATGG AGATAAAT GTGAA 

E F O C D I T P W E S 


900 

227 
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FIG. 4B 


901 
228 


960 
247 


951 
248 


G D I H G D 


1020 
267 


1021 
268 


CACCTGTGAATGTGATGAGAGGGACTGTAGAGCTGTCTATGACCGATATTCTGATGACTT 
TCECDERDCRAVY 


1080 
287 


1081 
288 


1140 
307 


1141 
308 


G M^SSiS^^Si^^rS^ , CAGTCC TGCACGCTGTCAGC TGAGGAGAGCATCAGGAAGTG 
S C T L S AEESIRKi 


1200 
327 


1201 
328 


CCA©3GAAG^ 


1260 
347 


1261 
348 


""tcecddrr 


1320 
367 


1321 
368 


CTGTGAA^CC^^ 


1380 
3 87 


1381 
388 


OHPRKC MMT 


1440 
407 


1441 
408 


GGAAGAACAAAGCAAGAATCTGTGTGAAT^A^AGATGGCATATTGTGCTCGGGGAAGGG 
EEQSKNLCE 


1500 
427 


1501 
42 8 


1560 
447 


1561 
448 


CTGTGACTGTGATGACAGAGAC TGCGACAAACA TGATGGTCTCATTTGTACAGGGAATGG 
I D C D D R D 


1620 
467 


1621 
468 


1680 
487 


1681 
488 


CTGGCTTGGCTCAGAATATCCTTAACAATTACATGAGAGAGGTCTGGATTCTTATTTTTT 
W L G S E Y P * 


1740 
495 

FIG. 4C

1741 CTGGGCCATTAGAACATATAAATGCGAAGGAAACCATGTATATTCACCACTAGGACAGGT 1800

1801 TAAAAAGACCATTGTATGTTTTTCTATTTCTGAATTACGAATGAAATCCGAGTACCTATT 1860

1861 AGAAATGAGTTATGCAAATTTAGATGCAAATAACATTAGAAAAAAAAGATTCTTCCATAA 1920

1921 TTAACATAAGTGGTTCCTAACGAGAGCAATTTTTCCACCCAAAAGTCATTTGGCAACATC 1980

1981 TACAGACAATTTTGATTGTCACACTGGGTCGGGTAGGAAGGTATGCTGCAGACATTTGGT 2040

2041 GGGTAGAGGCCAGGGATGCTGCTGAGCATCCCGCAGTGTACAGGACAGCCCCCAAACAAG 2100

2101 GAATTATCCAGCCCCAAATGCCAATAGGGCTCAGACTGAGAAACATTGAGTTATATGGCT 2160

2161 ATTAGAAATCCACATTCTTACACAAGAAAGACCATATTAGAATCTAAGGAAAACATGCAT 2220

2221 ATTCACATTAATTAATCGATCAGATTTTTCCAGAATTCCGTATCAGTCACCATTTTAATA 2280

2281 TGGGGACAATGAAGACAAGCACACAGGAGGTAGAATATCAGAGTGGGGCTGGATCAAGGG 2340

2341 CAAAAACTGGTCATTAAGTCATCTGACATTAAATCATTTAGCCACTAAGTTATTTGTGTA 2400

2401 CTCTCACTTTAAACTCACCAAAGAAGATTCTCTTAAAGAAATTATGAAAAATGTACAATT 2460

2461 TAACATTTTAAATAAATAGTGACAGAAGTTGTTTAAAAA 2499
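
The sequence figures follow a fixed layout: each row carries the position of its first nucleotide, up to 60 nucleotides, and the position of its last nucleotide. The Python sketch below reproduces that layout so that row numbering can be checked. The function name format_numbered and its parameters are hypothetical, and the short input is a made-up example rather than a figure sequence.

```python
def format_numbered(seq: str, width: int = 60, first: int = 1) -> list:
    """Split seq into rows of `width` residues with start and end positions."""
    seq = "".join(seq.split()).upper()  # drop any whitespace inside the sequence
    rows = []
    for offset in range(0, len(seq), width):
        chunk = seq[offset:offset + width]
        start = first + offset
        rows.append(f"{start} {chunk} {start + len(chunk) - 1}")
    return rows


# Made-up 10-nucleotide example, numbered from position 1741 in rows of 4.
for row in format_numbered("AAAA CCCC GG", width=4, first=1741):
    print(row)
```

A row whose end position does not equal its start position plus the row length minus one indicates a transcription error in either the numbering or the sequence.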
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FIG. 7A 

1 GGCACGAGAGCGTGACCCAGCTGCGGCCGGCCAGCCATGGAGACTGGAGCGCTGCGGCGC 60
1 METGALRR 8

61 CCGCAACTTCTCCCGTTGCTGCTGCTGCTCTGCGGTGGGTGTCCCAGAGCAGGCGGCTGC 120
9 PQLLPLLLLLCGGCPRAGGC 28

121 AACGAGACAGGCATGTTGGAGAGGCTGCCCCTGTGTGGGAAGGCTTTCGCAGACATGATG 180
29 NETGMLERLPLCGKAFADMM 48

181 GGCAAGGTGGACGTCTGGAAGTGGTGCAACCTGTCCGAGTTCATCGTGTACTATGAGAGT 240
49 GKVDVWKWCNLSEFIVYYES 68

241 TTCACCAACTGCACCGAGATGGAGGCCAATGTCGTGGGCTGCTACTGGCCCAACCCCCTG 300
69 FTNCTEMEANVVGCYWPNPL 88

301 GCCCAGGGCTTCATCACCGGCATCCACAGGCAGTTCTTCTCCAACTGCACCGTGGACAGG 360
89 AQGFITGIHRQFFSNCTVDR 108

361 GTCCACTTGGAGGACCCCCCAGACGAGGTTCTCATCCCGCTGATCGTTATACCCGTCGTT 420
109 VHLEDPPDEVLIPLIVTPVV 128

421 CTGACTGTCGCCATGGCTGGCCTGGTGGTGTGGCGCAGCAAACGCACCGACACGCTGCTG 480
129 LTVAMAGLVVWRSKRTDTLL 148

481 TGAGGGTCCCGGTGAGATGGAGTGGGTCACACCTGGCAAGCTGGAAGAAAGTTCCCTGGG 540
149 * 149

541 GATGGGAGAGCGGGTGGGTGCTGCCAATCTCCAGCTACTGTGGCCACACCCCACCTGGTC 600

601 ATGGGCAGACCCCTCCCTTCCTGGGCTGACCTGCTCCCTCGAGGCCAGCCTGCTCCCTGG 660

661 CTGAGGCTCAGGCTATCCGCCCAAGCTCTTTGCTCATTCTAGGGCCAGTGGAGGAAAATG 720

721 TGATAAGGCCAGAGCTTGTGTGCTGGGCACAGAAATCACCTGCTGCATCCTGTGCTCCGC 780

781 AGCTGGGCCGGACCTCTGCCCGCAGGTTTCTATGCTGTTTCTTAGCACAGAATCCAGCCT 840

841 AGCCTTAGCCGCAGTCTAAGCCCTGCTTGGACTAGGACTCCTTGCTTGACCCCATCTCTG 900

901 GTTCCTGCCCTGGCTCCTGCACCAGCCCCAGCTCCTGCCTACATCCAGGCAGAAAGATAG 960

FIG. 7B

961 CAGGGGCTCTTGGAAGACGTTCCGTGCTGTGACCTCCGAGCCCTCCTGGTGGGAAGACAG 1020

1021 CTGGAAAGGCTGGGAGGAGAAGGGAGGGGTTGGGGGTTCCCAGGAGCCATGCGTGGCCTG 1080

1081 CAGAGTCCATTCCATCATGATGCTGTGCCCGCTATGGGCTGTGTCCATGACCAGAGGCTG 1140

1141 GAGTGGGGGTGTGTTAGAGCCCCTCACCGGGACTTGCTGTGCGGATGGGGCCTGGGCCTC 1200

1201 CTTCCTACAGGGGCTCCTCTGTGGGTGAGGGGCCCTCTGGAATGGCATCCCATGAGCTTG 1260

1261 TGGCCTCTATCTGCTACCATCTGTGTTTTATCTGAGTAAAGTTACCTTACTTCTGGAAAA 1320
1321 AAAAAAAAAAAAAAAAAAA 1339
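
The conceptual translation printed beneath the nucleotide rows of FIG. 7A can be checked with the standard genetic code. The Python sketch below translates the first row of FIG. 7A starting at its first ATG. The function name translate is hypothetical, and picking the first ATG in the string is a simplification made for this example, not a general rule for identifying a start codon.

```python
from itertools import product

# Standard genetic code in TCAG order, third codon position varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate from the first ATG until a stop codon or the end of the input."""
    dna = "".join(dna.split()).upper()
    start = dna.find("ATG")
    if start < 0:
        return ""
    protein = []
    for i in range(start, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unreadable codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Nucleotides 1-60 of FIG. 7A.
fig7a_row1 = "GGCACGAGAGCGTGACCCAGCTGCGGCCGGCCAGCCATGGAGACTGGAGCGCTGCGGCGC"
print(translate(fig7a_row1))  # METGALRR, residues 1-8 in FIG. 7A
```

Running the same check row by row is one way to flag positions where the printed residues and the nucleotide sequence disagree.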
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FIG. 9 
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FIG. 10A 


1 GGCACGAGGACAGCCTCCACCAGAGTCCCCACCTTTCTGGAAGCTGCAGGGCTCTCCATC 60 


61 CAGGATCCAGAAGCATTGAAGGGGACCAGCCGCTGAAGGGATTCTCAGTCCCATCTGACT 120 


121 CCCCATGAGGCTCCTGGCTTTCCTGAGTCTGCTGGCCTTGGTGCTGCAGGAGACAGGGAC 180

1 MRLLAFLSLLALVLQETGT 19 

181 AGCTTCTCTCCCAAGGAAGGAGAGGAAGAGGAGAGAGGAGCAGATGCCCAGGGAAGGCGA 240 

20 A S LPRKERKRREEQMPREGD 39 


241 TTCCTTTGAAGTTCTGCCTCTGCGGAATGATGTCCTGAACCCAGACAACTATGGTGAAGT 3 00 

40 SFEVLPLRNDVLNPDNYGEV 59

301 CATTGACCTGAGCAACTATGAGGAGCTCACAGATTATGGGGACCAACTCCCCGAGGTTAA 3 60 

60 IDLSNYEELTDYGDQLPEVK 79 


3 61 GGTGACTAGCCTCGCTCCTGCAACCAGCATCAGTCCCGCCAAGAGCACTACGGCTCCAGG 42 0 
80 VTSLAPATS I SPAKSTTAPG 99 


421 GACACCCTCGTCAAACCCCACGATGAC CAGACCTACTACAGC AGGGCTGCTACTGAGTTC 4 80 

100 TPSSNPTMTRPTTAGLLLSS 119 

481 CCAGCCCAACCATGGTCTGCCCACCTGCCTGGTCTGCGTGTGCCTCGGTTCCTCTGTGTA 540 

120 QPNHGLPTCLVCVCLGSSVY 139


541 TTGCGATGACATTGACCTAGAGGACATTCCTCCTCTTCCTCGGAGGACTGCCTACCTGTA 6 0 0 

140 CDDIDLEDIPPLPRRTAYLY 159 

601 TGCACGCTTCAACCGCATCAGCCGTATCAGGGCCGAAGACTTCAAAGGGCTGACAAAGTT 660 

160 ARFNRISRIRAEDFKGLTKL 179 


661 GAAGAGGATTGACCTCTCCAACAACCTCATTTCCTCCATCGATAATGATGCCTTCCGCCT 7 20 
180 KRIDLSNNLISSIDNDAFRL 199


721 GCTACATGCCCTCCAGGACCTCATCCTCCCAGAGAACCAGTTGGAAGCTCTGCCCGTGCT 780 

200 LHALQDLI LPENQLEALPVL 219 

781 GCCCAGTGGCATTGAGTTCCTGGATGTCCGCCTAAATCGGCTCCAGAGCTCGGGGATACA 840 

220 PSGIEFLDVRLNRLQSSGIQ 23 9 


841 GCC TGCAGCCTTCAGGGC AATGGAGAAGCTGCAGTTCCTTTACCTGTCAGACAACCTGC T 900 
240 PAAFRAMEKLQFLYLSDNLL 259 
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FIG. 10B 

901 GGATTCTATCCCGGGGCCTTTGCCCCCGAGCCTGCGCTCTGTACACCTGCAGAATAACCT 9 60 

260 DSIPGPLPPSLRSVHLQNNL 279 

961 GATAGAGACCATGCAGAGAGACGTCTTCTGTGACCCCGAGGAGCACAAACACACCCGCAG 102 0 

280 IETMQRDVFCDPEEHKHTRR 299 

1021 GCAGCTGGAAGACATCCGCCTGGATGGCAACCCCATCAACCTCAGCCTCTTCCCCAGCGC 10 8 0 

300 QLEDIRLDGNPINLSLFPSA 319 

1081 CTACTTCTGCCTGCCTCGGCTCCCCATCGGCCGCTTCACGTAGCTCGGAGCCCTTCCACT 1140 

320 YFCLPRLPIGRFT* 333 

1141 CCTCCCAGGTCATCTCTTGGACCAGCGGGGATCACATTCTCCAGCAGCCGCCATCTCACA 12 00 

12 01 CGCCTCCCTCCTGTGGCCGCCGGCAGCATGGACAAAGGTCTCCATGCAGGGGGAGGAGGC 12 60 

12 61 CTGCTTCTTTCCCCACAGCTCTCACGTCTCCCTTCTCCCTGCGGGTGACAAAGAAGCCCA 13 2 0 

13 21 AGGACCACCTCCTTCCTGCCTCATTGTAATAAAATTCCCCACACTGAAAAAAAAAAAAAA 1380 
1381 AAAAAAAAA 1389 
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: R K Ria EKOM_P_R_^ HaRftD44.aa 
} ^TSTWTTO gi 1 1620005 
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I P E HARAD44.aa 
I gi 1 1620005 



278 
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FE HARAD44.aa 
rJB gi 1 1620005 


318 
308 


HARAD44 .aa 
gi 1 1620005 


Decoration 'Decoration #1' : Shade (with solid black) residues that match the Consensus 
exactly. 
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FIG. 13A 

1 CCACGCGTCCGAGAGAACAGGCCTGTCTCAGGCAGGCCCTGCGCCTCCTATGCGGAGATG 60 

1 M 1 

61 CTACTGCCACTGCTGCTGTCCTCGCTGCTGGGCGGGTCCCAGGCTATGGATGGGAGATTC 120

2 LLPLLLSSLLGGSQAMDGRF 21

121 TGGATACGAGTGCAGGAGTCAGTGATGGTGCCGGAGGCCTGTGACATCTCTGTGCCCTGC 180 
22 WIRVQESVMVPEACDI SVPC 41 

181 TCTTTCTCCTACCCCCGACAAGACTGGACAGGGTCTACCCCAGCTTATGGCTACTGGTTC 240 

42SFSYPRQDWTGSTPAYGYWF 61 

241 AAAGCAGTGACTGAGAC AACCAAGGGTGCTCCTGTGGCCACAAAC C ACCAGAGTCGAGAG 3 00 

62KAVTETTKGAPVATNHQSRE 81 

301 GTGGAAATGAGCACCCGGGGCCGATTCCAGCTCACTGGGGATCCCGCCAAGGGGAACTGC 360

82 VEMSTRGRFQLTGDPAKGNC 101

3 61 TCCTTGGTGATCAGAGACGCGCAGATGCAGGATGAGTCACAGTACTTCTTTCGGGTGGAG 42 0 

102 SLVIRDAQMQDESQYFFRVE 121 

421 AGAGGAAGCTATGTGAGATATAATTTCATGAACGATGGGTTC TTTCTAAAAGTAACAGTG 48 0 

122 RGSYVRYNFMNDGFFLKVTV 141 

481 CTCAGCTTCACGCCCAGACCCCAGGACCACAACACCGACCTCACCTGCCATGTGGACTTC 540 

142 LSFTPRPQDHNTDLTCHVDF 161 

541 TCCAGAAAGGGTGTGAGCGCACAGAGGACCGTCCGACTCCGTGTGGCCTATGCCCCCAGA 60 0 

162SRKGVSAQRTVRLRVAYAPR 181 

601 GACCTTGTTATCAGCATTTCACGTGACAACACGCCAGCCCTGGAGCCCCAGCCCCAGGGA 660 

182 DLVISI SRDNTPALEPQPQG 201 

661 AATGTCCCATACCTGGAAGCCCAAAAAGGCCAGTTCCTGCGGCTCCTCTGTGCTGCTGAC 720 

202 NVPYLEAQKGQFLRLLCAAD 221 

7 21 AGCCAGCCCCCTGCCACACTGAGCTGGGTCCTGCAGAACAGAGTCCTCTCCTCGTCCCAT 780 

222 SQPPATLSWVLQNRVLS SSH 241 

781 CCCTGGGGCCCTAGACCCCTGGGGCTGGAGCTGCCCGGGGTGAAGGCTGGGGATTCAGGG 840 

242 PWGPRPLGLEL PGVKAGDSG 261 
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FIG. 13B 

841 CGCTACACCTGCCGAGCGGAGAACAGGCTTGGCTCCCAGCAGCGAGCCCTGGACCTCTCT 900 

262 RYTCRAENRLGSQQRALDLS 281 

901 GTGCAGTATCCTCCAGAGAACCTGAGAGTGATGGTTTCCCAAGCAAACAGGACAGTCCTG 960 

282 VQYPPENLRVMVSQANRTVL 3 01 

961 GAAAACCTTGGGAACGGCACGTCTCTCCCAGTACTGGAGGGCCAAAGCCTGTGCCTGGTC 102 0 

302 ENLGNGTSLPVLEGQSLCLV 321 

1021 TGTGTCACACACAGCAGCCCCCCAGCCAGGCTGAGCTGGACCCAGAGGGGACAGGTTCTG 1080 

322 CVTHSSPPARLSWTQRGQVL 341 

1081 AGCCCCTCCCAGCCCTCAGACCCCGGGGTCCTGGAGCTGCCTCGGGTTCAAGTGGAGCAC 1140

342 SPSQPSDPGVLELPRVQVEH 361

1141 GAAGGAGAGTTCACCTGCCACGCTCGGCACCCACTGGGCTCCCAGCACGTCTCTCTCAGC 1200
362 EGEFTCHARHPLGSQHVSLS 381

1201 CTCTCCGTGCACTACTCCCCGAAGCTGCTGGGCCCCTCCTGCTCCTGGGAGGCTGAGGGT 1260

382 LSVHYSPKLLGPSCSWEAEG 401

1261 CTGCACTGCAGCTGCTCCTCCCAGGCCAGCCCGGCCCCCTCTCTGCGCTGGTGGCTTGGG 1320

402 LHCSCSSQASPAPSLRWWLG 421


1321 GAGGAGCTGCTGGAGGGGAACAGCAGCCAGGACTCCTTCGAGGTCACCCCCAGCTCAGCC 13 80 

422 EELLEGNSSQDSFEVTPSSA 441 

1381 GGGCCCTGGGCCAACAGCTCCCTGAGCCTCCATGGAGGGCTCAGCTCCGGCCTCAGGCTC 1440 

442 GPWANSSLSLHGGLSSGLRL 461 

1441 CGCTGTGAGGCCTGGAACGTCCATGGGGCCCAGAGTGGATCCATCCTGCAGCTGCCAGAT 1500 

462 RCEAWNVHGAQSGSILQLPD 481 

1501 AAGAAGGGACTCATCTCAACGGCATTCTCCAACGGAGCGTTTCTGGGAATCGGCATCACG 1560

482 KKGLISTAFSNGAFLGIGIT 501

1561 GCTCTTCTTTTCCTCTGCCTGGCCCTGATCATCATGAAGATTCTACCGAAGAGACGGACT 1620 

502 ALLFLCLALIIMKILPKRRT 521

1621 CAGACAGAAACCCCGAGGCCCAGGTTCTCCCGGCACAGCACGATCCTGGATTACATCAAT 1680

522 QTETPRPRFSRHSTILDYIN 541 




WO 00/29435 24 / 59 PCT/US99/25031 

FIG. 13C 

1681 GTGGTCCCGACGGCTGGCCCCCTGGCTCAGAAGCCteAATCAGAAAGCCACACCAAACAGT 1740 

542 VVPTAGPLAQKRNQKATPNS 561 

1741 CCTCGGACCCCTCTTCCACCAGGTGCTCCCTCCCCAGAATCAAAGAAGAACCAGAAAAAG 1800
562 PRTPLPPGAPSPESKKNQKK 581

1801 CAGTATCAGTTGCCCAGTTTCCCAGAACCCAAATCATCCACTCAAGCCCCAGAATCCCAG 1860
582 QYQLPSFPEPKSSTQAPESQ 601

1861 GAGAGCCAAGAGGAGCTCCATTATGCCACGCTCAACTTCCCAGGCGTCAGACCCAGGCCT 1920
602 ESQEELHYATLNFPGVRPRP 621

1921 GAGGCCCGGATGCCCAAGGGCACCCAGGCGGATTATGCAGAAGTCAAGTTCCAATGAGGG 1980
622 EARMPKGTQADYAEVKFQ * 640 

1981 TCTCTTAGGCTTTAGGACTGGGACTTCGGCTAGGAGGAAGGTAGAGTAGAAGTTGAAGAT 2040 

2041 AACCGAGTGCAAGAGTTTCCTTCTCTCCCTCTCTCTCTCTCTTTCTCTCTCTCTCTCTCT 2100

2101 TTCTCTCTCTTTTTAAAAAAACTCTGGCCACGGCACAGTGGCTCACGCCTGTAATCCCAG 2160 

2161 CACTTTGGGAAGTTGAAGTTGGGCCAAATCCCCTGAGTCCGGATTCGAAAACAGCCTGGC 2220

2221 CAACTTGTGAAAACCCGTCTCTACTTAAAAATACCCAAATTACCTGGGCATTGTGGCAGG 2280

2281 GGCCTGTTTATCCTT 2295
638


HDPCL05.aa


363 -mma gi [88178

Decoration 'Decoration #1': Shade (with solid black) residues that match the Consensus
exactly.
FIG. 16A


1 AGACGTTCCCTCGCGGCCCTGGCACCTCCAACCCCAGATATGCTGCTGCTGCTGCTGCTG 60 
1 MLLLLLL 7

61 CCCCTGCTCTGGGGGAGGGAGAGGGTGGAAGGACAGAAGAGTAACCGGAAGGATTACTCG 120

8 PLLWGRERVEGQKSNRKDYS 27

121 CTGACGATGCAGAGTTCCGTGACCGTGCAAGAGGGCATGTGTGTCCATGTGCGCTGCTCC 180

28 LTMQSSVTVQEGMCVHVRCS 47

181 TTCTCCTACCCAGTGGACAGCCAGACTGACTCTGACCCAGTTCATGGCTACTGGTTCCGG 240

48 FSYPVDSQTDSDPVHGYWFR 67

241 GCAGGGAATGATATAAGCTGGAAGGCTCCAGTGGCCACAAACAACCCAGCTTGGGCAGTG 300

68 AGNDISWKAPVATNNPAWAV 87

301 CAGGAGGAAACTCGGGACCGATTCCACCTCCTTGGGGACCCACAGACCAAAAATTGCACC 360

88 QEETRDRFHLLGDPQTKNCT 107

361 CTGAGCATCAGAGATGCCAGAATGAGTGATGCGGGGAGATACTTCTTTCGTATGGAGAAA 420

108 LSIRDARMSDAGRYFFRMEK 127 

421 GGAAATATAAAATGGAATTATAAATATGACCAGCTCTCTGTGAACGTGACAGCCTTGACC 480 

128 GNIKWNYKYDQLSVNVTALT 147 

481 CACAGGCCCAACATCCTTATCCCCGGTACCCTGGAGTCTGGCTGCTTCCAGAATCTGACC 540 

148 HRPNILIPGTLESGCFQNLT 167

541 TGCTCTGTGCCCTGGGCCTGTGAGCAGGGGACGCCCCCTATGATCTCCTGGATGGGGACC 600 

168 CSVPWACEQGTPPMISWMGT 187 

601 TCTGTGTCCCCCCTGCACCCCTCCACCACCCGCTCCTCAGTGCTCACCCTCATCCCACAG 660

188 SVSPLHPSTTRSSVLTLIPQ 207

661 CCCCAGCACCACGGCACCAGCCTCACCTGTCAGGTGACCTTGCCTGGGGCCGGCGTGACC 720 

208 PQHHGTSLTCQVTLPGAGVT 227 

721 ACGAACAGGACCATCCAACTCAATGTGTCCTACCCTCCTCAGAACTTGACTGTGACTGTC 780 

228 TNRTIQLNVSYPPQNLTVTV 247 

781 TTCCAAGGAGAAGGCACAGCATCCACAGCTCTGGGGAACAGCTCATCTCTTTCAGTCCTA 840 

248 FQGEGTASTALGNSSSLSVL 267 
FIG. 16B

841 GAGGGCCAGTCTCTGCGCTTGGTCTGTGCTGTTGACAGCAATCCCCCTGCCAGGCTGAGC 900 

268 EGQSLRLVCAVDSNPPARLS 287 

901 TGGACCTGGAGGAGTCTGACCCTGTACCCCTCACAGCCCTCAAACCCTCTGGTACTGGAG 960 

288 WTWRSLTLYPSQPSNPLVLE 307 

961 CTGCAAGTGCACCTGGGGGATGAAGGGGAATTCACCTGTCGAGCTCAGAACTCTCTGGGT 1020

308 LQVHLGDEGEFTCRAQNSLG 327 

1021 TCCCAGCACGTTTCCCTGAACCTCTCCCTGCAACAGGAGTACACAGGCAAAATGAGGCCT 1080 

328 SQHVSLNLSLQQEYTGKMRP 347 

1081 GTATCAGGAGTGTTGCTGGGGGCGGTCGGGGGAGCTGGAGCCACAGCCCTGGTCTTCCTC 1140 

348 VSGVLLGAVGGAGATALVFL 367

1141 TCCTTCTGTGTCATCTTCATTGTAGTGAGGTCCTGCAGGAAGAAATCGGCAAGGCCAGCA 1200

368 SFCVIFIVVRSCRKKSARPA 387

1201 GCGGACGTGGGAGACATAGGCATGAAGGATGCAAACACCATCAGGGGCTCAGCCTCTCAG 1260

388 ADVGDIGMKDANTIRGSASQ 407

1261 GGTAACCTGACTGAGTCCTGGGCAGATGATAACCCCCGACACCATGGCCTGGCTGCCCAC 1320

408 GNLTESWADDNPRHHGLAAH 427

1321 TCCTCAGGGGAGGAAAGAGAGATCCAGTATGCACCCCTCAGCTTTCATAAGGGGGAGCCT 1380

428 SSGEEREIQYAPLSFHKGEP 447

1381 CAGGACCTATCAGGTCAAGAAGCCACCAACAATGAGTACTCAGAGATCAAGATCCCCAAG 1440
448 QDLSGQEATNNEYSEIKIPK 467

1441 TAAGAAAATGCAGAGGCTCGGGCTTGTTTGAGGGTTCACGACCCCTCCAGCAAAGGAGTC 1500
468 * 468

1501 TGAGGCTGATTCCAGTAGAATTAGCAGCCCTCAATGCTGTGCAACAAGACATCAGAACTT 1560

1561 ATTCCTCTTGTCTAACTGAAAATGCATGCCTGATGACCAAACTCTCCCTTTCCCCATCCA 1620
1621 ATCGGTCCACACTCCCCGCCCTGGCCTCTGGTACCCACCATTCTCCTCTGTACTTCTCTA 1680
1681 AGGATGACTACTTTAGATTCCGAATATAGTGAGATTGTAACGTGAAAAAAAAAAAAAAAA 1740 
1741 AAAAAAAA 1748 
1 GCCGCGCCGAGGAGGCTGCCGCTCTGGCTTGCCGCCCCCCGCCGCCGCTGCACACCGGAC 60

61 CCAGCCGCCGTGCCGCGGGCCATGGACCTGCCCAGGGGCCTGGTGGTGGCCTGGGCGCTC 120
1 MDLPRGLVVAWAL 13

121 AGCCTGTGGCCAGGGTTCACGGACACCTTCAACATGGACACCAGGAAGCCCCGGGTCATC 180
14 SLWPGFTDTFNMDTRKPRVI 33

181 CCTGGCTCCAGGACCGCCTTCTTTGGCTACACAGTGCAGCAGCACGACATCAGTGGCAAT 240

34 PGSRTAFFGYTVQQHDISGN 53

241 AAGTGGCTGGTCGTGGGCGCCCCACTGGAAACCAATGGCTACCAGAAGACGGGAGACGTG 300

54 KWLVVGAPLETNGYQKTGDV 73

301 TACAAGTGTCCAGTGATCCACGGGAACTGCACCAAACTCAACCTGGGAAGGGTCACCCTG 360

74 YKCPVIHGNCTKLNLGRVTL 93

361 TCCAACGTGTCCGAGCGGAAAGACAACATGCGCCTCGGCCTTAGTCTCGCCACCAACCCC 420

94 SNVSERKDNMRLGLSLATNP 113

421 AAGGACAACAGCTTCCTGGCCTGCAGCCCCCTCTGGTCTCATGAGTGTGGGAGCTCCTAC 480

114 KDNSFLACSPLWSHECGSSY 133

481 TACACCACAGGGATGTGTTCAAGAGTCAACTCCAACTTCAGGTTCTCCAAGACCGTGGCC 540

134 YTTGMCSRVNSNFRFSKTVA 153

541 CCAGCTCTCCAAAGGTGCCAGACCTACATGGACATCGTCATTGTCCTGGATGGCTCCAAC 600

154 PALQRCQTYMDIVIVLDGSN 173

601 AGCATCTACCCCTGGGTGGAGGTTCAGCACTTCCTCATCAACATCCTGAAAAAGTTTTAC 660

174 SIYPWVEVQHFLINILKKFY 193

661 ATTGGCCCAGGGCAGATCCAGGTTGGAGTTGTGCAGTATGGCGAAGATGTGGTGCATGAG 720
194 IGPGQIQVGVVQYGEDVVHE 213

721 TTTCACCTCAATGACTACAGGTCTGTAAAAGATGTGGTGGAAGCTGCCAGCCACATTGAG 780

214 FHLNDYRSVKDVVEAASHIE 233

781 CAGAGAGGAGGAACAGAGACCCGGACGGCATTTGGCATTGAATTTGCACGCTCAGAGGCT 840
234 QRGGTETRTAFGIEFARSEA 253
FIG. 19B

841 TTCCAGAAGGGTGGAAGGAAAGGAGCCAAGAAGGTGATGATTGTCATCACAGATGGGGAG 900 

254 FQKGGRKGAKKVMIVITDGE 273 

901 TCCCACGACAGCCCAGACCTGGAGAAGGTGATCCAGCAAAGCGAAAGAGACAACGTAACA 960 

274 SHDSPDLEKVIQQSERDNVT 293 

961 AGATATGCGGTGGCCGTCCTGGGCTACTACAACCGCAGGGGGATCAATCCAGAAACTTTT 1020

294 RYAVAVLGYYNRRGINPETF 313 

1021 CTAAATGAAATCAAATACATCGCCAGTGACCCTGATGACAAGCACTTCTTCAATGTCACT 1080

314 LNEIKYIASDPDDKHFFNVT 333

1081 GATGAGGCTGCCTTGAAGGACATTGTCGATGCCCTGGGGGACAGAATCTTCAGCCTGGAA 1140

334 DEAALKDIVDALGDRIFSLE 353

1141 GGCACCAACAAGAACGAGACCTCCTTTGGGCTGGAGATGTCACAGACGGGCTTTTCCTCG 1200

354 GTNKNETSFGLEMSQTGFSS 373

1201 CACGTGGTGGAGGATGGGGTTCTGCTGGGAGCCGTCGGTGCCTATGACTGGAATGGAGCT 1260

374 HVVEDGVLLGAVGAYDWNGA 393

1261 GTGCTAAAGGAGACGAGTGCCGGGAAGGTCATTCCTCTCCGCGAGTCCTACCTGAAAGAG 1320
394 VLKETSAGKVIPLRESYLKE 413

1321 TTCCCCGAGGAGCTCAAGAACCATGGTGCATACCTGGGGTACACAGTCACATCGGTCGTG 1380

414 FPEELKNHGAYLGYTVTSVV 433

1381 TCCTCCAGGCAGGGGCGAGTGTACGTGGCCGGAGCCCCCCGGTTCAACCACACGGGCAAG 1440
434 SSRQGRVYVAGAPRFNHTGK 453

1441 GTCATCCTGTTCACCATGCACAACAACCGGAGCCTCACCATCCACCAGGCTATGCGGGGC 1500 

454 VILFTMHNNRSLTIHQAMRG 473 

1501 CAGCAGATAGGCTCTTACTTTGGGAGTGAAATCACCTCGGTGGACATCGACGGCGACGGC 1560 

474 QQIGSYFGSEITSVDIDGDG 493 

1561 GTGACTGATGTCCTGCTGGTGGGCGCACCCATGTACTTCAACGAGGGCCGTGAGCGAGGC 1620

494 VTDVLLVGAPMYFNEGRERG 513 

1621 AAGGTGTACGTCTATGAGCTGAGACAGAACCGGTTTGTTTATAACGGAACGCTAAAGGAT 1680

514 KVYVYELRQNRFVYNGTLKD 533 
1681 TCACACAGTTACCAGAATGCCCGATTTGGGTCCTCCATTGCCTCAGTTCGAGACCTCAAC 1740

534 SHSYQNARFGSSIASVRDLN 553

1741 CAGGATTCCTACAATGACGTGGTGGTGGGAGCCCCCCTGGAGGACAACCACGCAGGAGCC 1800

554 QDSYNDVVVGAPLEDNHAGA 573

1801 ATCTACATCTTCCACGGCTTCCGAGGCAGCATCCTGAAGACACCTAAGCAGAGAATCACA 1860

574 IYIFHGFRGSILKTPKQRIT 593

1861 GCCTCAGAGCTGGCTACCGGCCTCCAGTATTTTGGCTGCAGCATCCACGGGCAATTGGAC 1920

594 ASELATGLQYFGCSIHGQLD 613

1921 CTCAATGAGGATGGGCTCATCGACCTGGCAGTGGGAGCCCTTGGCAACGCTGTGATTCTG 1980

614 LNEDGLIDLAVGALGNAVIL 633

1981 TGGTCCCGCCCAGTGGTTCAGATCAATGCCAGCCTCCACTTTGAGCCATCCAAGATCAAC 2040

634 WSRPVVQINASLHFEPSKIN 653

2041 ATCTTCCACAGAGACTGCAAGCGCAGTGGCAGGGATGCCACCTGCCTGGCCGCCTTCCTC 2100

654 IFHRDCKRSGRDATCLAAFL 673

2101 TGCTTCACGCCCATCTTCCTGGCACCCCATTTCCAAACAACAACTGTTGGCATCAGATAC 2160

674 CFTPIFLAPHFQTTTVGIRY 693

2161 AACGCCACCATGGATGAGAGGCGGTATACACCGAGGGCCCACCTGGACGAGGGCGGGGAC 2220

694 NATMDERRYTPRAHLDEGGD 713

2221 CGATTCACCAACAGAGCCGTACTGCTCTCCTCCGGCCAGGAGCTCTGTGAGCGGATCAAC 2280
714 RFTNRAVLLSSGQELCERIN 733

2281 TTCCATGTCCTGGACACTGCTGACTACGTGAAGCCAGTGACCTTCTCAGTCGAGTATTCC 2340

734 FHVLDTADYVKPVTFSVEYS 753

2341 CTGGAGGACCCTGACCATGGCCCCATGCTGGACGACGGCTGGCCCACCACTCTCAGAGTC 2400
754 LEDPDHGPMLDDGWPTTLRV 773

2401 TCGGTGCCCTTCTGGAACGGCTGCAATGAGGATGAGCACTGTGTCCCTGACCTTGTGTTG 2460
774 SVPFWNGCNEDEHCVPDLVL 793

2461 GATGCCCGGAGTGACCTGCCCACGGCCATGGAGTACTGCCAGAGGGTGCTGAGGAAGCCT 2520

794 DARSDLPTAMEYCQRVLRKP 813
2521 GCGCAGGACTGCTCCGCATACACGCTGTCCTTCGACACCACAGTCTTCATCATAGAGAGC 2580
814 AQDCSAYTLSFDTTVFIIES 833 


2581 ACACGCCAGCGAGTGGCGGTGGAGGCCACACTGGAGAACAGGGGCGAGAACGCCTACAGC 2640

834 TRQRVAVEATLENRGENAYS 853

2641 ACGGTCCTAAATATCTCGCAGTCAGCAAACCTGCAGTTTGCCAGCTTGATCCAGAAGGAG 2700

854 TVLNISQSANLQFASLIQKE 873

2701 GACTCAGACGGTAGCATTGAGTGTGTGAACGAGGAGAGGAGGCTCCAGAAGCAAGTCTGC 2760

874 DSDGSIECVNEERRLQKQVC 893

2761 AACGTCAGCTATCCCTTCTTCCGGGCCAAGGCCAAGGTGGCTTTCCGTCTTGATTTTGAG 2820

894 NVSYPFFRAKAKVAFRLDFE 913

2821 TTCAGCAAATCCATCTTCCTACACCACCTGGAGATCGAGCTCGCTGCAGGCAGTGACAGT 2880
914 FSKSIFLHHLEIELAAGSDS 933

2881 AATGAGCGGGACAGCACCAAGGAAGACAACGTGGCCCCCTTACGCTTCCACCTCAAATAC 2940

934 NERDSTKEDNVAPLRFHLKY 953

2941 GAGGCTGACGTCCTCTTCACCAGGAGCAGCAGCCTGAGCCACTACGAGGTCAAGCTCAAC 3000
954 EADVLFTRSSSLSHYEVKLN 973

3001 AGCTCGCTGGAGAGATACGATGGTATCGGGCCTCCCTTCAGCTGCATCTTCAGGATCCAG 3060
974 SSLERYDGIGPPFSCIFRIQ 993

3061 AACTTGGGCTTGTTCCCCATCCACGGGATTATGATGAAGATCACCATTCCCATCGCCACC 3120

994 NLGLFPIHGIMMKITIPIAT 1013

3121 AGGAGCGGCAACCGCCTACTGAAGCTGAGGGACTTCCTCACGGACGAGGTAGCGAACACG 3180
1014 RSGNRLLKLRDFLTDEVANT 1033

3181 TCCTGTAACATCTGGGGCAATAGCACTGAGTACCGGCCCACCCCAGTGGAGGAAGACTTG 3240
1034 SCNIWGNSTEYRPTPVEEDL 1053

3241 CGTCGTGCTCCACAGCTGAATCACAGCAACTCTGATGTCGTCTCCATCAACTGCAATATA 3300
1054 RRAPQLNHSNSDVVSINCNI 1073

3301 CGGCTGGTCCCCAACCAGGAAATCAATTTCCATCTACTGGGGAACCTGTGGTTGAGGTCC 3360
1074 RLVPNQEINFHLLGNLWLRS 1093
3361 CTAAAAGCACTCAAGTACAAATCCATGAAAATCATGGTCAACGCAGCCTTGCAGAGGCAG 3420
1094 LKALKYKSMKIMVNAALQRQ 1113

3421 TTCCACAGCCCCTTCATCTTCCGTGAGGAGGATCCCAGCCGCCAGATCGTGTTTGAGATC 3480

1114 FHSPFIFREEDPSRQIVFEI 1133

3481 TCCAAGCAAGAGGACTGGCAGGTCCCCATCTGGATCATTGTAGGCAGCACCCTGGGGGGC 3540

1134 SKQEDWQVPIWIIVGSTLGG 1153

3541 CTCCTACTGCTGGCCCTGCTGGTCCTGGCACTGTGGAAGCTCGGCTTCTTTAGAAGTGCC 3600

1154 LLLLALLVLALWKLGFFRSA 1173

3601 AGGCGCAGGAGGGAGCCTGGTCTGGACCCCACCCCCAAAGTGCTGGAGTGAGGCTCCAGA 3660

1174 RRRREPGLDPTPKVLE* 1190 

3661 GGAGACTTTGAGTTGATGGGGGCCAGGACACCAGTCCAGGTAGTGTTGAGACCCAGGCCT 3720

3721 GTGGCCCCACCGAGCTGGAGCGGAGAGGAAGCCAGCTGGCTTTGCACTTGACCTCATCTC 3780

3781 CCGAGCAATGGCGCCTGCTCCCTCCAGAATGGAACTCAAGCTGGTTTTAAGTGGAACTGC 3840

3841 CCTACTGGGAGACTGGGACACCTTTAACACAGACCCCTAGGGATTTAAAGGGACACCCCT 3900

3901 ACACACACCCAGGCCCACGCCAAGGCCTCCCTCAGGCTCTGTGGAGGGCATTTGCTGCCC 3960

3961 CAGCTACTAAGGTGCTAGGAATTCGTAATCATCCCCATCCTCCAGAGAAACCCAGGGAGG 4020

4021 AAGACTGTAAATACGAACCCAATCTGCACACTCCAGGCCTCTAGTTCCAGAAGGATCCAA 4080

4081 GACAAAACAGATCTGAATTCTGCCCTTTTCTCTCACCCATCCCACCCCTCCATTGGCTCC 4140

4141 CAAGTCACACCCACTCCCTTCCCCATAGATAGGCCCCTGGGGCTCCCGAAGAATGAACCC 4200

4201 AAGAGCAAGGGCTTGATGGTGACAGCTGCAAGCCAGGGATGAAGAAAGACTCTGAGATGT 4260

4261 GGAGACTGATGGCCAGGCAAGTGGGACCAGGATACTGGACGCTGTCCTGAGATGAGAGGT 4320

4321 AGCCGGGCTCTGCACCCACGTGCATTCACATTGACCGCAACTCACACATTCCCCCACCAG 4380
4381 CTGCAGCCCCTTGCTCTCAGCTGCCAACCCTCCCGGGTCACTTTTGTTCCCAGGTACCTC 4440

4441 ATGGGAAGCATGTGGATGACACAATCCCTGGGGCTGTGCATTCCCACGTCTTCTTGCTGC 4500

4501 AGCCTGCCCCTAGACATGGACGCACCGGCCTGGCTGCAGCTGGGCAGCAGGGGTAGGGGT 4560

4561 AGGGAGCCTCCCCTCCCTGTATCACCCCCTCCCTACACACACACACACACACACACACAC 4620 

4621 ACACTGCCTCCCATCCTTCCCTCATGCCCGCCAGTGCACAGGGAAGGGCTTGGCCAGCGC 4680 

4681 TGTTGAGGGGTCCCCTCTGGAATGCACTGAATAAAGCACGTGCAAGGACTCCCGGAGCCT 4740 

4741 GTGCAGCCTTGGTGGCAAATATCTCATCTGCCGGCCCCCAGGACAAGTGGTATGACCAGT 4800 

4801 GATAATGCCCCAAGGACAAGGGGCGTGCCTGGCGCCCAGTGGAGTAATTTATGCCTTAGT 4860

4861 CTTGTTTTGAGGTAGAAATGCAAGGGGGACACATGAAAGGCATCAGTCCCCCTGTGCATA 4920

4921 GTACGACCTTTACTGTCGTATTTTTGAAAAATTAAAAATACAGTGTTTAAAAACAAAAAA 4980

4981 AAAAAAAAAAAAAAA 4995 
1 GGCACGAGCTGTCATCCGTTTCCATGCCGTGAGGTCCATTCACAGAACACATCCATGGCT 60

61 CTCATGCTCAGTTTGGTTCTGAGTCTCCTCAAGCTGGGATCAGGGCAGTGGCAGGTGTTT 120

3 LMLSLVLSLLKLGSGQWQVF 22

121 GGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGGGAGGACGCAGCATTCTCCTGTTTCCTG 180

23 GPDKPVQALVGEDAAFSCFL 42

181 TCTCCTAAGACCAATGCAGAGGCCATGGAAGTGCGGTTCTTCAGGGGCCAGTTCTCTAGC 240

43 SPKTNAEAMEVRFFRGQFSS 62

241 GTGGTCCACCTCTACAGGGACGGGAAGGACCAGCCATTTATGCAGATGCCACAGTATCAA 300

63 VVHLYRDGKDQPFMQMPQYQ 82

301 GGCAGGACAAAACTGGTGAAGGATTCTATTGCGGAGGGGCGCATCTCTCTGAGGCTGGAA 360

83 GRTKLVKDSIAEGRISLRLE 102

361 AACATTACTGTGTTGGATGCTGGCCTCTATGGGTGCAGGATTAGTTCCCAGTCTTACTAC 420

103 NITVLDAGLYGCRISSQSYY 122

421 CAGAAGGCCATCTGGGAGCTACAGGTGTCAGCACTGGGCTCAGTTCCTCTCATTTCCATC 480

123 QKAIWELQVSALGSVPLISI 142

481 GCGGGATATGTTGATAGAGACATCCAGCTACTCTGTCAGTCCTCGGGCTGGTTCCCCCGG 540

143 AGYVDRDIQLLCQSSGWFPR 162

541 CCCACAGCGAAGTGGAAAGGTCCACAAGGACAGGATTTGTCCACAGACTCCAGGACAAAC 600

163 PTAKWKGPQGQDLSTDSRTN 182

601 AGAGACATGCATGGCCTGTTTGATGTGGAGATCTCTCTGACCGTCCAAGAGAACGCCGGG 660
183 RDMHGLFDVEISLTVQENAG 202

661 AGCATATCCTGTTCCATGCGGCATGCTCATCTGAGCCGAGAGGTGGAATCCAGGGTACAG 720

203 SISCSMRHAHLSREVESRVQ 222

721 ATAGGAGACTGGAGAAGAAAGCACGGACAGGCAGGTAAAAGAAAATATTCCTCTTCACAC 780

223 IGDWRRKHGQAGKRKYSSSH 242

781 ATTTATGACTCCTTTCCAAGTCTCTCGTTTATGGATTTTTATATCCTGAGGCCCGTGGGT 840
243 IYDSFPSLSFMDFYILRPVG 262
841 CCCTGCAGAGCCAAGCTTGTGATGGGAACTCTGAAATTGCAGATTCTGGGGGAGGTGCAT 900
263 PCRAKLVMGTLKLQILGEVH 282

901 TTTGTAGAGAAGCCCCATAGCCTTCTTCAGATCTCTGGAGGGTCCACAACACTCAAAAAG 960
283 FVEKPHSLLQISGGSTTLRK 302

961 GGTCCCAATCCTTGGTCTTTCCCTTCTCCCTGCGCCCTGTTTCCCACGTGAGCACGGAAC 1020
303 GPNPWSFPSPCALFPT* 319

1021 TGCCTGCTCTCTCTGCTTGCTTTCAGAATTGAGAGACGCCCGGAAACACGCAGGTACCAA 1080

1081 CGCCTGAGAGGGTAACAGTGGGCATGGAGTAGGAAGATGACCAGTGACAGATATGGAGCC 1140

1141 CATCCAGCTTGTAGACAGCAAATCTGTGATGCCCGAATCCACCCCAGGGTGCAGCTGCCT 1200

1201 CTAAATACACTTCTTGGCCCAGGACTTGGAGGGAAAAGCGTAGGGACTGGGTCAGCTAGG 1260

1261 AGGGGTCACAGGCAAGACGCCAGGGAACTGAGGGCATTAGTAGCTGGCTTCTAGGGGTCT 1320
1321 GTGCAAAGGGGAACGAAGTGAAGTTAGCAGGAACTGGTGGGTGGAAGGAAGCTGAATCCT 1380

1381 GGAGTCACTCAAGGTCTCACAAAGTCAAATAGAGGGCTTACGTGGGAGGGCAGTGGTAGG 1440
1441 GCTGGGTGAACATCTCATGGTTGAGCATCTCCAAGCATCAGTGAGGCACGGGGGCTGCCC 1500

1501 TGGAGAAGGTACATGGCTGGTGGGATAGTGGGACTGGCCGGATCCTACCCGGAGCCAGTC 1560
1561 TGCAGTGGGAGGGTCGACCTCTTGCTCCAGCCCAGATTTCGTCTTCAGTAACTCATGCTT 1620
1621 CCTCTCTCCCCCACCGCACCCCAGTGGAGGTGACTCTGGATCCAGAGACGGCTCACCCGA 1680

1681 AGCTCTGCGTTTCTGATCTGAAAACTGTAACCCATAGAAAAGCTCCTCAGGAGGTGCCTC 1740

1741 ACTCTGAGAAGAGATTTACAAGGAAGAGTGTGGTGGCTTCTCAGGGTTTCCAAGCAGGGA 1800

1801 AACATTACTGGGAGGTGGACGTGGGACAAAATGTAGGGTGGTATGTGGGAGTGTGTCGGG 1860
1861 ATGACGTAGACAGGGGGAAGAACAATGTGACTTTGTCTCCCAACAATGGGTATTGGGTCC 1920
1921 TCAGACTGACAACAGAACATTTGTATTTCACATTCAATCCCCATTTTATCAGCCTCCCCC 1980

1981 CCAGCACCCCTCCTACACGAGTAGGGGTCTTCCTGGACTATGAGGGTGGGACCATCTCCT 2040

2041 TCTTCAATACAAATGACCAGTCCCTTATTTATACCCTGCTGACATGTCAGTTTGAAGGCT 2100

2101 TGTTGAGACCCTATATCCAGCATGCGATGTATGACGAGGAAAAGGGGACTCCCATATTCA 2160

2161 TATGTCCAGTGTCCTGGGGATGAGACAGAGAAGACCCTGCTTAAAGGGCCCCACACCACA 2220

2221 GACCCAGACACAGCCAAGGGAGAGTGCTCCCGACAGGTGGCCCCAGCTTCCTCTCCGGAG 2280

2281 CCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGCTGAGGTTCTTCTGCCGT 2340

2341 GAGCCCTGCAGCAGCGGCAGTCACAGCTTCCAGATGAGGGGGGATTGGCCTGACCCTGTG 2400

2401 GGAGTCAGAAGCCATGGCTGCCCTGAAGTGGGGACGGAATAGACTCACATTAGGTTTAGT 2460

2461 TTGTGAAAACTCCATCCAGCTAAGCGATCTTGAACAAGTCACAACCTCCCAGGCTCCTCA 2520
2521 TTTGCTAGTCACGGACAGTGATTCCTGCCTCACAGGTGAAGATTAAAGAGACAACGAATG 2580

2581 TGAATCATGCTTGCAGGTTTGAGGGCCACAGTGTTTGCTAATGGATGTGTTTTTATGATT 2640
2641 ATACATTTTCCCCACCATAAAACTCTGTTTGCCTTAATTCCCACATTAATTTAACTTTTC 2700
2701 CTCCTATACCCAAATCCACCCATGGAATAGTTAATTGGAACACCTGCCTTTGTGAGGCTC 2760
2761 CAAAGAATAAAGAGGAGGTAGGATTTTTCACTGATTCTATAAGCCCAGCATTACCTGATA 2820
2821 CCAAAACCAGGCAAAGAAAACAGAAGAAGAGGAAGGAAAACTACAGGTCCATATCCCTCA 2880

2881 TTAACACAGACACAAAAATTCTAAATAAAATTTTAACAAATTAAACTAAACAATATATTT 2940
2941 AAAGATGATATATAACTACTCAGTGTGGTTTGTCCCACAAATGCAGAGTTGGTTTAATAT 3000

3001 TTAAATATCAACCAGTGTAATTCAGCACATTAATAAAGTAAAAAAAAAAAAAAAAAAAA 3059
PCT/US99/25031


1 GGCACGAGGGGGCAGGCATGGGAGCCGCGCGCTCTCTCCCGGCGCCCACACCTGTCTGAG 60 

61 CGGCGCAGCGAGCCGCGGCCCGGGCGGGCTGCTCGGCGCGGAACAGTGCTCGGCATGGCA 120

1 MA 2

121 GGGATTCCAGGGCTCCTCTTCCTTCTCTTCTTTCTGCTCTGTGCTGTTGGGCAAGTGAGC 180

3 GIPGLLFLLFFLLCAVGQVS 22

181 CCTTACAGTGCCCCCTGGAAACCCACTTGGCCTGCATACCGCCTCCCTGTCGTCTTGCCC 240

23 PYSAPWKPTWPAYRLPVVLP 42

241 CAGTCTACCCTCAATTTAGCCAAGCCAGACTTTGGAGCCGAAGCCAAATTAGAAGTATCT 300
43 QSTLNLAKPDFGAEAKLEVS 62

301 TCTTCATGTGGACCCCAGTGTCATAAGGGAACTCCACTGCCCACTTACGAAGAGGCCAAG 360
63 SSCGPQCHKGTPLPTYEEAK 82

361 CAATATCTGTCTTATGAAACGCTCTATGCCAATGGCAGCCGCACAGAGACGCAGGTGGGC 420

83 QYLSYETLYANGSRTETQVG 102

421 ATCTACATCCTCAGCAGTAGTGGAGATGGGGCCCAACACCGAGACTCAGGGTCTTCAGGA 480

103 IYILSSSGDGAQHRDSGSSG 122

481 AAGTCTCGAAGGAAGCGGCAGATTTATGGCTATGACAGCAGGTTCAGCATTTTTGGGAAG 540

123 KSRRKRQIYGYDSRFSIFGK 142

541 GACTTCCTGCTCAACTACCCTTTCTCAACATCAGTGAAGTTATCCACGGGCTGCACCGGC 600

143 DFLLNYPFSTSVKLSTGCTG 162

601 ACCCTGGTGGCAGAGAAGCATGTCCTCACAGCTGCCCACTGCATACACGATGGAAAAACC 660

163 TLVAEKHVLTAAHCIHDGKT 182

661 TATGTGAAAGGAACCCAGAAGCTTCGAGTGGGCTTCCTAAAGCCCAAGTTTAAAGATGGT 720

183 YVKGTQKLRVGFLKPKFKDG 202

721 GGTCGAGGGGCCAACGACTCCACTTCAGCCATGCCCGAGCAGATGAAATTTCAGTGGATC 780

203 GRGANDSTSAMPEQMKFQWI 222

781 CGGGTGAAACGCACCCATGTGCCCAAGGGTTGGATCAAGGGCAATGCCAATGACATCGGC 840

223 RVKRTHVPKGWIKGNANDIG 242
841 ATGGATTATGATTATGCCCTCCTGGAACTCAAAAAGCCCCACAAGAGAAAATTTATGAAG 900

243 MDYDYALLELKKPHKRKFMK 262 

901 ATTGGGGTGAGCCCTCCTGCTAAGCAGCTGCCAGGGGGCAGAATTCACTTCTCTGGTTAT 960 

263 IGVSPPAKQLPGGRIHFSGY 282 

961 GACAATGACCGACCAGGCAATTTGGTGTATCGCTTCTGTGACGTCAAAGACGAGACCTAT 1020 

283 DNDRPGNLVYRFCDVKDETY 302 

1021 GACTTGCTCTACCAACAATGCGATTCCCAGCCAGGGGCCAGCGGGTCTGGGGTCTATGTG 1080

303 DLLYQQCDSQPGASGSGVYV 322

1081 AGGATGTGGAAGAGACAACACCAGAAGTGGGAGCGGAAAATTATTGGCATGATTTCAGGG 1140

323 RMWKRQHQKWERKIIGMISG 342

1141 CACCAGTGGGTGGACATGGATGGTTCCCCACAGGAATTCACACGTGGCTGTTCAGAGATC 1200

343 HQWVDMDGSPQEFTRGCSEI 362

1201 ACTCCTCTCCAATATATCCCAGATATATCTATTGGAGTATAAGGAAACTACCTGGATTGT 1260

363 TPLQYIPDISIGV* 376

1261 AGGGAGGGGTGACACAGTGTCCCTCCTGCAGCAACTAAGGTCGTCATGTTCTTATTTTAG 1320
1321 GAGAGGCCAAATTGTTTTTTGTCATTGCCGTGCACACGTGTGTGTGTGTGTGTGTGTGTG 1380

1381 TGTAAGGTGTCTTATAATCTTTTACCTATTTCTTACAATTGCAAGATGACTGGCTTTACT 1440
1441 ATTTGAAAACTGGTTTGTGTATCATATCATATATCATTTAAGCAGTTTGAAGGCATACTT 1500
1501 TTGCATAGAAATAAAAAAAATACTGATTTGGGGCAATGAGGAATATTTGACAATTAAGTT 1560
1561 AATCTTCACGTTTTTGCAAACTTTGATTTTTATTTCATCTGAACTTGTTTCAAAGATTTA 1620 
1621 TATTAAATATTTGGCATACAAAAAAAAAAAAAAAAAACGGGGGGGCCCGTACCCAATTCG 1680 
1681 CCCTATAGTGAGGCGATAC 1699 
Decoration 'Decoration #1': Shade (with solid black) residues that match the Consensus
1 GAATTCGGCAGAGCAGGTGCCCGACATGGCGAGTGTAGTGCTGCCGAGCGGATCCCAGTG 60

1 MASVVLPSGSQC 12

61 TGCGGCGGCAGCGGCGGCGGCGGCGCCTCCCGGGCTCCGGCTCCGGCTTCTGCTGTTGCT 120

13 AAAAAAAAPPGLRLRLLLLL 32

121 CTTCTCCGCCGCGGCACTGATCCCCACAGGTGATGGGCAGAATCTGTTTACGAAAGACGT 180

33 FSAAALIPTGDGQNLFTKDV 52

181 GACAGTGATCGAGGGAGAGGTTGCGACCATCAGTTGCCAAGTCAATAAGAGTGACGACTC 240

53 TVIEGEVATISCQVNKSDDS 72

241 TGTGATTCAGCTACTGAATCCCAACAGGCAGACCATTTATTTCAGGGACTTCAGGCCTTT 300
73 VIQLLNPNRQTIYFRDFRPL 92

301 GAAGGACAGCAGGTTTCAGTTGCTGAATTTTTCTAGCAGTGAACTCAAAGTATCATTGAC 360
93 KDSRFQLLNFSSSELKVSLT 112

361 AAACGTCTCAATTTCTGATGAAGGAAGATACTTTTGCCAGCTCTATACCGATCCCCCACA 420

113 NVSISDEGRYFCQLYTDPPQ 132

421 GGAAAGTTACACCACCATCACAGTCCTGGTCCCACCACGTAATCTGATGATCGATATCCA 480

133 ESYTTITVLVPPRNLMIDIQ 152

481 GAAAGACACTGCGGTGGAAGGTGAGGAGATTGAAGTCAACTGCACTGCTATGGCCAGCAA 540

153 KDTAVEGEEIEVNCTAMASK 172

541 GCCAGCCACGACTATCAGGTGGTTCAAAGGGAACACAGAGCTAAAAGGCAAATCGGAGGT 600

173 PATTIRWFKGNTELKGKSEV 192

601 GGAAGAGTGGTCAGACATGTACACTGTGACCAGTCAGCTGATGCTGAAGGTGCACAAGGA 660

193 EEWSDMYTVTSQLMLKVHKE 212

661 GGACGATGGGGTCCCAGTGATCTGCCAGGTGGAGCACCCTGCGGTCACTGGAAACCTGCA 720

213 DDGVPVICQVEHPAVTGNLQ 232

721 GACCCAGCGGTATCTAGAAGTACAGTATAAGCCTCAAGTGCACATTCAGATGACTTATCC 780

233 TQRYLEVQYKPQVHIQMTYP 252

781 TCTACAAGGCTTAACCCGGGAAGGGGACGCGCTTGAGTTAACATGTGAAGCCATCGGGAA 840

253 LQGLTREGDALELTCEAIGK 272
841 GCCCCAGCCTGTGATGGTAACTTGGGTGAGAGTCGATGATGAAATGCCTCAACACGCCGT 900

273 PQPVMVTWVKVDDEMPQHAV 292

901 ACTGTCTGGGCCCAACCTGTTCATCAATAACCTAAACAAAACAGATAATGGTACATACCG 960

293 LSGPNLFINNLNKTDNGTYR 312

961 CTGTGAAGCTTCAAACATAGTGGGGAAAGCTCACTCGGATTATATGCTGTATGTATACGA 1020

313 CEASNIVGKAHSDYMLYVYD 332

1021 TCCCCCCACAACTATCCCTCCTCCCACAACAACCACCACCACCACCACCACCACCACCAC 1080

333 PPTTIPPPTTTTTTTTTTTT 352

1081 CACCATCCTTACCATCATCACAGATTCCCGAGCCAGGTGAAGAAGGCTCGATCAGGGCAG 1140

353 TILTIITDSRAR* 365

1141 TGGATCATGCCGTGATCGGTGGCGTCGTGGCGGTGGTGGTGTTCGCCATGCTGTGCTTGC 1200

1201 TCATCATTCTGGGGCGCTATTTTGCCAGACATAAAGGTACATACTTCACTCATGAAGCCA 1260

1261 AAGGAGCCGATGACGCAGCAGACGCAGACACAGCTATAATCAATGCAGAAGGAGGACAGA 1320
1321 ACAACTCCGAAGAAAAGAAAGAGTACTTCATCTAGATCAGCCTTTTTGTTTCAATGAGGT 1380

1381 GTCCAACTGGCCCTATTTAGATGATAAAGAGACAGTGATATTGGAACTTGCGAGAAATTC 1440
1441 GTGTGTTTTTTTATGAATGGGTGGAAAGGTGTGAGACTGGGAAGGCTTGGGATTTGCTGT 1500 
1501 GTAAAAAAAAAAAAAAAAAA 1520 
1 TTTTTTTTTTTGAGACGGAGTCTCGCTCTTTCACCCAGGCCAGACTGAAGTGGCGCAGTC 60

61 TCGGATCACTGAAAAGCTCAGCCTCACGGGATCACGACCATTCTCCTGCCTCAGCCTCCC 120

121 GAGTGGCTGGGACTAAAGGCGCCCGCCACCGACGCCCGGACTAATTTTTTGTATTTATAG 180

181 TAGAGACGGGGTTTCACCGTGTTAGCCAAGATGGTCTCGATCTCTTGAACTCGTGATCCG 240

241 CCCGCGTCAGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACAGCGCCCGGCCTGTA 300

301 AAGATTTCTTGAGCAAATGAATGAGTAAATGAAAGGAGTGCTCAAATTTCTTTTATTCTA 360

361 AAAAATGTTCCCCTTTTTTAGAAAATGCTCTGTAGCTTTTGTAGGTCTTTCCTGCACTCA 420
421 AACATCCACTCCTTACCCTTTCCAATCTCCCTTTTCTTCTCAACCCATAGAATGTACTTC 480

481 CATGTCTACCATTCCACCAGAACTACTCTAACCAAGGCTACCCAGGATATCCTTATTTCT 540

541 TTCAACTTCTTTGACATGCTCAGTCGCATTTGGCACTGTTAATGTCTTCCTCCTCTTTGA 600

1 MSSSSLK 7

601 AACACCTCCTTTGCATGGCACTATCCTGGTTTTCTTCCTTCATTTCAGGAGAAACTTCAT 660

8 HLLCMALSWFSSFISGETSF 27

661 TCTCCTTACTGAATTCTTTCTTCCTCCCCTATCCATCATCTAGATGTTGTTGTTTCTCAG 720

28 SLLNSFFLPYPSSRCCCFSV 47

721 TGCAGTGTTCAATCCTAGACCCCTTTTCATGTAACTCAATGCGTTTTCCTTGGGAGAATT 780 

48 QCSILDPFSCNSMRFPWEN* 67 

781 AATTCCCTTCCCTGGTGTCACTNTGCC 807 
1 GGCACGAGCCGGACCCTGCCGCCCTGCCACTATGTCCCGCCGCTCTATGCTGCTTGCCTG 60

1 MSRRSMLLAW 10

61 GGCTCTCCCCAGCCTCCTTCGACTCGGAGCGGCTCAGGAGACAGAAGACCCGGCCTGCTG 120

11 ALPSLLRLGAAQETEDPACC 30

121 CAGCCCCATAGTGCCCCGGAACGAGTGGAAGGCCCTGGCATCAGAGTGCGCCCAGCACCT 180

31 SPIVPRNEWKALASECAQHL 50

181 GAGCCTGCCCTTACGCTATGTGGTGGTATCGCACACGGCGGGCAGCAGCTGCAACACCCC 240

51 SLPLRYVVVSHTAGSSCNTP 70

241 CGCCTCGTGCCAGCAGCAGGCCCGGAATGTGCAGCACTACCACATGAAGACACTGGGCTG 300

71 ASCQQQARNVQHYHMKTLGW 90

301 GTGCGACGTGGGCTACAACTTCCTGATTGGAGAAGACGGGCTCGTATACGAGGGCCGTGG 360

91 CDVGYNFLIGEDGLVYEGRG 110

361 CTGGAACTTCACGGGTGCCCACTCAGGTCACTTATGGAACCCCATGTCCATTGGCATCAG 420

111 WNFTGAHSGHLWNPMSIGIS 130

421 CTTCATGGGCAACTACATGGATCGGGTGCCCACACCCCAGGCCATCCGGGCAGCCCAGGG 480

131 FMGNYMDRVPTPQAIRAAQG 150

481 TCTACTGGCCTGCGGTGTGGCTCAGGGAGCCCTGAGGTCCAACTATGTGCTCAAAGGACA 540

151 LLACGVAQGALRSNYVLKGH 170

541 CCGGGATGTGCAGCGTACACTCTCTCCAGGCAACCAGCTCTACCACCTCATCCAGAATTG 600

171 RDVQRTLSPGNQLYHLIQNW 190

601 GCCACACTACCGCTCCCCCTGAGGCCCTGCTGATCCGCACCCCATTCCTCCCCTCCCATG 660

191 PHYRSP* 197

661 GCCAAAAACCCCACTGTCTCCTTCTCCAATAAAGATGTAGCTCAAAAAAAAAAAAAAAAA 720 


721 AAAAAA 726 
<110> Human Genome Sciences, Inc.

<120> 12 Human Secreted Proteins 

<130> PF489 

<140> Unassigned
<141> 1999-10-26 

<150> 60/105,971 
<151> 1998-10-28 

<160> 147 

<170> PatentIn Ver. 2.0


<210> 1 

<211> 733 

<212> DNA 

<213> Homo sapiens 

<400> 1 

gggatccgga gcccaaatct tctgacaaaa ctcacacatg cccaccgtgc ccagcacctg 60

aattcgaggg tgcaccgtca gtcttcctct tccccccaaa acccaaggac accctcatga 120

tctcccggac tcctgaggtc acatgcgtgg tggtggacgt aagccacgaa gaccctgagg 180

tcaagttcaa ctggtacgtg gacggcgtgg aggtgcataa tgccaagaca aagccgcggg 240

aggagcagta caacagcacg taccgtgtgg tcagcgtcct caccgtcctg caccaggact 300

ggctgaatgg caaggagtac aagtgcaagg tctccaacaa agccctccca acccccatcg 360

agaaaaccat ctccaaagcc aaagggcagc cccgagaacc acaggtgtac accctgcccc 420

catcccggga tgagctgacc aagaaccagg tcagcctgac ctgcctggtc aaaggcttct 480

atccaagcga catcgccgtg gagtgggaga gcaatgggca gccggagaac aactacaaga 540

ccacgcctcc cgtgctggac tccgacggct ccttcttcct ctacagcaag ctcaccgtgg 600

acaagagcag gtggcagcag gggaacgtct tctcatgctc cgtgatgcat gaggctctgc 660

acaaccacta cacgcagaag agcctctccc tgtctccggg taaatgagtg cgacggccgc 720

gactctagag gat 733


<210> 2

<211> 5

<212> PRT

<213> Homo sapiens

<220>

<221> Site

<222> (3)

<223> Xaa equals any of the twenty naturally occurring L-amino acids

<400> 2

Trp Ser Xaa Trp Ser
1 5

<210> 3

<211> 86

<212> DNA

<213> Homo sapiens

<400> 3 

gcgcctcgag atttccccga aatctagatt tccccgaaat gatttccccg aaatgatttc 6 0 

cccgaaatat ctgccatctc aattag 86

<210> 4 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 4 

gcggcaagct ttttgcaaag cctaggc 27 

<210> 5 

<211> 271 

<212> DNA 

<213> Homo sapiens 


<400> 5 

ctcgagattt ccccgaaatc tagatttccc cgaaatgatt tccccgaaat gatttccccg 60 

aaatatctgc catctcaatt agtcagcaac catagtcccg cccctaactc cgcccatccc 120 

gcccctaact ccgcccagtt ccgcccattc tccgccccat ggctgactaa ttttttttat 180 

ttatgcagag gccgaggccg cctcggcctc tgagctattc cagaagtagt gaggaggctt 240

ttttggaggc ctaggctttt gcaaaaagct t 271 


<210> 6 

<211> 32 

<212> DNA 

<213> Homo sapiens 

<400> 6 

gcgctcgagg gatgacagcg atagaacccc gg 

<210> 7 

<211> 31 

<212> DNA 

<213> Homo sapiens

<400> 7 

gcgaagcttc gcgactcccc ggatccgcct c 

<210> 8 

<211> 12 

<212> DNA 

<213> Homo sapiens 

<400> 8 
ggggactttc cc 

<210> 9 

<211> 73 

<212> DNA 

<213> Homo sapiens 
<400> 9

gcggcctcga ggggactttc ccggggactt tccggggact ttccgggact ttccatcctg 60 
ccatctcaat tag 73 


<210> 10 

<211> 256 

<212> DNA 

<213> Homo sapiens 

<400> 10 

ctcgagggga ctttcccggg gactttccgg ggactttccg ggactttcca tctgccatct 60

caattagtca gcaaccatag tcccgcccct aactccgccc atcccgcccc taactccgcc 120

cagttccgcc cattctccgc cccatggctg actaattttt tttatttatg cagaggccga 180

ggccgcctcg gcctctgagc tattccagaa gtagtgagga ggcttttttg gaggcctagg 240

cttttgcaaa aagctt 256


<210> 11

<211> 2609

<212> DNA

<213> Homo sapiens


<400> 11

gcgtccgggc caaaatgctg agaacgtcca ctcctaatct gtgtggtggt ctgcattgcc 60

gggccccctg gctctcttct ggcattctct gcctctgcct catattcttg ttaggccagg 120

tgggcttgct gcagggacac ccccagtgcc tggattacgg gccccctttc cagccccctc 180

tgcaccttga gttttgctct gactatgagt ccttcggctg ctgtgatcag cacaaggacc 240

gccgcatcgc tgcccggtac tgggacatca tggaatattt tgatctgaag agacatgagc 300

tgtgtggaga ttacattaaa gacatccttt gccaggagtg ctcgccctac gcagcccact 360

ctacgacgcc gaaaacaccc agacgcctct ccggaatctc ccgggcctct gctctgatta 420

ctgctctgcc ttccattcta actgtcactc agccatttcc ctgctgacca atgaccgcgg 480

cctccaggag tctcatggaa gggacggtac ccgcttctgc cacctcctgg accttcctga 540

caaggactat tgcttcccta atgtcctgag gaacgactat ctcaaccgcc acctgggcat 600

ggtggcccaa gatcctcagg gctgcctgca gctctgcctg agcgaggtgg ccaacgggct 660

gaggaacccc gtctccatgg tccatgctgg ggacggcacc catcgcttct ttgttgccga 720

gcaggtagga gtggtgtggg tctacctccc tgatgggagt cgcctggagc aacccttcct 780

ggacctcaag aacatcgtgt tgaccacccc atggatcggg gatgagagag gcttcttggg 840

gttggctttt caccccaaat tccgccacaa tcgcaagttc tatatttatt attcgtgcct 900

ggacaagaag aaggtagaaa agatccgaat tagtgagatg aaggtttctc gggctgatcc 960

taacaaagct gacctgaaat cagagagggt catcttggag attgaagaac cagcctcaaa 1020

ccataatggc ggacaacttc tttttggcct ggatggctat atgtacatat tcactgggga 1080

cgggggacag gctggagatc cctttggcct gtttggaaat gctcagaaca aaagttccct 1140

gctgggaaaa gttttaagga tcgatgtgaa cagggcaggc tcacatggca agcggtaccg 1200

agtcccctcg gacaatccat ttgtttctga gccaggggcc caccccgcca tctatgccta 1260

tgggatcagg aacatgtggc gttgtgctgt ggaccgaggg gaccccatca cgcgccaggg 1320

ccgaggccgg atattctgtg gggacgtggg ccagaacagg tttgaagagg ttgacctcat 1380

tttgaaaggt ggaaactatg gctggagagc aaaggaaggg tttgcatgtt atgacaaaaa 1440

actttgtcac aatgcctctt tggatgatgt tctgccaatc tatgcttatg gccatgcagt 1500

ggggaagtca gtcactggag gttatgtcta tcgtggttgt gaatccccaa atctcaatgg 1560

cctgtatatc tttggagact tcatgagtgg tcgacttatg gctttgcagg aagatagaaa 1620

aaacaagaaa tggaagaagc aggatctttg cctgggcagc accacgtcct gtgccttccc 1680

agggctgatc agcacccata gcaagttcat catctccttt gctgaagatg aagcagggga 1740

gctgtatttc ctggcgacct cttacccaag tgcctatgca ccacgtggat ctatttacaa 1800

gtttgttgac ccctcaaggc gagcaccccc aggcaagtgc aaatacaagc cagtgcccgt 1860

gagaaccaag agtaagcgga tcccgttcag accactcgcc aagacagtct tggacttgct 1920

aaaggaacaa tcagagaaag ctgctagaaa atcttccagt gcaaccttag cttctggccc 1980
agcccagggt ttgtctgaga aaggctcctc caagaagctg gcttctccta caagcagcaa 2040

gaatacattg cgagggcctg gtacaaagaa gaaagccaga gtggggcccc acgtccgcca 2100

gggcaagagg aggaagagcc tgaaaagcca cagtggcagg atgaggccat cagcagagca 2160

gaagcgagct ggcagaagtc tcccttgacc tattggtcaa ggtggccgac agggtgacgt 2220

gagagaggag agccacctca tcaaatgaaa gtcactgctg aataaagacc ttagaagtct 2280

gggaagccag ggtagaggtg gggcagggcg gttttcctct ccctgggaaa tcttgctgtc 2340

tactgaataa ataaatgcac cttctctgta tgcagtgctt ctgtgggaga ccatatccca 2400

gattgctggt gcacctgggt tatggtaagc actatccatg agcctgcttg gaatcacact 2460

ggatgtctcc gttttgtctt gtaaatgcct acaacctgag gtaataaatc aacatttgct 2520

caaactggca aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2580

aaaaaaaaaa aaaaaaaaaa aaaaaaaaa 2609


<210> 12 

<211> 2499 

<212> DNA 

<213> Homo sapiens 


<400> 12 

caccagcacc ccgcccagag cagtgccgct gcccaaatcc tcgcaggcag ctcatcaacg 60

caattgcaac tccggctgga gccccggacc tgcaagcctg ggtgtccgtg ggtccgtctg 120

cccagccatc tgctggtggc acctctccct cctgccgcct ccctcggtga accccacctt 180

gcagaagtgc agctcgcccg gagcagccca ggagctcagc atgcgtcccc caggcttcag 240

gaacttcttg ctgctggcgt cctcccttct ctttgctggg ttgtcagctg ttcctcaaag 300

cttctcgcca tctctgagga gctggccggg cgccgcctgc aggctgtccc gggccgagtc 360

ggagcgacgc tgccgcgcac ctgggcagcc cccgggggcc gcgctgtgcc acggccgggg 420

ccgctgcgac tgcggcgtct gcatctgcca cgtgactgag ccgggcatgt tcttcgggcc 480 

cctgtgtgag tgccatgagt gggtgtgcga gacctacgac gggagcacct gtgcaggcca 540 

tggtaagtgt gactgtggca agtgcaagtg tgaccaggga tggtatgggg atgcttgcca 600 

gtacccaact aactgtgact tgacaaagaa gaaaagtaac caaatgtgca agaattcaca 660 

agacatcatc tgctctaatg caggtacatg tcactgtggc aggtgtaagt gtgataattc 720

agatggaagt ggacttgtgt atggtaaatt ttgtgagtgt gacgatagag aatgcataga 780 

cgatgaaaca gaagaaatat gtggaggcca tgggaagtgt tactgtggaa actgctactg 840 

caaggctggt tggcatggag ataaatgtga attccagtgc gatatcaccc cctgggaaag 900 

caagcgaaga tgcacgtctc cagatggcaa aatctgcagt aacagaggga cttgtgtatg 960 

tggtgaatgt acctgtcacg atgttgatcc gactggggac tggggagata ttcatgggga 1020

cacctgtgaa tgtgatgaga gggactgtag agctgtctat gaccgatatt ctgatgactt 1080

ctgttcaggt catggacagt gtaattgcgg aagatgtgac tgcaaagcag gctggtatgg 1140

gaagaagtgt gagcacccac agtcctgcac gctgtcagct gaggagagca tcaggaagtg 1200 

ccagggaagc tcggatctgc cttgctctgg gaggggtaaa tgtgaatgtg gcaaatgcac 1260

ctgctatcct ccaggagatc gccgggtgta tggcaagact tgtgagtgtg atgatcgccg 1320

ctgtgaagac ctcgatggtg tggtctgtgg aggccacggc acatgttcct gtggtcgctg 1380

tgtttgtgag agaggatggt ttggaaagct ctgccaacat ccgcggaagt gtaacatgac 1440 

ggaagaacaa agcaagaatc tgtgtgaatc agcagatggc atattgtgct cggggaaggg 1500 

ttcttgtcat tgtgggaagt gcatttgttc tgctgaagag tggtatattt ctggggagtt 1560

ctgtgactgt gatgacagag actgcgacaa acatgatggt ctcatttgta cagggaatgg 1620 

aatatgtagc tgtggaaact gtgaatgctg ggatggatgg aatggaaatg catgtgaaat 1680 

ctggcttggc tcagaatatc cttaacaatt acatgagaga ggtctggatt cttatttttt 1740 

ctgggccatt agaacatata aatgcgaagg aaaccatgta tattcaccac taggacaggt 1800

taaaaagacc attgtatgtt tttctatttc tgaattacga atgaaatccg agtacctatt 1860 

agaaatgagt tatgcaaatt tagatgcaaa taacattaga aaaaaaagat tcttccataa 1920

ttaacataag tggttcctaa cgagagcaat ttttccaccc aaaagtcatt tggcaacatc 1980 

tacagacaat tttgattgtc acactgggtc gggtaggaag gtatgctgca gacatttggt 2040 

gggtagaggc cagggatgct gctgagcatc ccgcagtgta caggacagcc cccaaacaag 2100 

gaattatcca gccccaaatg ccaatagggc tcagactgag aaacattgag ttatatggct 2160 

attagaaatc cacattctta cacaagaaag accatattag aatctaagga aaacatgcat 2220 

attcacatta attaatcgat cagatttttc cagaattccg tatcagtcac cattttaata 2280

tggggacaat gaagacaagc acacaggagg tagaatatca gagtggggct ggatcaaggg 2340
caaaaactgg tcattaagtc atctgacatt aaatcattta gccactaagt tatttgtgta 2400
ctctcacttt aaactcacca aagaagattc tcttaaagaa attatgaaaa atgtacaatt 2460 
taacatttta aataaatagt gacagaagtt gtttaaaaa 2499 


<210> 13 

<211> 1339 

<212> DNA 

<213> Homo sapiens 


<400> 13 

ggcacgagag cgtgacccag ctgcggccgg ccagccatgg agactggagc gctgcggcgc 60

ccgcaacttc tcccgttgct gctgctgctc tgcggtgggt gtcccagagc aggcggctgc 120

aacgagacag gcatgttgga gaggctgccc ctgtgtggga aggctttcgc agacatgatg 180 

ggcaaggtgg acgtctggaa gtggtgcaac ctgtccgagt tcatcgtgta ctatgagagt 240

ttcaccaact gcaccgagat ggaggccaat gtcgtgggct gctactggcc caaccccctg 300

gcccagggct tcatcaccgg catccacagg cagttcttct ccaactgcac cgtggacagg 360

gtccacttgg aggacccccc agacgaggtt ctcatcccgc tgatcgttat acccgtcgtt 420 

ctgactgtcg ccatggctgg cctggtggtg tggcgcagca aacgcaccga cacgctgctg 480

tgagggtccc ggtgagatgg agtgggtcac acctggcaag ctggaagaaa gttccctggg 540

gatgggagag cgggtgggtg ctgccaatct ccagctactg tggccacacc ccacctggtc 600 

atgggcagac ccctcccttc ctgggctgac ctgctccctc gaggccagcc tgctccctgg 660 

ctgaggctca ggctatccgc ccaagctctt tgctcattct agggccagtg gaggaaaatg 720 

tgataaggcc agagcttgtg tgctgggcac agaaatcacc tgctgcatcc tgtgctccgc 780

agctgggccg gacctctgcc cgcaggtttc tatgctgttt cttagcacag aatccagcct 840 

agccttagcc gcagtctaag ccctgcttgg actaggactc cttgcttgac cccatctctg 900 

gttcctgccc tggctcctgc accagcccca gctcctgcct acatccaggc agaaagatag 960 

caggggctct tggaagacgt tccgtgctgt gacctccgag ccctcctggt gggaagacag 1020

ctggaaaggc tgggaggaga agggaggggt tgggggttcc caggagccat gcgtggcctg 1080 

cagagtccat tccatcatga tgctgtgccc gctatgggct gtgtccatga ccagaggctg 1140 

gagtgggggt gtgttagagc ccctcaccgg gacttgctgt gcggatgggg cctgggcctc 1200 

cttcctacag gggctcctct gtgggtgagg ggccctctgg aatggcatcc catgagcttg 1260

tggcctctat ctgctaccat ctgtgtttta tctgagtaaa gttaccttac ttctggaaaa 1320

aaaaaaaaaa aaaaaaaaa 1339


<210> 14 

<211> 1389 

<212> DNA 

<213> Homo sapiens 


<400> 14 

ggcacgagga cagcctccac cagagtcccc acctttctgg aagctgcagg gctctccatc 60 

caggatccag aagcattgaa ggggaccagc cgctgaaggg attctcagtc ccatctgact 120

ccccatgagg ctcctggctt tcctgagtct gctggccttg gtgctgcagg agacagggac 180

agcttctctc ccaaggaagg agaggaagag gagagaggag cagatgccca gggaaggcga 240 

ttcctttgaa gttctgcctc tgcggaatga tgtcctgaac ccagacaact atggtgaagt 300

cattgacctg agcaactatg aggagctcac agattatggg gaccaactcc ccgaggttaa 360 

ggtgactagc ctcgctcctg caaccagcat cagtcccgcc aagagcacta cggctccagg 420

gacaccctcg tcaaacccca cgatgaccag acctactaca gcagggctgc tactgagttc 480

ccagcccaac catggtctgc ccacctgcct ggtctgcgtg tgcctcggtt cctctgtgta 540

ttgcgatgac attgacctag aggacattcc tcctcttcct cggaggactg cctacctgta 600

tgcacgcttc aaccgcatca gccgtatcag ggccgaagac ttcaaagggc tgacaaagtt 660 

gaagaggatt gacctctcca acaacctcat ttcctccatc gataatgatg ccttccgcct 720

gctacatgcc ctccaggacc tcatcctccc agagaaccag ttggaagctc tgcccgtgct 780 

gcccagtggc attgagttcc tggatgtccg cctaaatcgg ctccagagct cggggataca 840 

gcctgcagcc ttcagggcaa tggagaagct gcagttcctt tacctgtcag acaacctgct 900 

ggattctatc ccggggcctt tgcccccgag cctgcgctct gtacacctgc agaataacct 960 
gatagagacc atgcagagag acgtcttctg tgaccccgag gagcacaaac acacccgcag 1020

gcagctggaa gacatccgcc tggatggcaa ccccatcaac ctcagcctct tccccagcgc 1080

ctacttctgc ctgcctcggc tccccatcgg ccgcttcacg tagctcggag cccttccact 1140

cctcccaggt catctcttgg accagcgggc atcacattct ccagcagccg gggaggaggc 1200

cgcctccctc ctgtggccgc cggcagcatg gacaaaggtc tccatgcagg ccatctcaca 1260

ctgcttcttt ccccacagct ctcacgtctc ccttctccct gcgggtgaca aagaagccca 1320

aggaccacct ccttcctgcc tcattgtaat aaaattcccc acactgaaaa aaaaaaaaaa 1380

aaaaaaaaa 1389


<210> 15 

<211> 2295 

<212> DNA 

<213> Homo sapiens 

<400> 15 

ccacgcgtcc gagagaacag gcctgtctca ggcaggccct gcgcctccta tgcggagatg 60

ctactgccac tgctgctgtc ctcgctgctg ggcgggtccc aggctatgga tgggagattc 120

tggatacgag tgcaggagtc agtgatggtg ccggaggcct gtgacatctc tgtgccctgc 180 

tctttctcct acccccgaca agactggaca gggtctaccc cagcttatgg ctactggttc 240 

aaagcagtga ctgagacaac caagggtgct cctgtggcca caaaccacca gagtcgagag 300

gtggaaatga gcacccgggg ccgattccag ctcactgggg atcccgccaa ggggaactgc 360

tccttggtga tcagagacgc gcagatgcag gatgagtcac agtacttctt tcgggtggag 420

agaggaagct atgtgagata taatttcatg aacgatgggt tctttctaaa agtaacagtg 480 

ctcagcttca cgcccagacc ccaggaccac aacaccgacc tcacctgcca tgtggacttc 540 

tccagaaagg gtgtgagcgc acagaggacc gtccgactcc gtgtggccta tgcccccaga 600

gaccttgtta tcagcatttc acgtgacaac acgccagccc tggagcccca gccccaggga 660

aatgtcccat acctggaagc ccaaaaaggc cagttcctgc ggctcctctg tgctgctgac 720

agccagcccc ctgccacact gagctgggtc ctgcagaaca gagtcctctc ctcgtcccat 780

ccctggggcc ctagacccct ggggctggag ctgcccgggg tgaaggctgg ggattcaggg 840 

cgctacacct gccgagcgga gaacaggctt ggctcccagc agcgagccct ggacctctct 900

gtgcagtatc ctccagagaa cctgagagtg atggtttccc aagcaaacag gacagtcctg 960 

gaaaaccttg ggaacggcac gtctctccca gtactggagg gccaaagcct gtgcctggtc 1020

tgtgtcacac acagcagccc cccagccagg ctgagctgga cccagagggg acaggttctg 1080

agcccctccc agccctcaga ccccggggtc ctggagctgc ctcgggttca agtggagcac 1140 

gaaggagagt tcacctgcca cgctcggcac ccactgggct cccagcacgt ctctctcagc 1200

ctctccgtgc actactcccc gaagctgctg ggcccctcct gctcctggga ggctgagggt 1260

ctgcactgca gctgctcctc ccaggccagc ccggccccct ctctgcgctg gtggcttggg 1320

gaggagctgc tggaggggaa cagcagccag gactccttcg aggtcacccc cagctcagcc 1380

gggccctggg ccaacagctc cctgagcctc catggagggc tcagctccgg cctcaggctc 1440

cgctgtgagg cctggaacgt ccatggggcc cagagtggat ccatcctgca gctgccagat 1500

aagaagggac tcatctcaac ggcattctcc aacggagcgt ttctgggaat cggcatcacg 1560

gctcttcttt tcctctgcct ggccctgatc atcatgaaga ttctaccgaa gagacggact 1620 

cagacagaaa ccccgaggcc caggttctcc cggcacagca cgatcctgga ttacatcaat 1680 

gtggtcccga cggctggccc cctggctcag aagcggaatc agaaagccac accaaacagt 1740

cctcggaccc ctcttccacc aggtgctccc tccccagaat caaagaagaa ccagaaaaag 1800

cagtatcagt tgcccagttt cccagaaccc aaatcatcca ctcaagcccc agaatcccag 1860

gagagccaag aggagctcca ttatgccacg ctcaacttcc caggcgtcag acccaggcct 1920

gaggcccgga tgcccaaggg cacccaggcg gattatgcag aagtcaagtt ccaatgaggg 1980 

tctcttaggc tttaggactg ggacttcggc taggaggaag gtagagtaga agttgaagat 2040

aaccgagtgc aagagtttcc ttctctccct ctctctctct ctttctctct ctctctctct 2100 

ttctctctct ttttaaaaaa actctggcca cggcacagtg gctcacgcct gtaatcccag 2160 

cactttggga agttgaagtt gggccaaatc ccctgagtcc ggattcgaaa acagcctggc 2220

caacttgtga aaacccgtct ctacttaaaa atacccaaat tacctgggca ttgtggcagg 2280

ggcctgttta tcctt 2295 


<210> 16 
<211> 1748

<212> DNA

<213> Homo sapiens 


<400> 16 

agacgttccc tcgcggccct ggcacctcca accccagata tgctgctgct gctgctgctg 60 

cccctgctct gggggaggga gagggtggaa ggacagaaga gtaaccggaa ggattactcg 120

ctgacgatgc agagttccgt gaccgtgcaa gagggcatgt gtgtccatgt gcgctgctcc 180

ttctcctacc cagtggacag ccagactgac tctgacccag ttcatggcta ctggttccgg 240 

gcagggaatg atataagctg gaaggctcca gtggccacaa acaacccagc ttgggcagtg 300

caggaggaaa ctcgggaccg attccacctc cttggggacc cacagaccaa aaattgcacc 360

ctgagcatca gagatgccag aatgagtgat gcggggagat acttctttcg tatggagaaa 420

ggaaatataa aatggaatta taaatatgac cagctctctg tgaacgtgac agccttgacc 480

cacaggccca acatccttat ccccggtacc ctggagtctg gctgcttcca gaatctgacc 540 

tgctctgtgc cctgggcctg tgagcagggg acgcccccta tgatctcctg gatggggacc 600 

tctgtgtccc ccctgcaccc ctccaccacc cgctcctcag tgctcaccct catcccacag 660 

ccccagcacc acggcaccag cctcacctgt caggtgacct tgcctggggc cggcgtgacc 720

acgaacagga ccatccaact caatgtgtcc taccctcctc agaacttgac tgtgactgtc 780

ttccaaggag aaggcacagc atccacagct ctggggaaca gctcatctct ttcagtccta 840 

gagggccagt ctctgcgctt ggtctgtgct gttgacagca atccccctgc caggctgagc 900 

tggacctgga ggagtctgac cctgtacccc tcacagccct caaaccctct ggtactggag 960 

ctgcaagtgc acctggggga tgaaggggaa ttcacctgtc gagctcagaa ctctctgggt 1020 

tcccagcacg tttccctgaa cctctccctg caacaggagt acacaggcaa aatgaggcct 1080 

gtatcaggag tgttgctggg ggcggtcggg ggagctggag ccacagccct ggtcttcctc 1140 

tccttctgtg tcatcttcat tgtagtgagg tcctgcagga agaaatcggc aaggccagca 1200 

gcggacgtgg gagacatagg catgaaggat gcaaacacca tcaggggctc agcctctcag 1260 

ggtaacctga ctgagtcctg ggcagatgat aacccccgac accatggcct ggctgcccac 1320

tcctcagggg aggaaagaga gatccagtat gcacccctca gctttcataa gggggagcct 1380

caggacctat caggtcaaga agccaccaac aatgagtact cagagatcaa gatccccaag 1440 

taagaaaatg cagaggctcg ggcttgtttg agggttcacg acccctccag caaaggagtc 1500 

tgaggctgat tccagtagaa ttagcagccc tcaatgctgt gcaacaagac atcagaactt 1560 

attcctcttg tctaactgaa aatgcatgcc tgatgaccaa actctccctt cccccatcca 1620 

atcggtccac actccccgcc ctggcctctg gtacccacca ttctcctctg tacttctcta 1680 

aggatgacta ctttagattc cgaatatagt gagattgtaa cgtgaaaaaa aaaaaaaaaa 1740 

aaaaaaaa 1748


<210> 17 

<211> 4995 

<212> DNA 

<213> Homo sapiens 


<400> 17 

gccgcgccga ggaggctgcc gctctggctt gccgcccccc gccgccgctg cacaccggac 60 

ccagccgccg tgccgcgggc catggacctg cccaggggcc tggtggtggc ctgggcgctc 120

agcctgtggc cagggttcac ggacaccttc aacatggaca ccaggaagcc ccgggtcatc 180 

cctggctcca ggaccgcctt ctttggctac acagtgcagc agcacgacat cagtggcaat 240 

aagtggctgg tcgtgggcgc cccactggaa accaatggct accagaagac gggagacgtg 300 

tacaagtgtc cagtgatcca cgggaactgc accaaactca acctgggaag ggtcaccctg 360

tccaacgtgt ccgagcggaa agacaacatg cgcctcggcc ttagtctcgc caccaacccc 420

aaggacaaca gcttcctggc ctgcagcccc ctctggtctc atgagtgtgg gagctcctac 480 

tacaccacag ggatgtgttc aagagtcaac tccaacttca ggttctccaa gaccgtggcc 540 

ccagctctcc aaaggtgcca gacctacatg gacatcgtca ttgtcctgga tggctccaac 600

agcatctacc cctgggtgga ggttcagcac ttcctcatca acatcctgaa aaagttttac 660 

attggcccag ggcagatcca ggttggagtt gtgcagtatg gcgaagatgt ggtgcatgag 720

tttcacctca atgactacag gtctgtaaaa gatgtggtgg aagctgccag ccacattgag 780

cagagaggag gaacagagac ccggacggca tttggcattg aatttgcacg ctcagaggct 840 

ttccagaagg gtggaaggaa aggagccaag aaggtgatga ttgtcatcac agatggggag 900 
tcccacgaca gcccagacct ggagaaggtg atccagcaaa gcgaaagaga caacgtaaca 960

agatatgcgg tggccgtcct gggctactac aaccgcaggg ggatcaatcc agaaactttt 1020 

ctaaatgaaa tcaaatacat cgccagtgac cctgatgaca agcacttctt caatgtcact 1080 

gatgaggctg ccttgaagga cattgtcgat gccctggggg acagaatctt cagcctggaa 1140 

ggcaccaaca agaacgagac ctcctttggg ctggagatgt cacagacggg cttttcctcg 1200

cacgtggtgg aggatggggt tctgctggga gccgtcggtg cctatgactg gaatggagct 1260

gtgctaaagg agacgagtgc cgggaaggtc attcctctcc gcgagtccta cctgaaagag 1320 

ttccccgagg agctcaagaa ccatggtgca tacctggggt acacagtcac atcggtcgtg 1380

tcctccaggc aggggcgagt gtacgtggcc ggagcccccc ggttcaacca cacgggcaag 1440 

gtcatcctgt tcaccatgca caacaaccgg agcctcacca tccaccaggc tatgcggggc 1500

cagcagatag gctcttactt tgggagtgaa atcacctcgg tggacatcga cggcgacggc 1560

gtgactgatg tcctgctggt gggcgcaccc atgtacttca acgagggccg tgagcgaggc 1620

aaggtgtacg tctatgagct gagacagaac cggtttgttt ataacggaac gctaaaggat 1680 

tcacacagtt accagaatgc ccgatttggg tcctccattg cctcagttcg agacctcaac 1740

caggattcct acaatgacgt ggtggtggga gcccccctgg aggacaacca cgcaggagcc 1800

atctacatct tccacggctt ccgaggcagc atcctgaaga cacctaagca gagaatcaca 1860

gcctcagagc tggctaccgg cctccagtat tttggctgca gcatccacgg gcaattggac 1920

ctcaatgagg atgggctcat cgacctggca gtgggagccc ttggcaacgc tgtgattctg 1980

tggtcccgcc cagtggttca gatcaatgcc agcctccact ttgagccatc caagatcaac 2040

atcttccaca gagactgcaa gcgcagtggc agggatgcca cctgcctggc cgccttcctc 2100 

tgcttcacgc ccatcttcct ggcaccccat ttccaaacaa caactgttgg catcagatac 2160 

aacgccacca tggatgagag gcggtataca ccgagggccc acctggacga gggcggggac 2220

cgattcacca acagagccgt actgctctcc tccggccagg agctctgtga gcggatcaac 2280

ttccatgtcc tggacactgc tgactacgtg aagccagtga ccttctcagt cgagtattcc 2340 

ctggaggacc ctgaccatgg ccccatgctg gacgacggct ggcccaccac tctcagagtc 2400

tcggtgccct tctggaacgg ctgcaatgag gatgagcact gtgtccctga ccttgtgttg 2460

gatgcccgga gtgacctgcc cacggccatg gagtactgcc agagggtgct gaggaagcct 2520

gcgcaggact gctccgcata cacgctgtcc ttcgacacca cagtcttcat catagagagc 2580 

acacgccagc gagtggcggt ggaggccaca ctggagaaca ggggcgagaa cgcctacagc 2640

acggtcctaa atatctcgca gtcagcaaac ctgcagtttg ccagcttgat ccagaaggag 2700 

gactcagacg gtagcattga gtgtgtgaac gaggagagga ggctccagaa gcaagtctgc 2760

aacgtcagct atcccttctt ccgggccaag gccaaggtgg ctttccgtct tgattttgag 2820 

ttcagcaaat ccatcttcct acaccacctg gagatcgagc tcgctgcagg cagtgacagt 2880

aatgagcggg acagcaccaa ggaagacaac gtggccccct tacgcttcca cctcaaatac 2940

gaggctgacg tcctcttcac caggagcagc agcctgagcc actacgaggt caagctcaac 3000

agctcgctgg agagatacga tggtatcggg cctcccttca gctgcatctt caggatccag 3060 

aacttgggct tgttccccat ccacgggatt atgatgaaga tcaccattcc catcgccacc 3120 

aggagcggca accgcctact gaagctgagg gacttcctca cggacgaggt agcgaacacg 3180

tcctgtaaca tctggggcaa tagcactgag taccggccca ccccagtgga ggaagacttg 3240 

cgtcgtgctc cacagctgaa tcacagcaac tctgatgtcg tctccatcaa ctgcaatata 3300 

cggctggtcc ccaaccagga aatcaatttc catctactgg ggaacctgtg gttgaggtcc 3360 

ctaaaagcac tcaagtacaa atccatgaaa atcatggtca acgcagcctt gcagaggcag 3420

ttccacagcc ccttcatctt ccgtgaggag gatcccagcc gccagatcgt gtttgagatc 3480 

tccaagcaag aggactggca ggtccccatc tggatcattg taggcagcac cctggggggc 3540

ctcctactgc tggccctgct ggtcctggca ctgtggaagc tcggcttctt tagaagtgcc 3600

aggcgcagga gggagcctgg tctggacccc acccccaaag tgctggagtg aggctccaga 3660

ggagactttg agttgatggg ggccaggaca ccagtccagg tagtgttgag acccaggcct 3720 

gtggccccac cgagctggag cggagaggaa gccagctggc tttgcacttg acctcatctc 3780

ccgagcaatg gcgcctgctc cctccagaat ggaactcaag ctggttttaa gtggaactgc 3840 

cctactggga gactgggaca cctttaacac agacccctag ggatttaaag ggacacccct 3900 

acacacaccc aggcccacgc caaggcctcc ctcaggctct gtggagggca tttgctgccc 3960 

cagctactaa ggtgctagga attcgtaatc atccccatcc tccagagaaa cccagggagg 4020 

aagactgtaa atacgaaccc aatctgcaca ctccaggcct ctagttccag aaggatccaa 4080 

gacaaaacag atctgaattc tgcccttttc tctcacccat cccacccctc cattggctcc 4140 

caagtcacac ccactccctt ccccatagat aggcccctgg ggctcccgaa gaatgaaccc 4200 

aagagcaagg gcttgatggt gacagctgca agccagggat gaagaaagac tctgagatgt 4260

ggagactgat ggccaggcaa gtgggaccag gatactggac gctgtcctga gatgagaggt 4320 

agccgggctc tgcacccacg tgcattcaca ttgaccgcaa ctcacacatt cccccaccag 4380 
ctgcagcccc ttgctctcag ctgccaaccc tcccgggtca cttttgttcc caggtacctc 4440

atgggaagca tgtggatgac acaatccctg gggctgtgca ttcccacgtc ttcttgctgc 4500

agcctgcccc tagacatgga cgcaccggcc tggctgcagc tgggcagcag gggtaggggt 4560

agggagcctc ccctccctgt atcaccccct ccctacacac acacacacac acacacacac 4620

acactgcctc ccatccttcc ctcatgcccg ccagtgcaca gggaagggct tggccagcgc 4680 

tgttgagggg tcccctctgg aatgcactga ataaagcacg tgcaaggact cccggagcct 4740 

gtgcagcctt ggtggcaaat atctcatctg ccggccccca ggacaagtgg tatgaccagt 4800

gataatgccc caaggacaag gggcgtgcct ggcgcccagt ggagtaattt atgccttagt 4860 

cttgttttga ggtagaaatg caagggggac acatgaaagg catcagtccc cctgtgcata 4920

gtacgacctt tactgtcgta tttttgaaaa attaaaaata cagtgtttaa aaacaaaaaa 4980

aaaaaaaaaa aaaaa 4995


<210> 18 

<211> 726 

<212> DNA 

<213> Homo sapiens 


<400> 18 

ggcacgagcc ggaccctgcc gccctgccac tatgtcccgc cgctctatgc tgcttgcctg 60 

ggctctcccc agcctccttc gactcggagc ggctcaggag acagaagacc cggcctgctg 120

cagccccata gtgccccgga acgagtggaa ggccctggca tcagagtgcg cccagcacct 180 

gagcctgccc ttacgctatg tggtggtatc gcacacggcg ggcagcagct gcaacacccc 240

cgcctcgtgc cagcagcagg cccggaatgt gcagcactac cacatgaaga cactgggctg 300 

gtgcgacgtg ggctacaact tcctgattgg agaagacggg ctcgtatacg agggccgtgg 360

ctggaacttc acgggtgccc actcaggtca cttatggaac cccatgtcca ttggcatcag 420 

cttcatgggc aactacatgg atcgggtgcc cacaccccag gccatccggg cagcccaggg 480 

tctactggcc tgcggtgtgg ctcagggagc cctgaggtcc aactatgtgc tcaaaggaca 540 

ccgggatgtg cagcgtacac tctctccagg caaccagctc taccacctca tccagaattg 600 

gccacactac cgctccccct gaggccctgc tgatccgcac cccattcctc ccctcccatg 660 

gccaaaaacc ccactgtctc cttctccaat aaagatgtag ctcaaaaaaa aaaaaaaaaa 720

aaaaaa 726


<210> 19 
<211> 3059 
<212> DNA 

<213> Homo sapiens 


<400> 19 

ggcacgagct gtcatccgtt tccatgccgt gaggtccatt cacagaacac atccatggct 60

ctcatgctca gtttggttct gagtctcctc aagctgggat cagggcagtg gcaggtgttt 120

gggccagaca agcctgtcca ggccttggtg ggggaggacg cagcattctc ctgtttcctg 180 

tctcctaaga ccaatgcaga ggccatggaa gtgcggttct tcaggggcca gttctctagc 240

gtggtccacc tctacaggga cgggaaggac cagccattta tgcagatgcc acagtatcaa 300

ggcaggacaa aactggtgaa ggattctatt gcggaggggc gcatctctct gaggctggaa 360

aacattactg tgttggatgc tggcctctat gggtgcagga ttagttccca gtcttactac 420

cagaaggcca tctgggagct acaggtgtca gcactgggct cagttcctct catttccatc 480

gcgggatatg ttgatagaga catccagcta ctctgtcagt cctcgggctg gttcccccgg 540 

cccacagcga agtggaaagg tccacaagga caggatttgt ccacagactc caggacaaac 600

agagacatgc atggcctgtt tgatgtggag atctctctga ccgtccaaga gaacgccggg 660

agcatatcct gttccatgcg gcatgctcat ctgagccgag aggtggaatc cagggtacag 720

ataggagact ggagaagaaa gcacggacag gcaggtaaaa gaaaatattc ctcttcacac 780 

atttatgact cctttccaag tctctcgttt atggattttt atatcctgag gcccgtgggt 840

ccctgcagag ccaagcttgt gatgggaact ctgaaattgc agattctggg ggaggtgcat 900 

tttgtagaga agccccatag ccttcttcag atctctggag ggtccacaac actcaaaaag 960 

ggtcccaatc cttggtcttt cccttctccc tgcgccctgt ttcccacgtg agcacggaac 1020

tgcctgctct ctctgcttgc tttcagaatt gagagacgcc cggaaacacg caggtaccaa 1080
cgcctgagag ggtaacagtg ggcatggagt aggaagatga ccagtgacag atatggagcc 1140

catccagctt gtagacagca aatctgtgat gcccgaatcc accccagggt gcagctgcct 1200 

ctaaatacac ttcttggccc aggacttgga gggaaaagcg tagggactgg gtcagctagg 1260 

aggggtcaca ggcaagacgc cagggaactg agggcattag tagctggctt ctaggggtct 1320

gtgcaaaggg gaacgaagtg aagttagcag gaactggtgg gtggaaggaa gctgaatcct 1380 

ggagtcactc aaggtctcac aaagtcaaat agagggctta cgtgggaggg cagtggtagg 1440 

gctgggtgaa catctcatgg ttgagcatct ccaagcatca gtgaggcacg ggggctgccc 1500 

tggagaaggt acatggctgg tgggatagtg ggactggccg gatcctaccc ggagccagtc 1560

tgcagtggga gggtcgacct cttgctccag cccagatttc gtcttcagta actcatgctt 1620 

cctctctccc ccaccgcacc ccagtggagg tgactctgga tccagagacg gctcacccga 1680 

agctctgcgt ttctgatctg aaaactgtaa cccatagaaa agctcctcag gaggtgcctc 1740 

actctgagaa gagatttaca aggaagagtg tggtggcttc tcagggtttc caagcaggga 1800

aacattactg ggaggtggac gtgggacaaa atgtagggtg gtatgtggga gtgtgtcggg 1860 

atgacgtaga cagggggaag aacaatgtga ctttgtctcc caacaatggg tattgggtcc 1920 

tcagactgac aacagaacat ttgtatttca cattcaatcc ccattttatc agcctccccc 1980 

ccagcacccc tcctacacga gtaggggtct tcctggacta tgagggtggg accatctcct 2040

tcttcaatac aaatgaccag tcccttattt ataccctgct gacatgtcag tttgaaggct 2100 

tgttgagacc ctatatccag catgcgatgt atgacgagga aaaggggact cccatattca 2160 

tatgtccagt gtcctgggga tgagacagag aagaccctgc ttaaagggcc ccacaccaca 2220 

gacccagaca cagccaaggg agagtgctcc cgacaggtgg ccccagcttc ctctccggag 2280 

cctgcgcaca gagagtcacg ccccccactc tcctttaggg agctgaggtt cttctgccct 2340 

gagccctgca gcagcggcag tcacagcttc cagacgaggg gggattggcc tgaccctgtg 2400 

ggagtcagaa gccatggctg ccctgaagtg gggacggaat agactcacat taggtttagt 2460 

ttgtgaaaac tccatccagc taagcgatct tgaacaagtc acaacctccc aggctcctca 2520 

tttgctagtc acggacagtg attcctgcct cacaggtgaa gattaaagag acaacgaatg 2580

tgaatcatgc ttgcaggttt gagggccaca gtgtttgcta atggatgtgt ttttatgatt 2640

atacattttc cccaccataa aactctgttt gccttaattc ccacattaat ttaacttttc 2700 

ctcctatacc caaatccacc catggaatag ttaattggaa cacctgcctt tgtgaggctc 2760 

caaagaataa agaggaggta ggatttttca ctgattctat aagcccagca ttacctgata 2820

ccaaaaccag gcaaagaaaa cagaagaaga ggaaggaaaa ctacaggtcc atatccctca 2880

ttaacacaga cacaaaaatt ctaaataaaa ttttaacaaa ttaaactaaa caatatattt 2940

aaagatgata tataactact cagtgtggtt tgtcccacaa atgcagagtt ggtttaatat 3000 

ttaaatatca accagtgtaa ttcagcacat taataaagta aaaaaaaaaa aaaaaaaaa 3059 


<210> 20 

<211> 1699 

<212> DNA 

<213> Homo sapiens 


<400> 20 

ggcacgaggg ggcaggcatg ggagccgcgc gctctctccc ggcgcccaca cctgtctgag 60 

cggcgcagcg agccgcggcc cgggcgggct gctcggcgcg gaacagtgct cggcatggca 120

gggattccag ggctcctctt ccttctcttc tttctgctct gtgctgttgg gcaagtgagc 180 

ccttacagtg ccccctggaa acccacttgg cctgcatacc gcctccctgt cgtcttgccc 240 

cagtctaccc tcaatttagc caagccagac tttggagccg aagccaaatt agaagtatct 300

tcttcatgtg gaccccagtg tcataaggga actccactgc ccacttacga agaggccaag 360 

caatatctgt cttatgaaac gctctatgcc aatggcagcc gcacagagac gcaggtgggc 420

atctacatcc tcagcagtag tggagatggg gcccaacacc gagactcagg gtcttcagga 480 

aagtctcgaa ggaagcggca gatttatggc tatgacagca ggttcagcat ttttgggaag 540

gacttcctgc tcaactaccc tttctcaaca tcagtgaagt tatccacggg ctgcaccggc 600 

accctggtgg cagagaagca tgtcctcaca gctgcccact gcatacacga tggaaaaacc 660 

tatgtgaaag gaacccagaa gcttcgagtg ggcttcctaa agcccaagtt taaagatggt 720

ggtcgagggg ccaacgactc cacttcagcc atgcccgagc agatgaaatt tcagtggatc 780

cgggtgaaac gcacccatgt gcccaagggt tggatcaagg gcaatgccaa tgacatcggc 840 

atggattatg attatgccct cctggaactc aaaaagcccc acaagagaaa atttatgaag 900 

attggggtga gccctcctgc taagcagctg ccagggggca gaattcactt ctctggttat 960 

gacaatgacc gaccaggcaa tttggtgtat cgcttctgtg acgtcaaaga cgagacctat 1020 
gacttgctct accaacaatg cgattcccag ccaggggcca gcgggtctgg ggtctatgtg 1080

aggatgtgga agagacaaca ccagaagtgg gagcggaaaa ttattggcat gatttcaggg 1140 

caccagtggg tggacatgga tggttcccca caggaattca cacgtggctg ttcagagatc 1200

actcctctcc aatatatccc agatatatct attggagtat aaggaaacta cctggattgt 1260

agggaggggt gacacagtgt ccctcctgca gcaactaagg tcgtcatgtt cttattttag 1320 

gagaggccaa attgtttttt gtcattgccg tgcacacgtg tgtgtgtgtg tgtgtgtgtg 1380

tgtaaggtgt cttataatct tttacctatt tcttacaatt gcaagatgac tggctttact 1440 

atttgaaaac tggtttgtgt atcatatcat atatcattta agcagtttga aggcatactt 1500 

ttgcatagaa ataaaaaaaa tactgatttg gggcaatgag gaatatttga caattaagtt 1560 

aatcttcacg tttttgcaaa ctttgatttt tatttcatct gaacttgttt caaagattta 1620 

tattaaatat ttggcataca aaaaaaaaaa aaaaaaacgg gggggcccgt acccaattcg 1680 

ccctatagtg aggcgatac 1699 


<210> 21 

<211> 1520 

<212> DNA 

<213> Homo sapiens 


<400> 21 

gaattcggca gagcaggtgc ccgacatggc gagtgtagtg ctgccgagcg gatcccagtg 60 

tgcggcggca gcggcggcgg cggcgcctcc cgggctccgg ctccggcttc tgctgttgct 120 

cttctccgcc gcggcactga tccccacagg tgatgggcag aatctgttta cgaaagacgt 180 

gacagtgatc gagggagagg ttgcgaccat cagttgccaa gtcaataaga gtgacgactc 240 

tgtgattcag ctactgaatc ccaacaggca gaccatttat ttcagggact tcaggccttt 300 

gaaggacagc aggtttcagt tgctgaattt ttctagcagt gaactcaaag tatcattgac 360 

aaacgtctca atttctgatg aaggaagata cttttgccag ctctataccg atcccccaca 420 

ggaaagttac accaccatca cagtcctggt cccaccacgt aatctgatga tcgatatcca 480 

gaaagacact gcggtggaag gtgaggagat tgaagtcaac tgcactgcta tggccagcaa 540 

gccagccacg actatcaggt ggttcaaagg gaacacagag ctaaaaggca aatcggaggt 600

ggaagagtgg tcagacatgt acactgtgac cagtcagctg atgctgaagg tgcacaagga 660 

ggacgatggg gtcccagtga tctgccaggt ggagcaccct gcggtcactg gaaacctgca 720 

gacccagcgg tatctagaag tacagtataa gcctcaagtg cacattcaga tgacttatcc 780 

tctacaaggc ttaacccggg aaggggacgc gcttgagtta acatgtgaag ccatcgggaa 840 

gccccagcct gtgatggtaa cttgggtgag agtcgatgat gaaatgcctc aacacgccgt 900 

actgtctggg cccaacctgt tcatcaataa cctaaacaaa acagataatg gtacataccg 960 

ctgtgaagct tcaaacatag tggggaaagc tcactcggat tatatgctgt atgtatacga 1020

tccccccaca actatccctc ctcccacaac aaccaccacc accaccacca ccaccaccac 1080 

caccatcctt accatcatca cagattcccg agccaggtga agaaggctcg atcagggcag 1140 

tggatcatgc cgtgatcggt ggcgtcgtgg cggtggtggt gttcgccatg ctgtgcttgc 1200 

tcatcattct ggggcgctat tttgccagac ataaaggtac atacttcact catgaagcca 1260 

aaggagccga tgacgcagca gacgcagaca cagctataat caatgcagaa ggaggacaga 1320

acaactccga agaaaagaaa gagtacttca tctagatcag cctttttgtt tcaatgaggt 1380 

gtccaactgg ccctatttag atgataaaga gacagtgata ttggaacttg cgagaaattc 1440

gtgtgttttt ttatgaatgg gtggaaaggt gtgagactgg gaaggcttgg gatttgctgt 1500

gtaaaaaaaa aaaaaaaaaa 1520


<210> 22 

<211> 807 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (803) 

<223> n equals a,t,g, or c 

<400> 22

tttttttttt tgagacggag tctcgctctt tcacccaggc cagactgaag tggcgcagtc 60

tcggatcact gaaaagctca gcctcacggg atcacgacca ttctcctgcc tcagcctccc 120

gagtggctgg gactaaaggc gcccgccacc gacgcccgga ctaatttttt gtatttatag 180 

tagagacggg gtttcaccgt gttagccaag atggtctcga tctcttgaac tcgtgatccg 240

cccgcgtcag cctcccaaag tgctgggatt acaggcgtga gccacagcgc ccggcctgta 300

aagatttctt gagcaaatga atgagtaaat gaaaggagtg ctcaaatttc ttttattcta 360

aaaaatgttc ccctttttta gaaaatgctc tgtagctttt gtaggtcttt cctgcactca 420 

aacatccact ccttaccctt tccaatctcc cttttcttct caacccatag aatgtacttc 480 

catgtctacc attccaccag aactactcta accaaggcta cccaggatat ccttatttct 540 

ttcaacttct ttgacatgct cagtcgcatt tggcactgtt aatgtcttcc tcctctttga 600 

aacacctcct ttgcatggca ctatcctggt tttcttcctt catttcagga gaaacttcat 660 

tctccttact gaattctttc ttcctcccct atccatcatc tagatgttgt tgtttctcag 720

tgcagtgttc aatcctagac cccttttcat gtaactcaat gcgttttcct tgggagaatt 780 

aattcccttc cctggtgtca ctntgcc 807


<210> 23 

<211> 2522 

<212> DNA 

<213> Homo sapiens 


<400> 23 

gccgctcgac accagcaccc cgcccagagc agtgccgctg cccaaatcct cgcaggcagc 60

tcatcaacgc aattgcaact ccggctggag ccccggacct gcaagcctgg gtgtccgtgg 120

gtccgtctgc ccagccatct gctggtggca cctctccctc ctgccgcctc cctcggtgaa 180 

ccccaccttg cagaagtgca gctcgcccgg agcagcccag gagctcagca tgcgtccccc 240 

aggcttcagg aacttcttgc tgctggcgtc ctcccttctc tttgctgggt tgtcagctgt 300 

tcctcaaagc ttctcgccat ctctgaggag ctggccgggc gccgcctgca ggctgtcccg 360

ggccgagtcg gagcgacgct gccgcgcacc tgggcagccc ccgggggccg cgctgtgcca 420

cggccggggc cgctgcgact gcggcgtctg catctgccac gtgactgagc cgggcatgtt 480

cttcgggccc ctgtgtgagt gccatgagtg ggtgtgcgag acctacgacg ggagcacctg 540 

tgcaggccat ggtaagtgtg actgtggcaa gtgcaagtgt gaccagggat ggtatgggga 600 

tgcttgccag tacccaacta actgtgactt gacaaagaag aaaagtaacc aaatgtgcaa 660

gaattcacaa gacatcatct gctctaatgc aggtacatgt cactgtggca ggtgtaagtg 720 

tgataattca gatggaagtg gacttgtgta tggtaaattt tgtgagtgtg acgatagaga 780 

atgcatagac gatgaaacag aagaaatatg tggaggccat gggaagtgtt actgtggaaa 840 

ctgctactgc aaggctggtt ggcatggaga taaatgtgaa ttccagtgcg atatcacccc 900 

ctgggaaagc aagcgaagat gcacgtctcc agatggcaaa atctgcagta gcagagggac 960 

ttgtgtatgt ggtgaatgta cctgtcacga tgttgatccg actggggact ggggagatat 1020

tcatggggac acctgtgaat gtgatgagag ggactgtaga gctgtctatg accgatattc 1080 

tgatgacttc tgttcaggtc atggacagtg taattgcgga agatgtgact gcaaagcagg 1140 

ctggtatggg aagaagtgtg agcacccaca gtcctgcacg ctgtcagctg aggagagcat 1200

caggaagtgc cagggaagct cggatctgcc ttgctctggg aggggtaaat gtgaatgtgg 1260

caaatgcacc tgctatcctc caggagatcg ccgggtgtat ggcaagactt gtgagtgtga 1320

tgatcgccgc tgtgaagacc tcgatggtgt ggtctgtgga ggccacggca catgttcctg 1380

tggtcgctgt gtttgtgaga gaggatggtt tggaaagctc tgccaacatc cgcggaagtg 1440 

taacatgacg gaagaacaaa gcaagaatct gtgtgaatca gcagatggca tattgtgctc 1500 

ggggaagggt tcttgtcatt gtgggaagtg catttgttct gctgaagagt ggtatatttc 1560 

tggggagttc tgtgactgtg atgacagaga ctgcgacaaa catgatggtc tcatttgtac 1620

cagggaatgg aatatgtagc tgtggaaact gtgaatgctg ggatggatgg aatggaaatg 1680

catgtgaaat ctggcttggc tcagaatatc cttaacaatt acatgagaga ggtctggatt 1740 

cttatttttt ctgggccatt agaacatata aatgcgaagg aaaccatgta tattcaccac 1800 

taggacaggt taaaaagacc attgtatgtt tttctatttc tgaattacga atgaaatccg 1860 

agtacctatt agaaatgagt tatgcaaatt tagatgcaaa taacattaga aaaaaaagat 1920

tcttccataa ttaacataag tggttcctaa cgagagcaat ttttccaccc aaaagtcatt 1980 

tggcaacatc tacagacaat tttgattgtc acactgggtc gggtaggaag gtatgctgca 2040

gacatttggt gggtagaggc cagggatgct gctgagcatc ccgcagtgta caggacagcc 2100
cccaaacaag gaattatcca gccccaaatg ccaatagggc tcagactgag aaacattgag 2160

ttatatggct attagaaatc cacattctta cacaagaaag accatattag aatctaagga 2220

aaacatgcat attcacatta attaatcgat cagatttttc cagaattccg tatcagtcac 2280 

cattttaata tggggacaat gaagacaagc acacaggagg tagaatatca gagtggggct 2340 

ggatcaaggg caaaaactgg tcattaagtc atctgacatt aaatcattta gccactaagt 2400

tatttgtcta ctctcacttt aaactcacca aagaagattc tcttaaagaa attatgaaaa 2460 

atgtacaatt taacatttta aataaatagt gacagaagtt gtttataaaa aaaaaaaaaa 2520 

aa 2522 


<210> 24 

<211> 1344 

<212> DNA 

<213> Homo sapiens 


<400> 24 

gtccgagaga acaggcctgt ctcaggcagg ccctgcgcct cctatgcgga gatgctactg 60 

ccactgctgc tgtcctcgct gctgggcggg tcccaggcta tggatgggag attctggata 120 

cgagtgcagg agtcagtgat ggtgccggag ggcctgtgca tctctgtgcc ctgctctttc 180

tcctaccccc gacaagactg gacagggtct accccagctt atggctactg gttcaaagca 240 

gtgactgaga caaccaaggg tgctcctgtg gccacaaacc accagagtcg agaggtggaa 300 

atgagcaccc ggggccgatt tccaggctca ctgggggatc ccgccaaggg gaactgctcc 360

ttggtgatca gaagacgcgc agatgcaagg atgagtcaca gtacttcttt cgggtggaga 420 

gaggaagcta tgtgagatat aatttcatga acgatgggtt ctttctaaaa gtaacagtgc 480 

tcagcttcac gcccagaccc caggaccaca acaccgacct cacctgccat gtggacttct 540 

ccagaaaggg tgtgagcgca cagaggaccg tccgactccg tgtggcctat gcccccagag 600 

accttgttat cagcatttca cgtgacaaca cgccagccct ggagccccag ccccagggga 660

aatgtcccat acctggaagc ccaaaaaggc cagttcctgc ggctcctctg tgctgctgac 720

agccagcccc ctgccacact gagctgggtc ctgcagaaca gagtcctctc ctcgtcccat 780 

ccctggggcc ctagacccct ggggctggag ctgcccgggg tgaaggctgg ggattcaggg 840 

cgctacacct gccgagcgga gaacaggctt ggctcccagc agcgagccct ggacctctct 900 

gtgcagtatc ctccagagaa cctgagagtg atggtttccc aagcaaacag gacagtcctg 960

gaaaaccttg ggaacggcac gtctctccca gtactggagg gccaaagcct gtgcctggtc 1020

tgtgtcacac acagcagccc cccagccagg ctgagctgga cccagagggg acaggttctg 10 80 

agcccctccc agccctcaga ccccggggtc ctggagctgc ctcgggttca agtggagcac 1140 

gaaggagagt tcacctgcca cgctcggcac ccactgggct cccagcacgt ctctctcagc 1200

ctctccgtgc actactcccc gaagctgctg ggcccctcct gctcctggga agctgaaggt 1260

ctgcactgca actgctcctc ccaggccagc ccggccccct ctctgcgctg gtggcttggg 1320

gaggagctgc tggaagggga acaa 1344


<210> 25 

<211> 4631 

<212> DNA 

<213> Homo sapiens 


<400> 25 

ccgccgcgcc gaggaggctg ccgctctggc ttgccgcccc ccgccgccgc tgcacaccgg 60 

acccagccgc cgtgccgcgg gccatggacc tgcccagggg cctggtggtg gcctgggcgc 120

tcagcctgtg gccagggttc acggacacct tcaacatgga caccaggaag ccccgggtca 180 

tccctggctc caggaccgcc ttctttggct acacagtgca gcagcacgac atcagtggca 240

ataagtggct ggtcgtgggc gccccactgg aaaccaatgg ctaccagaag acgggagacg 300

tgtacaagtg tccagtgatc cacgggaact gcaccaaact caacctggga agggtcaccc 360

tgtccaacgt gtccgagcgg aaagacaaca tgcgcctcgg ccttagtctc gccaccaacc 420

ccaaggacaa cagcttcctg gcctgcagcc ccctctggtc tcatgagtgt gggagctcct 480

actacaccac agggatgtgt tcaagagtca actccaactt caggttctcc aagaccgtgg 540

ccccagctct ccaaaggtgc cagacctaca tggacatcgt cattgtcctg gatggctcca 600

acagcatcta cccctgggtg gaggttcagc acttcctcat caacatcctg aaaaagtttt 660

acattggccc agggcagatc caggttggag ttgtgcagta tggcgaagat gtggtgcatg 720

agtttcacct caatgactac aggtctgtaa aagatgtggt ggaagctgcc agccacattg 780 

agcagagagg aggaacagag acccggacgg catttggcat tgaatttgca cgctcagagg 840 

ctttccagaa gggtggaagg aaaggagcca agaaggtgat gattgtcatc acagatgggg 900 

agtcccacga cagcccagac ctggagaagg tgatccagca aagcgaaaga gacaacgtaa 960 

caagatatgc ggtggccgtc ctgggctact acaaccgcag ggggatcaat ccagaaactt 1020

ttctaaatga aatcaaatac atcgccagtg accctgatga caagcacttc ttcaatgtca 1080 

ctgatgaggc tgccttgaag gacattgtcg atgccctggg ggacagaatc ttcagcctgg 1140

aaggcaccaa caagaacgag acctcctttg ggctggagat gtcacagacg ggcttttcct 1200

cgcacgtggt ggaggatggg gttctgctgg gagccgtcgg tgcctatgac tggaatggag 1260 

ctgtgctaaa ggagacgagt gccgggaagg tcattcctct ccgcgagtcc tacctgaaag 1320

agttccccga ggagctcaag aaccatggtg catacctggg gtacacagtc acatcggtcg 1380 

tgtcctccag gcaggggcga gtgtacgtgg ccggagcccc ccggttcaac cacacgggca 1440 

aggtcatcct gttcaccatg cacaacaacc ggagcctcac catccaccag gctatgcggg 1500 

gccagcagat aggctcttac tttgggagtg aaatcacctc ggtggacatc gacggcgacg 1560

gcgtgactga tgtcctgctg gtgggcgcac ccatgtactt caacgagggc cgtgagcgag 1620 

gcaaggtgta cgtctatgag ctgagacaga accggtttgt ttataacgga acgctaaagg 1680 

attcacacag ttaccagaat gcccgatttg ggtcctccat tgcctcagtt cgagacctca 1740

accaggattc ctacaatgac gtggtggtgg gagcccccct ggaggacaac cacgcaggag 1800 

ccatctacat cttccacggc ttccgaggca gcatcctgaa gacacctaag cagagaatca 1860

cagcctcaga gctggctacc ggcctccagt attttggctg cagcatccac gggcaattgg 1920

acctcaatga ggatgggctc atcgacctgg cagtgggagc ccttggcaac gctgtgattc 1980

tgtggtcccg cccagtggtt cagatcaatg ccagcctcca ctttgagcca tccaagatca 2040

acatcttcca cagagactgc aagcgcagtg gcagggatgc cacctgcctg gccgccttcc 2100 

tctgcttcac gcccatcttc ctggcacccc atttccaaac aacaactgtt ggcatcagat 2160 

acaacgccac catggatgag aagcggtata caccgagggc ccacctggac gaaggcgggg 2220

accgattcac caacagagcc gtactgctct cctccggcca ggagctctgt gagcggatca 2280

acttccatgt cctggacact gctgactacg tgaagccagt gaccttctca gtcgagtatt 2340 

ccctggagga ccctgaccat ggccccatgc tggacgacgg ctggcccacc actctcagag 2400 

tctcggtgcc cttctggaac ggctgcaatg aggatgagca ctgtgtccct gaccttgtgt 2460

tggatgcccg gagtgacctg cccacggcca tggagtactg ccagagggtg ctgaggaagc 2520 

ctgcgcagga ctgctccgca tacacgctgt ccttcgacac cacagtcttc atcatagaga 2580 

gcacacgcca gcgagtggcg gtggaggcca cactggagaa caggggcgag aacgcctaca 2640

gcacggtcct aaatatctcg cagtcagcaa acctgcagtt tgccagcttg atccagaagg 2700 

aggactcaga cggtagcatt gagtgtgtga acgaggagag gaggctccag aagcaagtct 2760

gcaacgtcag ctatcccttc ttccgggcca aggccaaggt ggctttccgt cttgattttg 2820

agttcagcaa atccatcttc ctacaccacc tggagatcga gctcgctgca ggcagtgaca 2880

gtaatgagcg ggacagcacc aaggaagaca acgtggcccc cttacgcttc cacctcaaat 2940

acgaagctga cgtcctcttc accaggagca gcagcctgag ccactacgag gtcaagctca 3000 

acagctcgct ggagagatac gatggtatcg ggcctccctt cagctgcatc ttcaggatcc 3060 

agaacttggg cttgttcccc atccacggga ttatgatgaa gatcaccatt cccatcgcca 3120

ccaggagcgg caaccgccta ctgaagctga gggacttcct cacggacgag ggcgaacacg 3180 

tcctgtaaca tctggggcaa tagcactgag taccggccca ccccagtgga ggaagacttg 3240

cgtcgtgctc cacagctgaa tcacagcaac tctgatgtcg tctccatcaa ctgcaatata 3300 

cggctggtcc ccaaccagga aatcaatttc catctactgg ggaacctgtg gttgaggtcc 3360

gtaaaagcac tcaagtacaa atccatgaaa atcatggtca acgcagcgtt gcagaggcag 3420

ttccacagcc ccttcatctt ccgtgaggag gatcccagcc gccagatcgt gtttgagatc 3480 

tccaagcaag aggactggca ggtccccatc tggatcattg taggcagcac cctggggggc 3540 

ctcctactgc tggccctgct ggtcctggca ctgtggaagc tcggcttctt tagaagtgcc 3600

aggcgcagga gggagcctgg tctggacccc acccccaaag tgctggagtg aggctccaga 3660

ggagactttg agttgatggg ggccagacac cagtccaggt agtgttgaga cccaggcctg 3720

tggccccacc gagctggagc ggagaggaag ccagctggct ttgcacttga cctcatctcc 3780

cgagcaatgg cgcctgctcc ctccagaatg gaactcaagc tggttttaag tggaactgcc 3840 

ctactgggag actgggacac ctttaacaca gacccctagg gatttaaagg gacaccccta 3900

cacacaccca ggcccacgcc aaggcctccc tcaggctctg tggagggcat ttgctgcccc 3960

agctactaag gtgctaggaa ttcgtaatca tccccatcct ccagagaaac ccagggagga 4020 

agactgtaaa tacgaaccca atctgcacac tccaggcctc tagttccaga aggatccaag 4080 

acaaaacaga tctgaattct gcccttttct ctcacccatc ccacccctcc attggctccc 4140 

aagtcacacc cactcccttc cccatagata ggcccctggg gctcccgaag atgaacccaa 4200

gagcaagggc ttgatggtga cagctgcaag ccagggatga agaaagactc tgagatgtgg 4260

agactgatgg ccaggcaagt gggaccagga tactggacgc tgtcctgaga tgagaggtag 4320

ccgggctctg cacccacgtg cattcacatt gaccgcaact cacacattcc cccaccagct 4380

gcagcccctt gctctcagct gccaaccctc ccgggtcact tttgttccca ggtacctcat 4440

gggaagcatg tggatgacac aatccctggg gctgtgcatt cccacgtctt cttgctgcag 4500

cctgccccta gacatggacg caccggcctg gctgcagctg ggcagcaggg gtaggggtag 4560

ggagcctccc ctccctgtat caccccctcc ctacacgtcg acgcggccgc gaattcccgg 4620

gtcgacgagc t 4631


<210> 26 

<211> 2008 

<212> DNA 

<213> Homo sapiens 

<400> 26 

cgggggcttt ctaacgggaa aaactctact aaagggttca aaagctggag ctccaccgcg 60

gtggcggccg ctctagaact agtggatccc ccgggctgca ggaattcggc acgagctcgt 120

gccgaattcg gcacgagtca cagaacacat ccatggctct matgctcagt ttggttctga 180 

gtctcctcaa gctgggwtca gggcagtggc aggtgfcttgg gccagacaag cctgtccagg 240 

ccttggtggg ggaggacgca gcattctcct gtttcctgtc tcctaagacc aatgcagagg 300

ccatggaagt gcggttcttc aggggccagt tctctagcgt ggtccacctc tacagggacg 360

ggaaggacca gccatttatg cagatgccac agtatcaagg caggacaaaa ctggtgaagg 420

attctattgc ggaggggcgc atctctctga ggctggaaaa cattactgtg ttggstgctg 480 

gcctctatgg gtgcaggatt agttcccagt cttactacca gaaggccatc tgggagctac 540 

aggtgtcagc actgggctca gttcctctca tttccatcac gggatatgtt gatagagaca 600

tccagctact ctgtcagtcc tcgggctggt tcccccggcc cacagcgaag tggaaaggtc 660 

cacaaggaca ggatttgtcc acagactcca ggacaaacag agacatgcat ggcctgtttg 720 

atgtggagat ctctctgacc gtccaagaga acgccgggag catatcctgt tccatgcggc 780 

atgctcatct gagccgagag gtggaatcca gggtacagat aggagatacc tttttcgagc 840

ctatatcgtg gmacctggyt accaaagtac tgggaatact ctgctgtggc ctattttttg 900 

gcattgttgg actgaagatt ttcttctcca aattccagtg gaaaatccag gcggaactgg 960 

actggagaag aaagcacgga caggcagaat tgagagacgc ccggaaacac gcagtggagg 1020

tgactctgga tccagagacg gctcacccga agctctgcgt ttctgatctg aaaactgtaa 1080 

cccatagaaa agctccccag gaggtgcctc actctgagaa gagatttaca aggaagagtg 1140 

tggtggcttc tcagagtttc caagcaggga aacattactg ggaggtggac ggaggacaca 1200

ataaaaggtg gcgcgtggga gtgtgccggg atgatgtgga caggaggaag gagtacgtga 1260

ctttgtctcc cgatcatggg tactgggtcc tcagactgaa tggagaacat ttgtatttca 1320 

cattaaatcc ccgttttatc agcgtcttcc ccaggacccc acctacaaaa ataggggtct 1380 

tcctggacta tgagtgtggg accatctcct tcttcaacat aaatgaccag tcccttattt 1440 

ataccctgac atgtcggttt gaaggcttat tgaggcccta cattgagtat ccgtcctata 1500 

atgagcaaaa tggaactccc agagacaagc aacagtgagt cctcctcaca ggcaaccacg 1560

cccttcctcc ccaggggtga aatgtaggat gaatcacatc ccacattctt ctttagggat 1620 

attaaggtct ctctcccaga tccaaagtcc cgcagcagcc ggccaaggtg gcttccagat 1680 

gaagggggac tggcctgtcc acatgggagt caggtgtcat ggctgccctg agctgggagg 1740

gaagaaggct gacattacat ttagtttgct ctcactccat ctggctaagt gatcttgaaa 1800 

taccacctct caggtgaaga accgtcagga attcccatct cacaggctgt ggtgtagatt 1860 

aagtagacaa ggaatgtgaa taatgcttag atcttattga tgacagagtg tatcctaatg 1920 

gtttgttcat tatattacac tttcagtaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaamc 1980

tcgagggggg gcccggtacc caattcgg 2008


<210> 27 

<211> 1654 

<212> DNA 

<213> Homo sapiens 

<400> 27

ggcacgaggg ggcaggcatg ggagccgcgc gctctctccc ggcgcccaca cctgtctgag 60 

cggcgcagcg agccgcggcc cgggcgggct gctcggcgcg gaacagtgct cggcatggca 120 

gggattccag ggctcctctt ccttctcttc tttctgctct gtgctgttgg gcaagtgagc 180 

ccttacagtg ccccctggaa acccacttgg cctgcatacc gcctccctgt cgtcttgccc 240 

cagtctaccc tcaatttagc caagccagac tttggagccg aagccaaatt agaagtatct 300 

tcttcatgtg gaccccagtg tcataaggga actccactgc ccacttacga agaggccaag 360 

caatatctgt cttatgaaac gctctatgcc aatggcagcc gcacagagac gcaggtgggc 420

atctacatcc tcagcagtag tggagatggg gcccaacacc gagactcagg gtcttcagga 480 

aagtctcgaa ggaagcggca gatttatggc tatgacagca ggttcagcat ttttgggaag 540

gacttcctgc tcaactaccc tttctcaaca tcagtgaagt tatccacggg ctgcaccggc 600 

accctggtgg cagagaagca tgtcctcaca gctgcccact gcatacacga tggaaaaacc 660 

tatgtgaaag gaacccagaa gcttcgagtg ggcttcctaa agcccaagtt taaagatggt 720

ggtcgagggg ccaacgactc cacttcagcc atgcccgagc agatgaaatt tcagtggatc 780

cgggtgaaac gcacccatgt gcccaagggt tggatcaagg gcaatgccaa tgacatcggc 840 

atggattatg attatgccct cctggaactc aaaaagcccc acaagagaaa atttatgaag 900 

attggggtga gccctcctgc taagcagctg ccagggggca gaattcactt ctctggttat 960 

gacaatgacc gaccaggcaa tttggtgtat cgcttctgtg acgtcaaaga cgagacctat 1020

gacttgctct accagcaatg cgatgcccag ccaggggcca gcgggtctgg ggtctatgtg 1080 

aggatgtgga agagacagca gcagaagtgg gagcgaaaaa ttattggcat tttttcaggg 1140 

caccagtggg tggacatgaa tggttcccca caggatttca acgtggctgt cagaatcact 1200

cctctcaaat atgcccagat ttgctattgg attaaaggaa actacctgga ttgtagggag 1260

gggtgacaca gtgttccctc ctggcagcaa ttaagggtct tcatgttctt attttaggag 1320 

aggccaaatt gttttttgtc attggcgtgc acacgtgtgt gtgtgtgtgt gtgtgtgtgt 1380 

aaggtgtctt ataatctttt acctatttct tacaattgca agatgactgg ctttactatt 1440 

tgaaaactgg tttgtgtatc atatcatata tcatttaagc agtttgaagg catacttttg 1500 

catagaaata aaaaaaatac tgatttgggg caatgaggaa tatttgacaa ttaagttaat 1560

cttcacgttt ttgcaaactt tgatttttat ttcatctgaa cttgtttcaa agatttatat 1620 

taaatatttg gcatacaaga aaaaaaaaaa aaaa 1654 


<210> 28 

<211> 1508 

<212> DNA 

<213> Homo sapiens 


<400> 28 

ggcacgagga catggcgtag tgtagtgctg ccgagcggat cccagtgtgc ggcggcagcg 60

gcggcggcgg cgcctcccgg gctccggctc cggcttctgc tgttgctctt ctccgccgcg 120 

gcactgatcc ccacaggtga tgggcagaat ctgtttacga aagacgtgac agtgatcgag 180 

ggagaggttg cgaccatcag ttgccaagtc aataagagtg acgactctgt gattcagcta 240 

ctgaatccca acaggcagac catttatttc agggacttca ggcctttgaa ggacagcagg 300 

tttcagttgc tgaatttttc tagcagtgaa ctcaaagtat cattgacaaa cgtctcaatt 360 

tctgatgaag gaagatactt ttgccagctc tataccgatc ccccacagga aagttacacc 420 

accatcacag tcctggtccc accacgtaat ctgatgatcg atatccagaa agacactgcg 480

gtggaaggtg aggagattga agtcaactgc actgctatgg ccagcaagcc agccacgact 540 

atcaggtggt tcaaagggaa cacagagcta aaaggcaaat cggaggtgga agagtggtca 600 

gacatgtaca ctgtgaccag tcagctgatg ctgaaggtgc acaaggagga cgatggggtc 660

ccagtgatct gccaggtgga gcaccctgcg gtcactggaa acctgcagac ccagcggtat 720 

ctagaagtac agtataagcc tcaagtgcac attcagatga cttatcctct acaaggctta 780 

acccgggaag gggacgcgct tgagttaaca tgtgaagcca tcgggaagcc ccagcctgtg 840 

atggtaactt gggtgagagt cgatgatgaa atgcctcaac acgccgtact gtctgggccc 900 

aacctgttca tcaataacct aaacaaaaca gataatggta cataccgctg tgaagcttca 960 

aacatagtgg ggaaagctca ctcggattat atgctgtatg tatacgatcc ccccacaact 1020 

atccctcctc ccacaacaac caccaccacc accaccacca ccaccaccac catccttacc 1080 

atcatcacag attccccgag ccaggtgaag aaggctcgat cagggcagtg gatcatgccg 1140 

tgatcggtgg cgtcgtggcg gtggtggtgt tcgccatgct gtgcttgctc atcattctgg 1200

ggcgctattt tgccagacat aaaggtacat acttcactca tgaagccaaa ggagccgatg 1260

acgcagcaga cgcagacaca gctataatca atgcagaagg aggacagaac aactccgaag 1320

aaaagaaaga gtacttcatc tagatcagcc tttttgtttc aatgaggtgt ccaactggcc 1380

ctatttagat gataaagaga cagtgatatt ggaacttgcg agaaattcgt gtgttttttt 1440

atgaatgggt ggaaaggtgt gagactggga aggcttggga tttgctgtgt aaaaaaaaaa 1500

aaaaaaaa 1508


<210> 29 

<211> 529 

<212> PRT

<213> Homo sapiens 

<400> 29 

Met Val Ala Gln Asp Pro Gln Gly Cys Leu Gln Leu Cys Leu Ser Glu
1 5 10 15

Val Ala Asn Gly Leu Arg Asn Pro Val Ser Met Val His Ala Gly Asp 
20 25 30 

Gly Thr His Arg Phe Phe Val Ala Glu Gln Val Gly Val Val Trp Val
35 40 45

Tyr Leu Pro Asp Gly Ser Arg Leu Glu Gln Pro Phe Leu Asp Leu Lys
50 55 60

Asn Ile Val Leu Thr Thr Pro Trp Ile Gly Asp Glu Arg Gly Phe Leu
65 70 75 80

Gly Leu Ala Phe His Pro Lys Phe Arg His Asn Arg Lys Phe Tyr Ile
85 90 95

Tyr Tyr Ser Cys Leu Asp Lys Lys Lys Val Glu Lys Ile Arg Ile Ser
100 105 110 

Glu Met Lys Val Ser Arg Ala Asp Pro Asn Lys Ala Asp Leu Lys Ser 
115 120 125 

Glu Arg Val Ile Leu Glu Ile Glu Glu Pro Ala Ser Asn His Asn Gly
130 135 140

Gly Gln Leu Leu Phe Gly Leu Asp Gly Tyr Met Tyr Ile Phe Thr Gly
145 150 155 160

Asp Gly Gly Gln Ala Gly Asp Pro Phe Gly Leu Phe Gly Asn Ala Gln
165 170 175

Asn Lys Ser Ser Leu Leu Gly Lys Val Leu Arg Ile Asp Val Asn Arg
180 185 190 

Ala Gly Ser His Gly Lys Arg Tyr Arg Val Pro Ser Asp Asn Pro Phe 
195 200 205 

Val Ser Glu Pro Gly Ala His Pro Ala Ile Tyr Ala Tyr Gly Ile Arg
210 215 220

Asn Met Trp Arg Cys Ala Val Asp Arg Gly Asp Pro Ile Thr Arg Gln
225 230 235 240

Gly Arg Gly Arg Ile Phe Cys Gly Asp Val Gly Gln Asn Arg Phe Glu
245 250 255

Glu Val Asp Leu Ile Leu Lys Gly Gly Asn Tyr Gly Trp Arg Ala Lys
260 265 270 

Glu Gly Phe Ala Cys Tyr Asp Lys Lys Leu Cys His Asn Ala Ser Leu 
275 280 285 

Asp Asp Val Leu Pro Ile Tyr Ala Tyr Gly His Ala Val Gly Lys Ser
290 295 300 

Val Thr Gly Gly Tyr Val Tyr Arg Gly Cys Glu Ser Pro Asn Leu Asn 
305 310 315 320 

Gly Leu Tyr Ile Phe Gly Asp Phe Met Ser Gly Arg Leu Met Ala Leu
325 330 335

Gln Glu Asp Arg Lys Asn Lys Lys Trp Lys Lys Gln Asp Leu Cys Leu
340 345 350

Gly Ser Thr Thr Ser Cys Ala Phe Pro Gly Leu Ile Ser Thr His Ser
355 360 365

Lys Phe Ile Ile Ser Phe Ala Glu Asp Glu Ala Gly Glu Leu Tyr Phe
370 375 380

Leu Ala Thr Ser Tyr Pro Ser Ala Tyr Ala Pro Arg Gly Ser Ile Tyr
385 390 395 400

Lys Phe Val Asp Pro Ser Arg Arg Ala Pro Pro Gly Lys Cys Lys Tyr 
405 410 415 

Lys Pro Val Pro Val Arg Thr Lys Ser Lys Arg Ile Pro Phe Arg Pro
420 425 430

Leu Ala Lys Thr Val Leu Asp Leu Leu Lys Glu Gln Ser Glu Lys Ala
435 440 445

Ala Arg Lys Ser Ser Ser Ala Thr Leu Ala Ser Gly Pro Ala Gln Gly
450 455 460 

Leu Ser Glu Lys Gly Ser Ser Lys Lys Leu Ala Ser Pro Thr Ser Ser 
465 470 475 480 

Lys Asn Thr Leu Arg Gly Pro Gly Thr Lys Lys Lys Ala Arg Val Gly 
485 490 495 

Pro His Val Arg Gln Gly Lys Arg Arg Lys Ser Leu Lys Ser His Ser
500 505 510

Gly Arg Met Arg Pro Ser Ala Glu Gln Lys Arg Ala Gly Arg Ser Leu
515 520 525

Pro


<210> 30

<211> 494 

<212> PRT 

<213> Homo sapiens 

<400> 30 

Met Arg Pro Pro Gly Phe Arg Asn Phe Leu Leu Leu Ala Ser Ser Leu 
1 5 10 15

Leu Phe Ala Gly Leu Ser Ala Val Pro Gln Ser Phe Ser Pro Ser Leu
20 25 30

Arg Ser Trp Pro Gly Ala Ala Cys Arg Leu Ser Arg Ala Glu Ser Glu
35 40 45

Arg Arg Cys Arg Ala Pro Gly Gln Pro Pro Gly Ala Ala Leu Cys His
50 55 60

Gly Arg Gly Arg Cys Asp Cys Gly Val Cys Ile Cys His Val Thr Glu
65 70 75 80 

Pro Gly Met Phe Phe Gly Pro Leu Cys Glu Cys His Glu Trp Val Cys 
85 90 95 

Glu Thr Tyr Asp Gly Ser Thr Cys Ala Gly His Gly Lys Cys Asp Cys 
100 105 110 

Gly Lys Cys Lys Cys Asp Gln Gly Trp Tyr Gly Asp Ala Cys Gln Tyr
115 120 125

Pro Thr Asn Cys Asp Leu Thr Lys Lys Lys Ser Asn Gln Met Cys Lys
130 135 140

Asn Ser Gln Asp Ile Ile Cys Ser Asn Ala Gly Thr Cys His Cys Gly
145 150 155 160 

Arg Cys Lys Cys Asp Asn Ser Asp Gly Ser Gly Leu Val Tyr Gly Lys 
165 170 175 

Phe Cys Glu Cys Asp Asp Arg Glu Cys Ile Asp Asp Glu Thr Glu Glu
180 185 190

Ile Cys Gly Gly His Gly Lys Cys Tyr Cys Gly Asn Cys Tyr Cys Lys
195 200 205

Ala Gly Trp His Gly Asp Lys Cys Glu Phe Gln Cys Asp Ile Thr Pro
210 215 220

Trp Glu Ser Lys Arg Arg Cys Thr Ser Pro Asp Gly Lys Ile Cys Ser
225 230 235 240 

Asn Arg Gly Thr Cys Val Cys Gly Glu Cys Thr Cys His Asp Val Asp 
245 250 255 

Pro Thr Gly Asp Trp Gly Asp Ile His Gly Asp Thr Cys Glu Cys Asp
260 265 270

Glu Arg Asp Cys Arg Ala Val Tyr Asp Arg Tyr Ser Asp Asp Phe Cys
275 280 285

Ser Gly His Gly Gln Cys Asn Cys Gly Arg Cys Asp Cys Lys Ala Gly
290 295 300

Trp Tyr Gly Lys Lys Cys Glu His Pro Gln Ser Cys Thr Leu Ser Ala
305 310 315 320

Glu Glu Ser Ile Arg Lys Cys Gln Gly Ser Ser Asp Leu Pro Cys Ser
325 330 335 

Gly Arg Gly Lys Cys Glu Cys Gly Lys Cys Thr Cys Tyr Pro Pro Gly 
340 345 350 

Asp Arg Arg Val Tyr Gly Lys Thr Cys Glu Cys Asp Asp Arg Arg Cys
355 360 365 

Glu Asp Leu Asp Gly Val Val Cys Gly Gly His Gly Thr Cys Ser Cys 
370 375 380 

Gly Arg Cys Val Cys Glu Arg Gly Trp Phe Gly Lys Leu Cys Gln His
385 390 395 400

Pro Arg Lys Cys Asn Met Thr Glu Glu Gln Ser Lys Asn Leu Cys Glu
405 410 415

Ser Ala Asp Gly Ile Leu Cys Ser Gly Lys Gly Ser Cys His Cys Gly
420 425 430

Lys Cys Ile Cys Ser Ala Glu Glu Trp Tyr Ile Ser Gly Glu Phe Cys
435 440 445

Asp Cys Asp Asp Arg Asp Cys Asp Lys His Asp Gly Leu Ile Cys Thr
450 455 460

Gly Asn Gly Ile Cys Ser Cys Gly Asn Cys Glu Cys Trp Asp Gly Trp
465 470 475 480

Asn Gly Asn Ala Cys Glu Ile Trp Leu Gly Ser Glu Tyr Pro
485 490 


<210> 31 

<211> 148 

<212> PRT 

<213> Homo sapiens 

<400> 31 

Met Glu Thr Gly Ala Leu Arg Arg Pro Gln Leu Leu Pro Leu Leu Leu
1 5 10 15

Leu Leu Cys Gly Gly Cys Pro Arg Ala Gly Gly Cys Asn Glu Thr Gly
20 25 30

Met Leu Glu Arg Leu Pro Leu Cys Gly Lys Ala Phe Ala Asp Met Met
35 40 45

Gly Lys Val Asp Val Trp Lys Trp Cys Asn Leu Ser Glu Phe Ile Val
50 55 60

Tyr Tyr Glu Ser Phe Thr Asn Cys Thr Glu Met Glu Ala Asn Val Val
65 70 75 80

Gly Cys Tyr Trp Pro Asn Pro Leu Ala Gln Gly Phe Ile Thr Gly Ile
85 90 95

His Arg Gln Phe Phe Ser Asn Cys Thr Val Asp Arg Val His Leu Glu
100 105 110

Asp Pro Pro Asp Glu Val Leu Ile Pro Leu Ile Val Ile Pro Val Val
115 120 125

Leu Thr Val Ala Met Ala Gly Leu Val Val Trp Arg Ser Lys Arg Thr
130 135 140

Asp Thr Leu Leu
145


<210> 32 

<211> 332 

<212> PRT 

<213> Homo sapiens 


<400> 32 

Met Arg Leu Leu Ala Phe Leu Ser Leu Leu Ala Leu Val Leu Gln Glu
1 5 10 15

Thr Gly Thr Ala Ser Leu Pro Arg Lys Glu Arg Lys Arg Arg Glu Glu
20 25 30

Gln Met Pro Arg Glu Gly Asp Ser Phe Glu Val Leu Pro Leu Arg Asn
35 40 45

Asp Val Leu Asn Pro Asp Asn Tyr Gly Glu Val Ile Asp Leu Ser Asn
50 55 60

Tyr Glu Glu Leu Thr Asp Tyr Gly Asp Gln Leu Pro Glu Val Lys Val
65 70 75 80

Thr Ser Leu Ala Pro Ala Thr Ser Ile Ser Pro Ala Lys Ser Thr Thr
85 90 95

Ala Pro Gly Thr Pro Ser Ser Asn Pro Thr Met Thr Arg Pro Thr Thr
100 105 110

Ala Gly Leu Leu Leu Ser Ser Gln Pro Asn His Gly Leu Pro Thr Cys
115 120 125

Leu Val Cys Val Cys Leu Gly Ser Ser Val Tyr Cys Asp Asp Ile Asp
130 135 140

Leu Glu Asp Ile Pro Pro Leu Pro Arg Arg Thr Ala Tyr Leu Tyr Ala
145 150 155 160

Arg Phe Asn Arg Ile Ser Arg Ile Arg Ala Glu Asp Phe Lys Gly Leu
165 170 175

Thr Lys Leu Lys Arg Ile Asp Leu Ser Asn Asn Leu Ile Ser Ser Ile
180 185 190

Asp Asn Asp Ala Phe Arg Leu Leu His Ala Leu Gln Asp Leu Ile Leu
195 200 205

Pro Glu Asn Gln Leu Glu Ala Leu Pro Val Leu Pro Ser Gly Ile Glu
210 215 220

Phe Leu Asp Val Arg Leu Asn Arg Leu Gln Ser Ser Gly Ile Gln Pro
225 230 235 240

Ala Ala Phe Arg Ala Met Glu Lys Leu Gln Phe Leu Tyr Leu Ser Asp
245 250 255

Asn Leu Leu Asp Ser Ile Pro Gly Pro Leu Pro Pro Ser Leu Arg Ser
260 265 270

Val His Leu Gln Asn Asn Leu Ile Glu Thr Met Gln Arg Asp Val Phe
275 280 285

Cys Asp Pro Glu Glu His Lys His Thr Arg Arg Gln Leu Glu Asp Ile
290 295 300

Arg Leu Asp Gly Asn Pro Ile Asn Leu Ser Leu Phe Pro Ser Ala Tyr
305 310 315 320

Phe Cys Leu Pro Arg Leu Pro Ile Gly Arg Phe Thr
325 330 


<210> 33 

<211> 639 

<212> PRT 

<213> Homo sapiens 

<400> 33 

Met Leu Leu Pro Leu Leu Leu Ser Ser Leu Leu Gly Gly Ser Gln Ala
1 5 10 15

Met Asp Gly Arg Phe Trp Ile Arg Val Gln Glu Ser Val Met Val Pro
20 25 30

Glu Ala Cys Asp Ile Ser Val Pro Cys Ser Phe Ser Tyr Pro Arg Gln
35 40 45

Asp Trp Thr Gly Ser Thr Pro Ala Tyr Gly Tyr Trp Phe Lys Ala Val
50 55 60

Thr Glu Thr Thr Lys Gly Ala Pro Val Ala Thr Asn His Gln Ser Arg
65 70 75 80

Glu Val Glu Met Ser Thr Arg Gly Arg Phe Gln Leu Thr Gly Asp Pro
85 90 95 


Ala Lys Gly Asn Cys Ser Leu Val Ile Arg Asp Ala Gln Met Gln Asp
100 105 110

Glu Ser Gln Tyr Phe Phe Arg Val Glu Arg Gly Ser Tyr Val Arg Tyr
115 120 125

Asn Phe Met Asn Asp Gly Phe Phe Leu Lys Val Thr Val Leu Ser Phe
130 135 140

Thr Pro Arg Pro Gln Asp His Asn Thr Asp Leu Thr Cys His Val Asp
145 150 155 160

Phe Ser Arg Lys Gly Val Ser Ala Gln Arg Thr Val Arg Leu Arg Val
165 170 175

Ala Tyr Ala Pro Arg Asp Leu Val Ile Ser Ile Ser Arg Asp Asn Thr
180 185 190

Pro Ala Leu Glu Pro Gln Pro Gln Gly Asn Val Pro Tyr Leu Glu Ala
195 200 205

Gln Lys Gly Gln Phe Leu Arg Leu Leu Cys Ala Ala Asp Ser Gln Pro
210 215 220

Pro Ala Thr Leu Ser Trp Val Leu Gln Asn Arg Val Leu Ser Ser Ser
225 230 235 240

His Pro Trp Gly Pro Arg Pro Leu Gly Leu Glu Leu Pro Gly Val Lys
245 250 255

Ala Gly Asp Ser Gly Arg Tyr Thr Cys Arg Ala Glu Asn Arg Leu Gly
260 265 270

Ser Gln Gln Arg Ala Leu Asp Leu Ser Val Gln Tyr Pro Pro Glu Asn
275 280 285

Leu Arg Val Met Val Ser Gln Ala Asn Arg Thr Val Leu Glu Asn Leu
290 295 300

Gly Asn Gly Thr Ser Leu Pro Val Leu Glu Gly Gln Ser Leu Cys Leu
305 310 315 320

Val Cys Val Thr His Ser Ser Pro Pro Ala Arg Leu Ser Trp Thr Gln
325 330 335

Arg Gly Gln Val Leu Ser Pro Ser Gln Pro Ser Asp Pro Gly Val Leu
340 345 350

Glu Leu Pro Arg Val Gln Val Glu His Glu Gly Glu Phe Thr Cys His
355 360 365

Ala Arg His Pro Leu Gly Ser Gln His Val Ser Leu Ser Leu Ser Val
370 375 380

His Tyr Ser Pro Lys Leu Leu Gly Pro Ser Cys Ser Trp Glu Ala Glu
385 390 395 400

Gly Leu His Cys Ser Cys Ser Ser Gln Ala Ser Pro Ala Pro Ser Leu
405 410 415 

Arg Trp Trp Leu Gly Glu Glu Leu Leu Glu Gly Asn Ser Ser Gln Asp
420 425 430

Ser Phe Glu Val Thr Pro Ser Ser Ala Gly Pro Trp Ala Asn Ser Ser
435 440 445

Leu Ser Leu His Gly Gly Leu Ser Ser Gly Leu Arg Leu Arg Cys Glu
450 455 460

Ala Trp Asn Val His Gly Ala Gln Ser Gly Ser Ile Leu Gln Leu Pro
465 470 475 480

Asp Lys Lys Gly Leu Ile Ser Thr Ala Phe Ser Asn Gly Ala Phe Leu
485 490 495

Gly Ile Gly Ile Thr Ala Leu Leu Phe Leu Cys Leu Ala Leu Ile Ile
500 505 510

Met Lys Ile Leu Pro Lys Arg Arg Thr Gln Thr Glu Thr Pro Arg Pro
515 520 525

Arg Phe Ser Arg His Ser Thr Ile Leu Asp Tyr Ile Asn Val Val Pro
530 535 540

Thr Ala Gly Pro Leu Ala Gln Lys Arg Asn Gln Lys Ala Thr Pro Asn
545 550 555 560

Ser Pro Arg Thr Pro Leu Pro Pro Gly Ala Pro Ser Pro Glu Ser Lys
565 570 575

Lys Asn Gln Lys Lys Gln Tyr Gln Leu Pro Ser Phe Pro Glu Pro Lys
580 585 590

Ser Ser Thr Gln Ala Pro Glu Ser Gln Glu Ser Gln Glu Glu Leu His
595 600 605

Tyr Ala Thr Leu Asn Phe Pro Gly Val Arg Pro Arg Pro Glu Ala Arg
610 615 620

Met Pro Lys Gly Thr Gln Ala Asp Tyr Ala Glu Val Lys Phe Gln
625 630 635


<210> 34 

<211> 467 

<212> PRT 

<213> Homo sapiens 

<400> 34 

Met Leu Leu Leu Leu Leu Leu Pro Leu Leu Trp Gly Arg Glu Arg Val
1 5 10 15

Glu Gly Gln Lys Ser Asn Arg Lys Asp Tyr Ser Leu Thr Met Gln Ser
20 25 30

Ser Val Thr Val Gln Glu Gly Met Cys Val His Val Arg Cys Ser Phe
35 40 45

Ser Tyr Pro Val Asp Ser Gln Thr Asp Ser Asp Pro Val His Gly Tyr
50 55 60 

Trp Phe Arg Ala Gly Asn Asp Ile Ser Trp Lys Ala Pro Val Ala Thr
65 70 75 80

Asn Asn Pro Ala Trp Ala Val Gln Glu Glu Thr Arg Asp Arg Phe His
85 90 95

Leu Leu Gly Asp Pro Gln Thr Lys Asn Cys Thr Leu Ser Ile Arg Asp
100 105 110

Ala Arg Met Ser Asp Ala Gly Arg Tyr Phe Phe Arg Met Glu Lys Gly
115 120 125

Asn Ile Lys Trp Asn Tyr Lys Tyr Asp Gln Leu Ser Val Asn Val Thr
130 135 140

Ala Leu Thr His Arg Pro Asn Ile Leu Ile Pro Gly Thr Leu Glu Ser
145 150 155 160

Gly Cys Phe Gln Asn Leu Thr Cys Ser Val Pro Trp Ala Cys Glu Gln
165 170 175

Gly Thr Pro Pro Met Ile Ser Trp Met Gly Thr Ser Val Ser Pro Leu
180 185 190

His Pro Ser Thr Thr Arg Ser Ser Val Leu Thr Leu Ile Pro Gln Pro
195 200 205

Gln His His Gly Thr Ser Leu Thr Cys Gln Val Thr Leu Pro Gly Ala
210 215 220

Gly Val Thr Thr Asn Arg Thr Ile Gln Leu Asn Val Ser Tyr Pro Pro
225 230 235 240

Gln Asn Leu Thr Val Thr Val Phe Gln Gly Glu Gly Thr Ala Ser Thr
245 250 255

Ala Leu Gly Asn Ser Ser Ser Leu Ser Val Leu Glu Gly Gln Ser Leu
260 265 270 

Arg Leu Val Cys Ala Val Asp Ser Asn Pro Pro Ala Arg Leu Ser Trp 
275 280 285 

Thr Trp Arg Ser Leu Thr Leu Tyr Pro Ser Gln Pro Ser Asn Pro Leu
290 295 300

Val Leu Glu Leu Gln Val His Leu Gly Asp Glu Gly Glu Phe Thr Cys
305 310 315 320

Arg Ala Gln Asn Ser Leu Gly Ser Gln His Val Ser Leu Asn Leu Ser
325 330 335

Leu Gln Gln Glu Tyr Thr Gly Lys Met Arg Pro Val Ser Gly Val Leu
340 345 350 

Leu Gly Ala Val Gly Gly Ala Gly Ala Thr Ala Leu Val Phe Leu Ser
355 360 365

Phe Cys Val Ile Phe Ile Val Val Arg Ser Cys Arg Lys Lys Ser Ala
370 375 380

Arg Pro Ala Ala Asp Val Gly Asp Ile Gly Met Lys Asp Ala Asn Thr
385 390 395 400

Ile Arg Gly Ser Ala Ser Gln Gly Asn Leu Thr Glu Ser Trp Ala Asp
405 410 415

Asp Asn Pro Arg His His Gly Leu Ala Ala His Ser Ser Gly Glu Glu
420 425 430

Arg Glu Ile Gln Tyr Ala Pro Leu Ser Phe His Lys Gly Glu Pro Gln
435 440 445

Asp Leu Ser Gly Gln Glu Ala Thr Asn Asn Glu Tyr Ser Glu Ile Lys
450 455 460

Ile Pro Lys
465


<210> 35 

<211> 1189 

<212> PRT 

<213> Homo sapiens 

<400> 35 

Met Asp Leu Pro Arg Gly Leu Val Val Ala Trp Ala Leu Ser Leu Trp 
1 5 10 15

Pro Gly Phe Thr Asp Thr Phe Asn Met Asp Thr Arg Lys Pro Arg Val 
20 25 30 

Ile Pro Gly Ser Arg Thr Ala Phe Phe Gly Tyr Thr Val Gln Gln His
35 40 45

Asp Ile Ser Gly Asn Lys Trp Leu Val Val Gly Ala Pro Leu Glu Thr
50 55 60

Asn Gly Tyr Gln Lys Thr Gly Asp Val Tyr Lys Cys Pro Val Ile His
65 70 75 80 

Gly Asn Cys Thr Lys Leu Asn Leu Gly Arg Val Thr Leu Ser Asn Val 
85 90 95 

Ser Glu Arg Lys Asp Asn Met Arg Leu Gly Leu Ser Leu Ala Thr Asn 
100 105 110 

Pro Lys Asp Asn Ser Phe Leu Ala Cys Ser Pro Leu Trp Ser His Glu 
115 120 125 


Cys Gly Ser Ser Tyr Tyr Thr Thr Gly Met Cys Ser Arg Val Asn Ser 
130 135 140 

Asn Phe Arg Phe Ser Lys Thr Val Ala Pro Ala Leu Gln Arg Cys Gln
145 150 155 160

Thr Tyr Met Asp Ile Val Ile Val Leu Asp Gly Ser Asn Ser Ile Tyr
165 170 175

Pro Trp Val Glu Val Gln His Phe Leu Ile Asn Ile Leu Lys Lys Phe
180 185 190

Tyr Ile Gly Pro Gly Gln Ile Gln Val Gly Val Val Gln Tyr Gly Glu
195 200 205 

Asp Val Val His Glu Phe His Leu Asn Asp Tyr Arg Ser Val Lys Asp 
210 215 220 

Val Val Glu Ala Ala Ser His Ile Glu Gln Arg Gly Gly Thr Glu Thr
225 230 235 240

Arg Thr Ala Phe Gly Ile Glu Phe Ala Arg Ser Glu Ala Phe Gln Lys
245 250 255

Gly Gly Arg Lys Gly Ala Lys Lys Val Met Ile Val Ile Thr Asp Gly
260 265 270

Glu Ser His Asp Ser Pro Asp Leu Glu Lys Val Ile Gln Gln Ser Glu
275 280 285 

Arg Asp Asn Val Thr Arg Tyr Ala Val Ala Val Leu Gly Tyr Tyr Asn 
290 295 300 

Arg Arg Gly Ile Asn Pro Glu Thr Phe Leu Asn Glu Ile Lys Tyr Ile
305 310 315 320 

Ala Ser Asp Pro Asp Asp Lys His Phe Phe Asn Val Thr Asp Glu Ala 
325 330 335 

Ala Leu Lys Asp Ile Val Asp Ala Leu Gly Asp Arg Ile Phe Ser Leu
340 345 350

Glu Gly Thr Asn Lys Asn Glu Thr Ser Phe Gly Leu Glu Met Ser Gln
355 360 365 

Thr Gly Phe Ser Ser His Val Val Glu Asp Gly Val Leu Leu Gly Ala 
370 375 380 

Val Gly Ala Tyr Asp Trp Asn Gly Ala Val Leu Lys Glu Thr Ser Ala 
385 390 395 400 

Gly Lys Val Ile Pro Leu Arg Glu Ser Tyr Leu Lys Glu Phe Pro Glu
405 410 415 

Glu Leu Lys Asn His Gly Ala Tyr Leu Gly Tyr Thr Val Thr Ser Val 
420 425 430 

Val Ser Ser Arg Gln Gly Arg Val Tyr Val Ala Gly Ala Pro Arg Phe
435 440 445

Asn His Thr Gly Lys Val Ile Leu Phe Thr Met His Asn Asn Arg Ser
450 455 460

Leu Thr Ile His Gln Ala Met Arg Gly Gln Gln Ile Gly Ser Tyr Phe
465 470 475 480

Gly Ser Glu Ile Thr Ser Val Asp Ile Asp Gly Asp Gly Val Thr Asp
485 490 495 

Val Leu Leu Val Gly Ala Pro Met Tyr Phe Asn Glu Gly Arg Glu Arg 
500 505 510 

Gly Lys Val Tyr Val Tyr Glu Leu Arg Gln Asn Arg Phe Val Tyr Asn
515 520 525

Gly Thr Leu Lys Asp Ser His Ser Tyr Gln Asn Ala Arg Phe Gly Ser
530 535 540

Ser Ile Ala Ser Val Arg Asp Leu Asn Gln Asp Ser Tyr Asn Asp Val
545 550 555 560

Val Val Gly Ala Pro Leu Glu Asp Asn His Ala Gly Ala Ile Tyr Ile
565 570 575

Phe His Gly Phe Arg Gly Ser Ile Leu Lys Thr Pro Lys Gln Arg Ile
580 585 590

Thr Ala Ser Glu Leu Ala Thr Gly Leu Gln Tyr Phe Gly Cys Ser Ile
595 600 605

His Gly Gln Leu Asp Leu Asn Glu Asp Gly Leu Ile Asp Leu Ala Val
610 615 620

Gly Ala Leu Gly Asn Ala Val Ile Leu Trp Ser Arg Pro Val Val Gln
625 630 635 640

Ile Asn Ala Ser Leu His Phe Glu Pro Ser Lys Ile Asn Ile Phe His
645 650 655 

Arg Asp Cys Lys Arg Ser Gly Arg Asp Ala Thr Cys Leu Ala Ala Phe 
660 665 670 

Leu Cys Phe Thr Pro Ile Phe Leu Ala Pro His Phe Gln Thr Thr Thr
675 680 685

Val Gly Ile Arg Tyr Asn Ala Thr Met Asp Glu Arg Arg Tyr Thr Pro
690 695 700 

Arg Ala His Leu Asp Glu Gly Gly Asp Arg Phe Thr Asn Arg Ala Val 
705 710 715 720 

Leu Leu Ser Ser Gly Gln Glu Leu Cys Glu Arg Ile Asn Phe His Val
725 730 735 

Leu Asp Thr Ala Asp Tyr Val Lys Pro Val Thr Phe Ser Val Glu Tyr 
740 745 750 


Ser Leu Glu Asp Pro Asp His Gly Pro Met Leu Asp Asp Gly Trp Pro 
755 760 765 

Thr Thr Leu Arg Val Ser Val Pro Phe Trp Asn Gly Cys Asn Glu Asp
770 775 780 

Glu His Cys Val Pro Asp Leu Val Leu Asp Ala Arg Ser Asp Leu Pro 
785 790 795 800 

Thr Ala Met Glu Tyr Cys Gln Arg Val Leu Arg Lys Pro Ala Gln Asp
805 810 815

Cys Ser Ala Tyr Thr Leu Ser Phe Asp Thr Thr Val Phe Ile Ile Glu
820 825 830

Ser Thr Arg Gln Arg Val Ala Val Glu Ala Thr Leu Glu Asn Arg Gly
835 840 845

Glu Asn Ala Tyr Ser Thr Val Leu Asn Ile Ser Gln Ser Ala Asn Leu
850 855 860

Gln Phe Ala Ser Leu Ile Gln Lys Glu Asp Ser Asp Gly Ser Ile Glu
865 870 875 880

Cys Val Asn Glu Glu Arg Arg Leu Gln Lys Gln Val Cys Asn Val Ser
885 890 895 

Tyr Pro Phe Phe Arg Ala Lys Ala Lys Val Ala Phe Arg Leu Asp Phe 
900 905 910 

Glu Phe Ser Lys Ser Ile Phe Leu His His Leu Glu Ile Glu Leu Ala
915 920 925 

Ala Gly Ser Asp Ser Asn Glu Arg Asp Ser Thr Lys Glu Asp Asn Val 
930 935 940 

Ala Pro Leu Arg Phe His Leu Lys Tyr Glu Ala Asp Val Leu Phe Thr 
945 950 955 960 

Arg Ser Ser Ser Leu Ser His Tyr Glu Val Lys Leu Asn Ser Ser Leu 
965 970 975 

Glu Arg Tyr Asp Gly Ile Gly Pro Pro Phe Ser Cys Ile Phe Arg Ile
980 985 990

Gln Asn Leu Gly Leu Phe Pro Ile His Gly Ile Met Met Lys Ile Thr
995 1000 1005

Ile Pro Ile Ala Thr Arg Ser Gly Asn Arg Leu Leu Lys Leu Arg Asp
1010 1015 1020

Phe Leu Thr Asp Glu Val Ala Asn Thr Ser Cys Asn Ile Trp Gly Asn
1030 1035 1040 

Ser Thr Glu Tyr Arg Pro Thr Pro Val Glu Glu Asp Leu Arg Arg Ala 
1045 1050 1055 

Pro Gln Leu Asn His Ser Asn Ser Asp Val Val Ser Ile Asn Cys Asn
1060 1065 1070

Ile Arg Leu Val Pro Asn Gln Glu Ile Asn Phe His Leu Leu Gly Asn
1075 1080 1085

Leu Trp Leu Arg Ser Leu Lys Ala Leu Lys Tyr Lys Ser Met Lys Ile
1090 1095 1100

Met Val Asn Ala Ala Leu Gln Arg Gln Phe His Ser Pro Phe Ile Phe
1110 1115 1120

Arg Glu Glu Asp Pro Ser Arg Gln Ile Val Phe Glu Ile Ser Lys Gln
1125 1130 1135

Glu Asp Trp Gln Val Pro Ile Trp Ile Ile Val Gly Ser Thr Leu Gly
1140 1145 1150

Gly Leu Leu Leu Leu Ala Leu Leu Val Leu Ala Leu Trp Lys Leu Gly
1155 1160 1165

Phe Phe Arg Ser Ala Arg Arg Arg Arg Glu Pro Gly Leu Asp Pro Thr
1170 1175 1180

Pro Lys Val Leu Glu


<210> 36 

<211> 196 

<212> PRT 

<213> Homo sapiens 


<400> 36 
Met Ser Arg Arg Ser Met Leu Leu Ala Trp Ala Leu Pro Ser Leu Leu
1 5 10 15

Arg Leu Gly Ala Ala Gln Glu Thr Glu Asp Pro Ala Cys Cys Ser Pro
20 25 30

Ile Val Pro Arg Asn Glu Trp Lys Ala Leu Ala Ser Glu Cys Ala Gln
35 40 45

His Leu Ser Leu Pro Leu Arg Tyr Val Val Val Ser His Thr Ala Gly
50 55 60

Ser Ser Cys Asn Thr Pro Ala Ser Cys Gln Gln Gln Ala Arg Asn Val
65 70 75 80

Gln His Tyr His Met Lys Thr Leu Gly Trp Cys Asp Val Gly Tyr Asn
85 90 95

Phe Leu Ile Gly Glu Asp Gly Leu Val Tyr Glu Gly Arg Gly Trp Asn
100 105 110

Phe Thr Gly Ala His Ser Gly His Leu Trp Asn Pro Met Ser Ile Gly
115 120 125

Ile Ser Phe Met Gly Asn Tyr Met Asp Arg Val Pro Thr Pro Gln Ala
130 135 140

Ile Arg Ala Ala Gln Gly Leu Leu Ala Cys Gly Val Ala Gln Gly Ala
145 150 155 160

Leu Arg Ser Asn Tyr Val Leu Lys Gly His Arg Asp Val Gln Arg Thr
165 170 175

Leu Ser Pro Gly Asn Gln Leu Tyr His Leu Ile Gln Asn Trp Pro His
180 185 190

Tyr Arg Ser Pro
195


<210> 37 

<211> 319 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (319)

<223> Xaa equals stop translation 
<400> 37 

Met Ala Leu Met Leu Ser Leu Val Leu Ser Leu Leu Lys Leu Gly Ser 
1 5 10 15

Gly Gln Trp Gln Val Phe Gly Pro Asp Lys Pro Val Gln Ala Leu Val
20 25 30

Gly Glu Asp Ala Ala Phe Ser Cys Phe Leu Ser Pro Lys Thr Asn Ala
35 40 45

Glu Ala Met Glu Val Arg Phe Phe Arg Gly Gln Phe Ser Ser Val Val
50 55 60

His Leu Tyr Arg Asp Gly Lys Asp Gln Pro Phe Met Gln Met Pro Gln
65 70 75 80

Tyr Gln Gly Arg Thr Lys Leu Val Lys Asp Ser Ile Ala Glu Gly Arg
85 90 95

Ile Ser Leu Arg Leu Glu Asn Ile Thr Val Leu Asp Ala Gly Leu Tyr
100 105 110

Gly Cys Arg Ile Ser Ser Gln Ser Tyr Tyr Gln Lys Ala Ile Trp Glu
115 120 125

Leu Gln Val Ser Ala Leu Gly Ser Val Pro Leu Ile Ser Ile Ala Gly
130 135 140

Tyr Val Asp Arg Asp Ile Gln Leu Leu Cys Gln Ser Ser Gly Trp Phe
145 150 155 160

Pro Arg Pro Thr Ala Lys Trp Lys Gly Pro Gln Gly Gln Asp Leu Ser
165 170 175


Thr Asp Ser Arg Thr Asn Arg Asp Met His Gly Leu Phe Asp Val Glu
180 185 190

Ile Ser Leu Thr Val Gln Glu Asn Ala Gly Ser Ile Ser Cys Ser Met
195 200 205

Arg His Ala His Leu Ser Arg Glu Val Glu Ser Arg Val Gln Ile Gly
210 215 220

Asp Trp Arg Arg Lys His Gly Gln Ala Gly Lys Arg Lys Tyr Ser Ser
225 230 235 240

Ser His Ile Tyr Asp Ser Phe Pro Ser Leu Ser Phe Met Asp Phe Tyr
245 250 255

Ile Leu Arg Pro Val Gly Pro Cys Arg Ala Lys Leu Val Met Gly Thr
260 265 270

Leu Lys Leu Gln Ile Leu Gly Glu Val His Phe Val Glu Lys Pro His
275 280 285

Ser Leu Leu Gln Ile Ser Gly Gly Ser Thr Thr Leu Lys Lys Gly Pro
290 295 300

Asn Pro Trp Ser Phe Pro Ser Pro Cys Ala Leu Phe Pro Thr Xaa
305 310 315


<210> 38 

<211> 375 

<212> PRT 

<213> Homo sapiens 

<400> 38 

Met Ala Gly Ile Pro Gly Leu Leu Phe Leu Leu Phe Phe Leu Leu Cys
1 5 10 15

Ala Val Gly Gln Val Ser Pro Tyr Ser Ala Pro Trp Lys Pro Thr Trp
20 25 30

Pro Ala Tyr Arg Leu Pro Val Val Leu Pro Gln Ser Thr Leu Asn Leu
35 40 45

Ala Lys Pro Asp Phe Gly Ala Glu Ala Lys Leu Glu Val Ser Ser Ser
50 55 60

Cys Gly Pro Gln Cys His Lys Gly Thr Pro Leu Pro Thr Tyr Glu Glu
65 70 75 80

Ala Lys Gln Tyr Leu Ser Tyr Glu Thr Leu Tyr Ala Asn Gly Ser Arg
85 90 95

Thr Glu Thr Gln Val Gly Ile Tyr Ile Leu Ser Ser Ser Gly Asp Gly
100 105 110

Ala Gln His Arg Asp Ser Gly Ser Ser Gly Lys Ser Arg Arg Lys Arg
115 120 125

Gln Ile Tyr Gly Tyr Asp Ser Arg Phe Ser Ile Phe Gly Lys Asp Phe
130 135 140


Leu Leu Asn Tyr Pro Phe Ser Thr Ser Val Lys Leu Ser Thr Gly Cys 
145 150 155 160 

Thr Gly Thr Leu Val Ala Glu Lys His Val Leu Thr Ala Ala His Cys 
165 170 175 

Ile His Asp Gly Lys Thr Tyr Val Lys Gly Thr Gln Lys Leu Arg Val
180 185 190

Gly Phe Leu Lys Pro Lys Phe Lys Asp Gly Gly Arg Gly Ala Asn Asp
195 200 205

Ser Thr Ser Ala Met Pro Glu Gln Met Lys Phe Gln Trp Ile Arg Val
210 215 220

Lys Arg Thr His Val Pro Lys Gly Trp Ile Lys Gly Asn Ala Asn Asp
225 230 235 240

Ile Gly Met Asp Tyr Asp Tyr Ala Leu Leu Glu Leu Lys Lys Pro His
245 250 255

Lys Arg Lys Phe Met Lys Ile Gly Val Ser Pro Pro Ala Lys Gln Leu
260 265 270

Pro Gly Gly Arg Ile His Phe Ser Gly Tyr Asp Asn Asp Arg Pro Gly
275 280 285

Asn Leu Val Tyr Arg Phe Cys Asp Val Lys Asp Glu Thr Tyr Asp Leu
290 295 300

Leu Tyr Gln Gln Cys Asp Ser Gln Pro Gly Ala Ser Gly Ser Gly Val
305 310 315 320

Tyr Val Arg Met Trp Lys Arg Gln His Gln Lys Trp Glu Arg Lys Ile
325 330 335

Ile Gly Met Ile Ser Gly His Gln Trp Val Asp Met Asp Gly Ser Pro
340 345 350

Gln Glu Phe Thr Arg Gly Cys Ser Glu Ile Thr Pro Leu Gln Tyr Ile
355 360 365

Pro Asp Ile Ser Ile Gly Val
370 375 


<210> 39 

<211> 364 

<212> PRT 

<213> Homo sapiens 


<400> 39

Met Ala Ser Val Val Leu Pro Ser Gly Ser Gln Cys Ala Ala Ala Ala
1 5 10 15

Ala Ala Ala Ala Pro Pro Gly Leu Arg Leu Arg Leu Leu Leu Leu Leu
20 25 30

Phe Ser Ala Ala Ala Leu Ile Pro Thr Gly Asp Gly Gln Asn Leu Phe
35 40 45

Thr Lys Asp Val Thr Val Ile Glu Gly Glu Val Ala Thr Ile Ser Cys
50 55 60

Gln Val Asn Lys Ser Asp Asp Ser Val Ile Gln Leu Leu Asn Pro Asn
65 70 75 80

Arg Gln Thr Ile Tyr Phe Arg Asp Phe Arg Pro Leu Lys Asp Ser Arg
85 90 95

Phe Gln Leu Leu Asn Phe Ser Ser Ser Glu Leu Lys Val Ser Leu Thr
100 105 110

Asn Val Ser Ile Ser Asp Glu Gly Arg Tyr Phe Cys Gln Leu Tyr Thr
115 120 125

Asp Pro Pro Gln Glu Ser Tyr Thr Thr Ile Thr Val Leu Val Pro Pro
130 135 140

Arg Asn Leu Met Ile Asp Ile Gln Lys Asp Thr Ala Val Glu Gly Glu
145 150 155 160

Glu Ile Glu Val Asn Cys Thr Ala Met Ala Ser Lys Pro Ala Thr Thr
165 170 175

Ile Arg Trp Phe Lys Gly Asn Thr Glu Leu Lys Gly Lys Ser Glu Val
180 185 190

Glu Glu Trp Ser Asp Met Tyr Thr Val Thr Ser Gln Leu Met Leu Lys
195 200 205

Val His Lys Glu Asp Asp Gly Val Pro Val Ile Cys Gln Val Glu His
210 215 220

Pro Ala Val Thr Gly Asn Leu Gln Thr Gln Arg Tyr Leu Glu Val Gln
225 230 235 240

Tyr Lys Pro Gln Val His Ile Gln Met Thr Tyr Pro Leu Gln Gly Leu
245 250 255

Thr Arg Glu Gly Asp Ala Leu Glu Leu Thr Cys Glu Ala Ile Gly Lys
260 265 270

Pro Gln Pro Val Met Val Thr Trp Val Arg Val Asp Asp Glu Met Pro
275 280 285

Gln His Ala Val Leu Ser Gly Pro Asn Leu Phe Ile Asn Asn Leu Asn
290 295 300

Lys Thr Asp Asn Gly Thr Tyr Arg Cys Glu Ala Ser Asn Ile Val Gly
305 310 315 320 


Lys Ala His Ser Asp Tyr Met Leu Tyr Val Tyr Asp Pro Pro Thr Thr
325 330 335

Ile Pro Pro Pro Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr
340 345 350

Thr Ile Leu Thr Ile Ile Thr Asp Ser Arg Ala Arg
355 360


<210> 40 

<211> 66 

<212> PRT 

<213> Homo sapiens 

<400> 40 

Met Ser Ser Ser Ser Leu Lys His Leu Leu Cys Met Ala Leu Ser Trp 
1 5 10 15

Phe Ser Ser Phe Ile Ser Gly Glu Thr Ser Phe Ser Leu Leu Asn Ser
20 25 30

Phe Phe Leu Pro Tyr Pro Ser Ser Arg Cys Cys Cys Phe Ser Val Gln
35 40 45

Cys Ser Ile Leu Asp Pro Phe Ser Cys Asn Ser Met Arg Phe Pro Trp
50 55 60 

Glu Asn 
65 


<210> 41 

<211> 469 

<212> PRT 

<213> Homo sapiens 

<400> 41 

Met Arg Pro Pro Gly Phe Arg Asn Phe Leu Leu Leu Ala Ser Ser Leu 
1 5 10 15 

Leu Phe Ala Gly Leu Ser Ala Val Pro Gln Ser Phe Ser Pro Ser Leu
20 25 30

Arg Ser Trp Pro Gly Ala Ala Cys Arg Leu Ser Arg Ala Glu Ser Glu
35 40 45

Arg Arg Cys Arg Ala Pro Gly Gln Pro Pro Gly Ala Ala Leu Cys His
50 55 60

Gly Arg Gly Arg Cys Asp Cys Gly Val Cys Ile Cys His Val Thr Glu
65 70 75 80

Pro Gly Met Phe Phe Gly Pro Leu Cys Glu Cys His Glu Trp Val Cys 
85 90 95 

Glu Thr Tyr Asp Gly Ser Thr Cys Ala Gly His Gly Lys Cys Asp Cys 
100 105 110 


Gly Lys Cys Lys Cys Asp Gln Gly Trp Tyr Gly Asp Ala Cys Gln Tyr
115 120 125

Pro Thr Asn Cys Asp Leu Thr Lys Lys Lys Ser Asn Gln Met Cys Lys
130 135 140

Asn Ser Gln Asp Ile Ile Cys Ser Asn Ala Gly Thr Cys His Cys Gly
145 150 155 160

Arg Cys Lys Cys Asp Asn Ser Asp Gly Ser Gly Leu Val Tyr Gly Lys
165 170 175

Phe Cys Glu Cys Asp Asp Arg Glu Cys Ile Asp Asp Glu Thr Glu Glu
180 185 190

Ile Cys Gly Gly His Gly Lys Cys Tyr Cys Gly Asn Cys Tyr Cys Lys
195 200 205

Ala Gly Trp His Gly Asp Lys Cys Glu Phe Gln Cys Asp Ile Thr Pro
210 215 220

Trp Glu Ser Lys Arg Arg Cys Thr Ser Pro Asp Gly Lys Ile Cys Ser
225 230 235 240

Ser Arg Gly Thr Cys Val Cys Gly Glu Cys Thr Cys His Asp Val Asp
245 250 255

Pro Thr Gly Asp Trp Gly Asp Ile His Gly Asp Thr Cys Glu Cys Asp
260 265 270

Glu Arg Asp Cys Arg Ala Val Tyr Asp Arg Tyr Ser Asp Asp Phe Cys
275 280 285

Ser Gly His Gly Gln Cys Asn Cys Gly Arg Cys Asp Cys Lys Ala Gly
290 295 300

Trp Tyr Gly Lys Lys Cys Glu His Pro Gln Ser Cys Thr Leu Ser Ala
305 310 315 320

Glu Glu Ser Ile Arg Lys Cys Gln Gly Ser Ser Asp Leu Pro Cys Ser
325 330 335

Gly Arg Gly Lys Cys Glu Cys Gly Lys Cys Thr Cys Tyr Pro Pro Gly
340 345 350

Asp Arg Arg Val Tyr Gly Lys Thr Cys Glu Cys Asp Asp Arg Arg Cys
355 360 365

Glu Asp Leu Asp Gly Val Val Cys Gly Gly His Gly Thr Cys Ser Cys
370 375 380

Gly Arg Cys Val Cys Glu Arg Gly Trp Phe Gly Lys Leu Cys Gln His
385 390 395 400

Pro Arg Lys Cys Asn Met Thr Glu Glu Gln Ser Lys Asn Leu Cys Glu
405 410 415

Ser Ala Asp Gly Ile Leu Cys Ser Gly Lys Gly Ser Cys His Cys Gly
420 425 430

Lys Cys Ile Cys Ser Ala Glu Glu Trp Tyr Ile Ser Gly Glu Phe Cys
435 440 445

Asp Cys Asp Asp Arg Asp Cys Asp Lys His Asp Gly Leu Ile Cys Thr
450 455 460 

Arg Glu Trp Asn Met 
465 


<210> 42 

<211> 127 

<212> PRT 

<213> Homo sapiens 

<400> 42 

Met Leu Leu Pro Leu Leu Leu Ser Ser Leu Leu Gly Gly Ser Gln Ala
1 5 10 15

Met Asp Gly Arg Phe Trp Ile Arg Val Gln Glu Ser Val Met Val Pro
20 25 30

Glu Gly Leu Cys Ile Ser Val Pro Cys Ser Phe Ser Tyr Pro Arg Gln
35 40 45

Asp Trp Thr Gly Ser Thr Pro Ala Tyr Gly Tyr Trp Phe Lys Ala Val
50 55 60

Thr Glu Thr Thr Lys Gly Ala Pro Val Ala Thr Asn His Gln Ser Arg
65 70 75 80

Glu Val Glu Met Ser Thr Arg Gly Arg Phe Pro Gly Ser Leu Gly Asp
85 90 95

Pro Ala Lys Gly Asn Cys Ser Leu Val Ile Arg Arg Arg Ala Asp Ala
100 105 110 

Arg Met Ser His Ser Thr Ser Phe Gly Trp Arg Glu Glu Ala Met 
115 120 125 


<210> 43 

<211> 1034 

<212> PRT 

<213> Homo sapiens 

<400> 43 

Met Asp Leu Pro Arg Gly Leu Val Val Ala Trp Ala Leu Ser Leu Trp 
1 5 10 15 

Pro Gly Phe Thr Asp Thr Phe Asn Met Asp Thr Arg Lys Pro Arg Val 
20 25 30 

Ile Pro Gly Ser Arg Thr Ala Phe Phe Gly Tyr Thr Val Gln Gln His
35 40 45

Asp Ile Ser Gly Asn Lys Trp Leu Val Val Gly Ala Pro Leu Glu Thr
50 55 60

Asn Gly Tyr Gln Lys Thr Gly Asp Val Tyr Lys Cys Pro Val Ile His
65 70 75 80

Gly Asn Cys Thr Lys Leu Asn Leu Gly Arg Val Thr Leu Ser Asn Val
85 90 95

Ser Glu Arg Lys Asp Asn Met Arg Leu Gly Leu Ser Leu Ala Thr Asn
100 105 110

Pro Lys Asp Asn Ser Phe Leu Ala Cys Ser Pro Leu Trp Ser His Glu
115 120 125

Cys Gly Ser Ser Tyr Tyr Thr Thr Gly Met Cys Ser Arg Val Asn Ser
130 135 140

Asn Phe Arg Phe Ser Lys Thr Val Ala Pro Ala Leu Gln Arg Cys Gln
145 150 155 160

Thr Tyr Met Asp Ile Val Ile Val Leu Asp Gly Ser Asn Ser Ile Tyr
165 170 175

Pro Trp Val Glu Val Gln His Phe Leu Ile Asn Ile Leu Lys Lys Phe
180 185 190

Tyr Ile Gly Pro Gly Gln Ile Gln Val Gly Val Val Gln Tyr Gly Glu
195 200 205

Asp Val Val His Glu Phe His Leu Asn Asp Tyr Arg Ser Val Lys Asp
210 215 220

Val Val Glu Ala Ala Ser His Ile Glu Gln Arg Gly Gly Thr Glu Thr
225 230 235 240

Arg Thr Ala Phe Gly Ile Glu Phe Ala Arg Ser Glu Ala Phe Gln Lys
245 250 255

Gly Gly Arg Lys Gly Ala Lys Lys Val Met Ile Val Ile Thr Asp Gly
260 265 270

Glu Ser His Asp Ser Pro Asp Leu Glu Lys Val Ile Gln Gln Ser Glu
275 280 285

Arg Asp Asn Val Thr Arg Tyr Ala Val Ala Val Leu Gly Tyr Tyr Asn
290 295 300

Arg Arg Gly Ile Asn Pro Glu Thr Phe Leu Asn Glu Ile Lys Tyr Ile
305 310 315 320

Ala Ser Asp Pro Asp Asp Lys His Phe Phe Asn Val Thr Asp Glu Ala
325 330 335

Ala Leu Lys Asp Ile Val Asp Ala Leu Gly Asp Arg Ile Phe Ser Leu
340 345 350

Glu Gly Thr Asn Lys Asn Glu Thr Ser Phe Gly Leu Glu Met Ser Gln
355 360 365

Thr Gly Phe Ser Ser His Val Val Glu Asp Gly Val Leu Leu Gly Ala
370 375 380 

Val Gly Ala Tyr Asp Trp Asn Gly Ala Val Leu Lys Glu Thr Ser Ala 
385 390 395 400 

Gly Lys Val Ile Pro Leu Arg Glu Ser Tyr Leu Lys Glu Phe Pro Glu
405 410 415

Glu Leu Lys Asn His Gly Ala Tyr Leu Gly Tyr Thr Val Thr Ser Val
420 425 430

Val Ser Ser Arg Gln Gly Arg Val Tyr Val Ala Gly Ala Pro Arg Phe
435 440 445

Asn His Thr Gly Lys Val Ile Leu Phe Thr Met His Asn Asn Arg Ser
450 455 460

Leu Thr Ile His Gln Ala Met Arg Gly Gln Gln Ile Gly Ser Tyr Phe
465 470 475 480

Gly Ser Glu Ile Thr Ser Val Asp Ile Asp Gly Asp Gly Val Thr Asp
485 490 495

Val Leu Leu Val Gly Ala Pro Met Tyr Phe Asn Glu Gly Arg Glu Arg
500 505 510

Gly Lys Val Tyr Val Tyr Glu Leu Arg Gln Asn Arg Phe Val Tyr Asn
515 520 525

Gly Thr Leu Lys Asp Ser His Ser Tyr Gln Asn Ala Arg Phe Gly Ser
530 535 540

Ser Ile Ala Ser Val Arg Asp Leu Asn Gln Asp Ser Tyr Asn Asp Val
545 550 555 560

Val Val Gly Ala Pro Leu Glu Asp Asn His Ala Gly Ala Ile Tyr Ile
565 570 575

Phe His Gly Phe Arg Gly Ser Ile Leu Lys Thr Pro Lys Gln Arg Ile
580 585 590

Thr Ala Ser Glu Leu Ala Thr Gly Leu Gln Tyr Phe Gly Cys Ser Ile
595 600 605

His Gly Gln Leu Asp Leu Asn Glu Asp Gly Leu Ile Asp Leu Ala Val
610 615 620

Gly Ala Leu Gly Asn Ala Val Ile Leu Trp Ser Arg Pro Val Val Gln
625 630 635 640

Ile Asn Ala Ser Leu His Phe Glu Pro Ser Lys Ile Asn Ile Phe His
645 650 655 

Arg Asp Cys Lys Arg Ser Gly Arg Asp Ala Thr Cys Leu Ala Ala Phe
660 665 670

Leu Cys Phe Thr Pro Ile Phe Leu Ala Pro His Phe Gln Thr Thr Thr
675 680 685

Val Gly Ile Arg Tyr Asn Ala Thr Met Asp Glu Lys Arg Tyr Thr Pro
690 695 700

Arg Ala His Leu Asp Glu Gly Gly Asp Arg Phe Thr Asn Arg Ala Val
705 710 715 720

Leu Leu Ser Ser Gly Gln Glu Leu Cys Glu Arg Ile Asn Phe His Val
725 730 735 

Leu Asp Thr Ala Asp Tyr Val Lys Pro Val Thr Phe Ser Val Glu Tyr 
740 745 750 

Ser Leu Glu Asp Pro Asp His Gly Pro Met Leu Asp Asp Gly Trp Pro 

755 760 765 

Thr Thr Leu Arg Val Ser Val Pro Phe Trp Asn Gly Cys Asn Glu Asp 
770 775 780 


Glu His Cys Val Pro Asp Leu Val Leu Asp Ala Arg Ser Asp Leu Pro
785 790 795 800

Thr Ala Met Glu Tyr Cys Gln Arg Val Leu Arg Lys Pro Ala Gln Asp
805 810 815

Cys Ser Ala Tyr Thr Leu Ser Phe Asp Thr Thr Val Phe Ile Ile Glu
820 825 830

Ser Thr Arg Gln Arg Val Ala Val Glu Ala Thr Leu Glu Asn Arg Gly
835 840 845

Glu Asn Ala Tyr Ser Thr Val Leu Asn Ile Ser Gln Ser Ala Asn Leu
850 855 860

Gln Phe Ala Ser Leu Ile Gln Lys Glu Asp Ser Asp Gly Ser Ile Glu
865 870 875 880

Cys Val Asn Glu Glu Arg Arg Leu Gln Lys Gln Val Cys Asn Val Ser
885 890 895 

Tyr Pro Phe Phe Arg Ala Lys Ala Lys Val Ala Phe Arg Leu Asp Phe 
900 905 910 

Glu Phe Ser Lys Ser Ile Phe Leu His His Leu Glu Ile Glu Leu Ala
915 920 925 

Ala Gly Ser Asp Ser Asn Glu Arg Asp Ser Thr Lys Glu Asp Asn Val 
930 935 940 

Ala Pro Leu Arg Phe His Leu Lys Tyr Glu Ala Asp Val Leu Phe Thr 
945 950 955 960 

Arg Ser Ser Ser Leu Ser His Tyr Glu Val Lys Leu Asn Ser Ser Leu 
965 970 975 


Glu Arg Tyr Asp Gly Ile Gly Pro Pro Phe Ser Cys Ile Phe Arg Ile
980 985 990

Gln Asn Leu Gly Leu Phe Pro Ile His Gly Ile Met Met Lys Ile Thr
995 1000 1005

Ile Pro Ile Ala Thr Arg Ser Gly Asn Arg Leu Leu Lys Leu Arg Asp
1010 1015 1020 

Phe Leu Thr Asp Glu Gly Glu His Val Leu 
1030 


<210> 44 

<211> 461 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (234) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (236) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 44 

Met Ala Leu Met Leu Ser Leu Val Leu Ser Leu Leu Lys Leu Gly Ser 
1 5 10 15

Gly Gln Trp Gln Val Phe Gly Pro Asp Lys Pro Val Gln Ala Leu Val
20 25 30

Gly Glu Asp Ala Ala Phe Ser Cys Phe Leu Ser Pro Lys Thr Asn Ala
35 40 45

Glu Ala Met Glu Val Arg Phe Phe Arg Gly Gln Phe Ser Ser Val Val
50 55 60

His Leu Tyr Arg Asp Gly Lys Asp Gln Pro Phe Met Gln Met Pro Gln
65 70 75 80

Tyr Gln Gly Arg Thr Lys Leu Val Lys Asp Ser Ile Ala Glu Gly Arg
85 90 95

Ile Ser Leu Arg Leu Glu Asn Ile Thr Val Leu Asp Ala Gly Leu Tyr
100 105 110

Gly Cys Arg Ile Ser Ser Gln Ser Tyr Tyr Gln Lys Ala Ile Trp Glu
115 120 125

Leu Gln Val Ser Ala Leu Gly Ser Val Pro Leu Ile Ser Ile Thr Gly
130 135 140

Tyr Val Asp Arg Asp Ile Gln Leu Leu Cys Gln Ser Ser Gly Trp Phe
145 150 155 160

Pro Arg Pro Thr Ala Lys Trp Lys Gly Pro Gln Gly Gln Asp Leu Ser
165 170 175

Thr Asp Ser Arg Thr Asn Arg Asp Met His Gly Leu Phe Asp Val Glu
180 185 190

Ile Ser Leu Thr Val Gln Glu Asn Ala Gly Ser Ile Ser Cys Ser Met
195 200 205

Arg His Ala His Leu Ser Arg Glu Val Glu Ser Arg Val Gln Ile Gly
210 215 220

Asp Thr Phe Phe Glu Pro Ile Ser Trp Xaa Leu Xaa Thr Lys Val Leu
225 230 235 240

Gly Ile Leu Cys Cys Gly Leu Phe Phe Gly Ile Val Gly Leu Lys Ile
245 250 255

Phe Phe Ser Lys Phe Gln Trp Lys Ile Gln Ala Glu Leu Asp Trp Arg
260 265 270

Arg Lys His Gly Gln Ala Glu Leu Arg Asp Ala Arg Lys His Ala Val
275 280 285 

Glu Val Thr Leu Asp Pro Glu Thr Ala His Pro Lys Leu Cys Val Ser 
290 295 300 

Asp Leu Lys Thr Val Thr His Arg Lys Ala Pro Gln Glu Val Pro His
305 310 315 320

Ser Glu Lys Arg Phe Thr Arg Lys Ser Val Val Ala Ser Gln Ser Phe
325 330 335

Gln Ala Gly Lys His Tyr Trp Glu Val Asp Gly Gly His Asn Lys Arg
340 345 350 

Trp Arg Val Gly Val Cys Arg Asp Asp Val Asp Arg Arg Lys Glu Tyr 
355 360 365 

Val Thr Leu Ser Pro Asp His Gly Tyr Trp Val Leu Arg Leu Asn Gly 
370 375 380 

Glu His Leu Tyr Phe Thr Leu Asn Pro Arg Phe Ile Ser Val Phe Pro
385 390 395 400

Arg Thr Pro Pro Thr Lys Ile Gly Val Phe Leu Asp Tyr Glu Cys Gly
405 410 415

Thr Ile Ser Phe Phe Asn Ile Asn Asp Gln Ser Leu Ile Tyr Thr Leu
420 425 430

Thr Cys Arg Phe Glu Gly Leu Leu Arg Pro Tyr Ile Glu Tyr Pro Ser
435 440 445

Tyr Asn Glu Gln Asn Gly Thr Pro Arg Asp Lys Gln Gln
450 455 460 


WO 00/29435 


PCT/US99/25031 


<210> 45 
<211> 383 
<212> PRT 

<213> Homo sapiens 


<400> 45 
Met Ala Gly Ile Pro Gly Leu Leu Phe Leu Leu Phe Phe Leu Leu Cys
1 5 10 15

Ala Val Gly Gln Val Ser Pro Tyr Ser Ala Pro Trp Lys Pro Thr Trp
20 25 30

Pro Ala Tyr Arg Leu Pro Val Val Leu Pro Gln Ser Thr Leu Asn Leu
35 40 45

Ala Lys Pro Asp Phe Gly Ala Glu Ala Lys Leu Glu Val Ser Ser Ser
50 55 60

Cys Gly Pro Gln Cys His Lys Gly Thr Pro Leu Pro Thr Tyr Glu Glu
65 70 75 80

Ala Lys Gln Tyr Leu Ser Tyr Glu Thr Leu Tyr Ala Asn Gly Ser Arg
85 90 95

Thr Glu Thr Gln Val Gly Ile Tyr Ile Leu Ser Ser Ser Gly Asp Gly
100 105 110

Ala Gln His Arg Asp Ser Gly Ser Ser Gly Lys Ser Arg Arg Lys Arg
115 120 125

Gln Ile Tyr Gly Tyr Asp Ser Arg Phe Ser Ile Phe Gly Lys Asp Phe
130 135 140

Leu Leu Asn Tyr Pro Phe Ser Thr Ser Val Lys Leu Ser Thr Gly Cys
145 150 155 160

Thr Gly Thr Leu Val Ala Glu Lys His Val Leu Thr Ala Ala His Cys
165 170 175

Ile His Asp Gly Lys Thr Tyr Val Lys Gly Thr Gln Lys Leu Arg Val
180 185 190

Gly Phe Leu Lys Pro Lys Phe Lys Asp Gly Gly Arg Gly Ala Asn Asp
195 200 205

Ser Thr Ser Ala Met Pro Glu Gln Met Lys Phe Gln Trp Ile Arg Val
210 215 220

Lys Arg Thr His Val Pro Lys Gly Trp Ile Lys Gly Asn Ala Asn Asp
225 230 235 240

Ile Gly Met Asp Tyr Asp Tyr Ala Leu Leu Glu Leu Lys Lys Pro His
245 250 255

Lys Arg Lys Phe Met Lys Ile Gly Val Ser Pro Pro Ala Lys Gln Leu
260 265 270

Pro Gly Gly Arg Ile His Phe Ser Gly Tyr Asp Asn Asp Arg Pro Gly
275 280 285

Asn Leu Val Tyr Arg Phe Cys Asp Val Lys Asp Glu Thr Tyr Asp Leu
290 295 300

Leu Tyr Gln Gln Cys Asp Ala Gln Pro Gly Ala Ser Gly Ser Gly Val
305 310 315 320

Tyr Val Arg Met Trp Lys Arg Gln Gln Gln Lys Trp Glu Arg Lys Ile
325 330 335

Ile Gly Ile Phe Ser Gly His Gln Trp Val Asp Met Asn Gly Ser Pro
340 345 350

Gln Asp Phe Asn Val Ala Val Arg Ile Thr Pro Leu Lys Tyr Ala Gln
355 360 365

Ile Cys Tyr Trp Ile Lys Gly Asn Tyr Leu Asp Cys Arg Glu Gly
370 375 380


<210> 46 

<211> 229 

<212> PRT 

<213> Homo sapiens 

<400> 46 

Met Ile Asp Ile Gln Lys Asp Thr Ala Val Glu Gly Glu Glu Ile Glu
1 5 10 15

Val Asn Cys Thr Ala Met Ala Ser Lys Pro Ala Thr Thr Ile Arg Trp
20 25 30

Phe Lys Gly Asn Thr Glu Leu Lys Gly Lys Ser Glu Val Glu Glu Trp
35 40 45

Ser Asp Met Tyr Thr Val Thr Ser Gln Leu Met Leu Lys Val His Lys
50 55 60

Glu Asp Asp Gly Val Pro Val Ile Cys Gln Val Glu His Pro Ala Val
65 70 75 80

Thr Gly Asn Leu Gln Thr Gln Arg Tyr Leu Glu Val Gln Tyr Lys Pro
85 90 95

Gln Val His Ile Gln Met Thr Tyr Pro Leu Gln Gly Leu Thr Arg Glu
100 105 110

Gly Asp Ala Leu Glu Leu Thr Cys Glu Ala Ile Gly Lys Pro Gln Pro
115 120 125

Val Met Val Thr Trp Val Arg Val Asp Asp Glu Met Pro Gln His Ala
130 135 140

Val Leu Ser Gly Pro Asn Leu Phe Ile Asn Asn Leu Asn Lys Thr Asp
145 150 155 160 

Asn Gly Thr Tyr Arg Cys Glu Ala Ser Asn Ile Val Gly Lys Ala His
165 170 175

Ser Asp Tyr Met Leu Tyr Val Tyr Asp Pro Pro Thr Thr Ile Pro Pro
180 185 190

Pro Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Ile Leu
195 200 205

Thr Ile Ile Thr Asp Ser Pro Ser Gln Val Lys Lys Ala Arg Ser Gly
210 215 220

Gln Trp Ile Met Pro
225


<210> 47 

<211> 724 

<212> PRT 

<213> Homo sapiens 

<400> 47 

Met Leu Arg Thr Ser Thr Pro Asn Leu Cys Gly Gly Leu His Cys Arg 
1 5 10 15

Ala Pro Trp Leu Ser Ser Gly Ile Leu Cys Leu Cys Leu Ile Phe Leu
20 25 30

Leu Gly Gln Val Gly Leu Leu Gln Gly His Pro Gln Cys Leu Asp Tyr
35 40 45

Gly Pro Pro Phe Gln Pro Pro Leu His Leu Glu Phe Cys Ser Asp Tyr
50 55 60

Glu Ser Phe Gly Cys Cys Asp Gln His Lys Asp Arg Arg Ile Ala Ala
65 70 75 80

Arg Tyr Trp Asp Ile Met Glu Tyr Phe Asp Leu Lys Arg His Glu Leu
85 90 95

Cys Gly Asp Tyr Ile Lys Asp Ile Leu Cys Gln Glu Cys Ser Pro Tyr
100 105 110

Ala Ala His Leu Tyr Asp Ala Glu Asn Thr Gln Thr Pro Leu Arg Asn
115 120 125

Leu Pro Gly Leu Cys Ser Asp Tyr Cys Ser Ala Phe His Ser Asn Cys
130 135 140

His Ser Ala Ile Ser Leu Leu Thr Asn Asp Arg Gly Leu Gln Glu Ser
145 150 155 160 

His Gly Arg Asp Gly Thr Arg Phe Cys His Leu Leu Asp Leu Pro Asp 
165 170 175 

Lys Asp Tyr Cys Phe Pro Asn Val Leu Arg Asn Asp Tyr Leu Asn Arg 
180 185 190 

His Leu Gly Met Val Ala Gln Asp Pro Gln Gly Cys Leu Gln Leu Cys
195 200 205

Leu Ser Glu Val Ala Asn Gly Leu Arg Asn Pro Val Ser Met Val His
210 215 220

Ala Gly Asp Gly Thr His Arg Phe Phe Val Ala Glu Gln Val Gly Val
225 230 235 240

Val Trp Val Tyr Leu Pro Asp Gly Ser Arg Leu Glu Gln Pro Phe Leu
245 250 255

Asp Leu Lys Asn Ile Val Leu Thr Thr Pro Trp Ile Gly Asp Glu Arg
260 265 270

Gly Phe Leu Gly Leu Ala Phe His Pro Lys Phe Arg His Asn Arg Lys
275 280 285

Phe Tyr Ile Tyr Tyr Ser Cys Leu Asp Lys Lys Lys Val Glu Lys Ile
290 295 300

Arg Ile Ser Glu Met Lys Val Ser Arg Ala Asp Pro Asn Lys Ala Asp
305 310 315 320

Leu Lys Ser Glu Arg Val Ile Leu Glu Ile Glu Glu Pro Ala Ser Asn
325 330 335

His Asn Gly Gly Gln Leu Leu Phe Gly Leu Asp Gly Tyr Met Tyr Ile
340 345 350

Phe Thr Gly Asp Gly Gly Gln Ala Gly Asp Pro Phe Gly Leu Phe Gly
355 360 365

Asn Ala Gln Asn Lys Ser Ser Leu Leu Gly Lys Val Leu Arg Ile Asp
370 375 380 

Val Asn Arg Ala Gly Ser His Gly Lys Arg Tyr Arg Val Pro Ser Asp 
385 390 395 400 

Asn Pro Phe Val Ser Glu Pro Gly Ala His Pro Ala Ile Tyr Ala Tyr
405 410 415

Gly Ile Arg Asn Met Trp Arg Cys Ala Val Asp Arg Gly Asp Pro Ile
420 425 430

Thr Arg Gln Gly Arg Gly Arg Ile Phe Cys Gly Asp Val Gly Gln Asn
435 440 445

Arg Phe Glu Glu Val Asp Leu Ile Leu Lys Gly Gly Asn Tyr Gly Trp
450 455 460

Arg Ala Lys Glu Gly Phe Ala Cys Tyr Asp Lys Lys Leu Cys His Asn
465 470 475 480

Ala Ser Leu Asp Asp Val Leu Pro Ile Tyr Ala Tyr Gly His Ala Val
485 490 495 

Gly Lys Ser Val Thr Gly Gly Tyr Val Tyr Arg Gly Cys Glu Ser Pro
500 505 510

Asn Leu Asn Gly Leu Tyr Ile Phe Gly Asp Phe Met Ser Gly Arg Leu
515 520 525

Met Ala Leu Gln Glu Asp Arg Lys Asn Lys Lys Trp Lys Lys Gln Asp
530 535 540

Leu Cys Leu Gly Ser Thr Thr Ser Cys Ala Phe Pro Gly Leu Ile Ser
545 550 555 560

Thr His Ser Lys Phe Ile Ile Ser Phe Ala Glu Asp Glu Ala Gly Glu
565 570 575

Leu Tyr Phe Leu Ala Thr Ser Tyr Pro Ser Ala Tyr Ala Pro Arg Gly
580 585 590

Ser Ile Tyr Lys Phe Val Asp Pro Ser Arg Arg Ala Pro Pro Gly Lys
595 600 605

Cys Lys Tyr Lys Pro Val Pro Val Arg Thr Lys Ser Lys Arg Ile Pro
610 615 620

Phe Arg Pro Leu Ala Lys Thr Val Leu Asp Leu Leu Lys Glu Gln Ser
625 630 635 640

Glu Lys Ala Ala Arg Lys Ser Ser Ser Ala Thr Leu Ala Ser Gly Pro
645 650 655

Ala Gln Gly Leu Ser Glu Lys Gly Ser Ser Lys Lys Leu Ala Ser Pro
660 665 670

Thr Ser Ser Lys Asn Thr Leu Arg Gly Pro Gly Thr Lys Lys Lys Ala
675 680 685

Arg Val Gly Pro His Val Arg Gln Gly Lys Arg Arg Lys Ser Leu Lys
690 695 700

Ser His Ser Gly Arg Met Arg Pro Ser Ala Glu Gln Lys Arg Ala Gly
705 710 715 720 

Arg Ser Leu Pro 


<210> 48 

<211> 995 

<212> PRT 

<213> Homo sapiens 


<400> 48 

Met Pro Lys Pro Thr Pro Asn Ser Glu Arg Val Ser Val Arg Phe Pro
1 5 10 15

Gly Cys Arg Thr Gly Met His Met Ile Ser Val Ser Leu Arg Leu Val
20 25 30

Phe Cys Ser Phe Ile Phe Lys Ala Gly Val Leu Leu Gly His Pro Gln
35 40 45

Cys Leu Asp Tyr Gly Pro Pro Phe Lys Pro Leu Val His Leu Glu Phe
50 55 60

Cys Ser Glu Tyr Glu Thr Phe Gly Cys Cys Asp Gln Asp Arg Asp Asn
65 70 75 80

Val Ile Ala Glu Lys Tyr Trp Ser Ile Met Asp Tyr Phe Asp Leu Asn
85 90 95

Asn Tyr His Ile Cys Gly Gly Tyr Ile Lys Asp Ile Leu Cys Gln Glu
100 105 110

Cys Ser Pro Tyr Ala Ala His Leu Tyr Asp Ala Glu Asp Pro His Thr
115 120 125

Pro Leu Arg Val Ile Pro Gly Leu Cys Phe Asn Tyr Cys Ser Glu Phe
130 135 140

His Leu Lys Cys Gln Asn Ser Ile Thr Leu Leu Thr Glu Asp Lys Gln
145 150 155 160

Ile Arg Glu Ser Cys Asp Lys Gly Arg Asp Leu Phe Cys Ser Leu Leu
165 170 175

Asn Leu Pro Asp Glu Asp Tyr Cys Phe Pro Asn Val Leu His Asn Thr
180 185 190

Glu Leu Asn Asn Asn Leu Gly Ser Val Val Glu Asp Pro Glu Gly Cys
195 200 205

Ile Lys Leu Cys Leu Ile Glu Val Ala Asn Gly Leu Arg Asn Pro Val
210 215 220

Leu Met Leu His Ala Asn Asp Gly Thr His Arg Met Phe Val Ala Glu
225 230 235 240

Gln Ile Gly Phe Val Trp Val Tyr Leu Pro Asp Gly Ser Arg Leu Tyr
245 250 255

Glu Pro Phe Leu Asn Leu Arg Arg Thr Val Leu Ala Thr Pro Trp Leu
260 265 270

Gly Asp Glu Arg Gly Leu Leu Gly Met Ala Phe His Pro Lys Tyr Gln
275 280 285

Asn Asn Arg Lys Phe Tyr Val Tyr Tyr Ser Ile Met Asp Glu Tyr Arg
290 295 300

Asn Glu Lys Ile Arg Ile Ser Glu Phe Gln Val Glu Glu His Asp Ile
305 310 315 320

Asn Lys Ala Asp Pro Tyr Ser Glu Arg Arg Ile Leu Glu Ile Glu Glu
325 330 335

Pro Ala Ala Asn His Asn Gly Gly Gln Ile Leu Phe Gly Lys Asp Gly
340 345 350

Tyr Leu Tyr Ile Phe Thr Gly Asp Gly Gly Lys Ala Gly Asp Pro Phe
355 360 365


Gly Arg Phe Gly Asn Ala Gln Asn Lys Ser Val Leu Leu Gly Lys Val
370 375 380

Leu Arg Ile Asp Val Asp Gly Arg Arg Ala Asn Gly Lys Pro Tyr Gly
385 390 395 400

Ile Pro Ser Asp Asn Pro Phe Leu Ser Glu Arg Gly Ala Ala Pro Glu
405 410 415

Val His Ala Tyr Gly Val Arg Asn Met Trp Arg Cys Ser Val Asp Gln
420 425 430

Gly Asp Pro Val Thr Gly Arg Gly Lys Gly Arg Ile Phe Cys Gly Asp
435 440 445

Val Gly Gln Asn Arg Phe Gly Glu Asp Asp Ile Ile Val Ile Gly Gly
450 455 460

Asn Tyr Gly Trp Arg Ala Lys Glu Gly Phe Glu Cys Phe Asp Leu Lys
465 470 475 480

Leu Cys Gln Asn Ser Ser Leu Asp Asp Ile Leu Pro Ile Phe Ala Tyr
485 490 495

Gly His Gln Val Gly Lys Ser Val Thr Gly Gly Tyr Val Tyr Arg Gly
500 505 510

Cys Glu Ser Pro Asn Leu Asn Gly Val Tyr Ile Phe Gly Asp Phe Met
515 520 525

Asn Gly Arg Leu Met Ala Leu Gln Glu Asp Gly Val Thr Gly Thr Trp
530 535 540

Lys Lys Gln Asp Ile Cys Met Gly Asp Ser Thr Ile Cys Ala Phe Pro
545 550 555 560

Arg Leu Ile Asn Lys Tyr Ser Lys Phe Ile Ile Ser Phe Gly Glu Asp
565 570 575

Glu Ala Gly Glu Leu Leu Phe Leu Ser Thr Ser Gln Ala Ser Ala Tyr
580 585 590

Ser Pro Gln Gly Ser Ile Tyr Lys Leu Val Asp Pro Ser Arg Arg Ala
595 600 605

Ala Pro Gly Lys Cys Lys Tyr Lys Pro Val Pro Val Lys Thr Arg Ser
610 615 620

Lys Leu Val Pro Phe Ile Pro Lys Glu Lys Thr Val Leu Glu Ile Val
625 630 635 640 

Asn Glu Ser Val Lys Pro Thr Lys Ala Pro Arg Lys Lys Thr Pro Thr
645 650 655

Lys Phe Pro Thr Lys Val Pro Pro Thr Pro Thr Lys Phe Pro Thr Lys
660 665 670

Val Pro Pro Thr Pro Thr Gln Phe Pro Thr Lys Val Pro Pro Ile Pro
675 680 685

Thr Lys Val Pro Ser Lys Val Pro Pro Thr Pro Thr Gln Phe Pro Thr
690 695 700

Lys Val Pro Pro Thr Pro Thr Lys Val Ser Thr Lys Val Leu Ser Thr
705 710 715 720

Pro Thr Ile Ala His Thr Lys Val Ser Pro Thr Ser Thr Lys Leu Pro
725 730 735

Ser Lys Ala Pro Ser Thr Gln Thr Met Val Pro Thr Lys Val His Pro
740 745 750

Thr Pro Thr Lys Leu Pro Thr Lys Val Pro Pro Ile Thr Thr Lys Val
755 760 765

Ser Asn Lys Val Leu Leu Thr Ser Pro Glu Leu Pro Thr Lys Val Pro
770 775 780

Pro Thr Pro Thr Lys Leu Pro Thr Asn Ala Pro Pro Thr Ser Ile Leu
785 790 795 800

Leu Ser Pro Thr Pro Ile Lys Leu Pro Thr Lys Ile Ser Leu Thr Leu
805 810 815

Thr Ser Val Pro Ile Lys Asn Gln Leu Thr Ser Ala Lys Leu Leu Thr
820 825 830

Thr Thr Leu Pro Ile Ser Thr Lys Arg Ala Thr Lys Leu Pro Ser Thr
835 840 845

Ser Thr Ser Val Pro Ser Asn Thr Ser Cys Ile Leu Thr His Val Gln
850 855 860

Pro Lys Met Leu Pro Thr Glu Thr Arg Val Pro Asn Lys Met Pro Pro
865 870 875 880

Lys Pro Thr Arg Ile Pro Thr Met Ser Met Tyr Ile Thr Lys Lys Pro
885 890 895 

Pro Leu Lys Lys Asn Ser Ala Lys Lys Val Thr Asp Lys Arg Pro Thr 
900 905 910 

Lys Ser Pro Lys Thr Thr Lys Pro Pro Lys Pro Pro Lys Ser Lys Thr 
915 920 925 

Ser Val Val Asn Gin Pro Lys Lys Lys Glu Thr Lys Thr Gly Val Asn 
930 935 940 

Asn Lys Thr Lys Asn Leu Pro Pro Lys Ala Lys Glu Pro Lys Lys Glu 
945 950 955 960 


Lys Lys Thr He Lys Val Lys Gin Pro Val Ser His Tyr Phe Pro Pro 
965 970 975

Gln Lys Pro Lys Lys Gln Lys Ile Lys Lys Met Gln Lys Glu Gly Asn
980 985 990 

Glu Lys Ser 
995 


<210> 49 

<211> 700 

<212> PRT 

<213> Homo sapiens 

<400> 49 

Met Leu Lys Met Leu Ser Phe Lys Leu Leu Leu Leu Ala Val Ala Leu
1 5 10 15

Gly Phe Phe Glu Gly Asp Ala Lys Phe Gly Glu Arg Ser Glu Gly Ser
20 25 30

Gly Ala Arg Arg Arg Arg Cys Leu Asn Gly Asn Pro Pro Lys Arg Leu
35 40 45

Lys Arg Arg Asp Arg Arg Val Met Ser Gln Leu Glu Leu Leu Ser Gly
50 55 60

Gly Glu Ile Leu Cys Gly Gly Phe Tyr Pro Arg Val Ser Cys Cys Leu
65 70 75 80

Gln Ser Asp Ser Pro Gly Leu Gly Arg Leu Glu Asn Lys Ile Phe Ser
85 90 95

Ala Thr Asn Asn Ser Glu Cys Ser Arg Leu Leu Glu Glu Ile Gln Cys
100 105 110

Ala Pro Cys Ser Pro His Ser Gln Ser Leu Phe Tyr Thr Pro Glu Arg
115 120 125

Asp Val Leu Asp Gly Asp Leu Ala Leu Pro Leu Leu Cys Lys Asp Tyr
130 135 140

Cys Lys Glu Phe Phe Tyr Thr Cys Arg Gly His Ile Pro Gly Leu Leu
145 150 155 160

Gln Thr Thr Ala Asp Glu Phe Cys Phe Tyr Tyr Ala Arg Lys Asp Ala
165 170 175

Gly Leu Cys Phe Pro Asp Phe Pro Arg Lys Gln Val Arg Gly Pro Ala
180 185 190

Ser Asn Tyr Leu Gly Gln Met Glu Asp Tyr Glu Lys Val Gly Gly Ile
195 200 205

Ser Arg Lys His Lys His Asn Cys Leu Cys Val Gln Glu Val Met Ser
210 215 220

Gly Leu Arg Gln Pro Val Ser Ala Val His Ser Gly Asp Gly Ser His
225 230 235 240

Arg Leu Phe Ile Leu Glu Lys Glu Gly Tyr Val Lys Ile Leu Thr Pro
245 250 255

Glu Gly Glu Leu Phe Lys Glu Pro Tyr Leu Asp Ile His Lys Leu Val
260 265 270

Gln Ser Gly Ile Lys Gly Gly Asp Glu Arg Gly Leu Leu Ser Leu Ala
275 280 285

Phe His Pro Asn Tyr Lys Lys Asn Gly Lys Leu Tyr Val Ser Tyr Thr
290 295 300

Thr Asn Gln Glu Arg Trp Ala Ile Gly Pro His Asp His Ile Leu Arg
305 310 315 320

Val Val Glu Tyr Thr Val Ser Arg Lys Asn Pro His Gln Val Asp Val
325 330 335

Arg Thr Ala Arg Val Phe Leu Glu Val Ala Glu Leu His Arg Lys His
340 345 350

Leu Gly Gly Gln Leu Leu Phe Gly Pro Asp Gly Phe Leu Tyr Ile Ile
355 360 365

Leu Gly Asp Gly Met Ile Thr Leu Asp Asp Met Glu Glu Met Asp Gly
370 375 380

Leu Ser Asp Phe Thr Gly Ser Val Leu Arg Leu Asp Val Asp Thr Asp
385 390 395 400

Met Cys Asn Val Pro Tyr Ser Ile Pro Arg Ser Asn Pro His Phe Asn
405 410 415

Ser Thr Asn Gln Pro Pro Glu Val Phe Ala His Gly Leu His Asp Pro
420 425 430

Gly Arg Cys Ala Val Asp Arg His Pro Thr Asp Ile Asn Ile Asn Leu
435 440 445

Thr Ile Leu Cys Ser Asp Ser Asn Gly Lys Asn Arg Ser Ser Ala Arg
450 455 460

Ile Leu Gln Ile Ile Lys Gly Arg Asp Tyr Glu Ser Glu Pro Ser Leu
465 470 475 480

Leu Glu Phe Lys Pro Phe Ser Asn Gly Pro Leu Val Gly Gly Phe Val
485 490 495

Tyr Arg Gly Cys Gln Ser Glu Arg Leu Tyr Gly Ser Tyr Val Phe Gly
500 505 510

Asp Arg Asn Gly Asn Phe Leu Thr Leu Gln Gln Ser Pro Val Thr Lys
515 520 525

Gln Trp Gln Glu Lys Pro Leu Cys Leu Gly Ala Ser Ser Ser Cys Arg
530 535 540

Gly Tyr Phe Ser Gly His Ile Leu Gly Phe Gly Glu Asp Glu Leu Gly
545 550 555 560

Glu Val Tyr Ile Leu Ser Ser Ser Lys Ser Met Thr Gln Thr His Asn
565 570 575

Gly Lys Leu Tyr Lys Ile Val Asp Pro Lys Arg Pro Leu Met Pro Glu
580 585 590

Glu Cys Arg Val Thr Val Gln Pro Ala Gln Pro Leu Thr Ser Asp Cys
595 600 605

Ser Arg Leu Cys Arg Asn Gly Tyr Tyr Thr Pro Thr Gly Lys Cys Cys
610 615 620

Cys Ser Pro Gly Trp Glu Gly Asp Phe Cys Arg Ile Ala Lys Cys Glu
625 630 635 640

Pro Ala Cys Arg His Gly Gly Val Cys Val Arg Pro Asn Lys Cys Leu
645 650 655

Cys Lys Lys Gly Tyr Leu Gly Pro Gln Cys Glu Gln Val Asp Arg Asn
660 665 670

Val Arg Arg Val Thr Arg Ala Gly Ile Leu Asp Gln Ile Ile Asp Met
675 680 685

Thr Ser Tyr Leu Leu Asp Leu Thr Ser Tyr Ile Val
690 695 700


<210> 50 

<211> 567 

<212> PRT 

<213> Homo sapiens 

<400> 50 

Thr Ser Thr Pro Pro Arg Ala Val Pro Leu Pro Lys Ser Ser Gln Ala
1 5 10 15

Ala His Gln Arg Asn Cys Asn Ser Gly Trp Ser Pro Gly Pro Ala Ser
20 25 30

Leu Gly Val Arg Gly Ser Val Cys Pro Ala Ile Cys Trp Trp His Leu
35 40 45

Ser Leu Leu Pro Pro Pro Ser Val Asn Pro Thr Leu Gln Lys Cys Ser
50 55 60

Ser Pro Gly Ala Ala Gln Glu Leu Ser Met Arg Pro Pro Gly Phe Arg
65 70 75 80

Asn Phe Leu Leu Leu Ala Ser Ser Leu Leu Phe Ala Gly Leu Ser Ala
85 90 95

Val Pro Gln Ser Phe Ser Pro Ser Leu Arg Ser Trp Pro Gly Ala Ala
100 105 110

Cys Arg Leu Ser Arg Ala Glu Ser Glu Arg Arg Cys Arg Ala Pro Gly
115 120 125

Gln Pro Pro Gly Ala Ala Leu Cys His Gly Arg Gly Arg Cys Asp Cys
130 135 140

Gly Val Cys Ile Cys His Val Thr Glu Pro Gly Met Phe Phe Gly Pro
145 150 155 160

Leu Cys Glu Cys His Glu Trp Val Cys Glu Thr Tyr Asp Gly Ser Thr
165 170 175

Cys Ala Gly His Gly Lys Cys Asp Cys Gly Lys Cys Lys Cys Asp Gln
180 185 190

Gly Trp Tyr Gly Asp Ala Cys Gln Tyr Pro Thr Asn Cys Asp Leu Thr
195 200 205

Lys Lys Lys Ser Asn Gln Met Cys Lys Asn Ser Gln Asp Ile Ile Cys
210 215 220

Ser Asn Ala Gly Thr Cys His Cys Gly Arg Cys Lys Cys Asp Asn Ser
225 230 235 240

Asp Gly Ser Gly Leu Val Tyr Gly Lys Phe Cys Glu Cys Asp Asp Arg
245 250 255

Glu Cys Ile Asp Asp Glu Thr Glu Glu Ile Cys Gly Gly His Gly Lys
260 265 270

Cys Tyr Cys Gly Asn Cys Tyr Cys Lys Ala Gly Trp His Gly Asp Lys
275 280 285

Cys Glu Phe Gln Cys Asp Ile Thr Pro Trp Glu Ser Lys Arg Arg Cys
290 295 300

Thr Ser Pro Asp Gly Lys Ile Cys Ser Asn Arg Gly Thr Cys Val Cys
305 310 315 320

Gly Glu Cys Thr Cys His Asp Val Asp Pro Thr Gly Asp Trp Gly Asp
325 330 335

Ile His Gly Asp Thr Cys Glu Cys Asp Glu Arg Asp Cys Arg Ala Val
340 345 350

Tyr Asp Arg Tyr Ser Asp Asp Phe Cys Ser Gly His Gly Gln Cys Asn
355 360 365

Cys Gly Arg Cys Asp Cys Lys Ala Gly Trp Tyr Gly Lys Lys Cys Glu
370 375 380

His Pro Gln Ser Cys Thr Leu Ser Ala Glu Glu Ser Ile Arg Lys Cys
385 390 395 400

Gln Gly Ser Ser Asp Leu Pro Cys Ser Gly Arg Gly Lys Cys Glu Cys
405 410 415

Gly Lys Cys Thr Cys Tyr Pro Pro Gly Asp Arg Arg Val Tyr Gly Lys
420 425 430

Thr Cys Glu Cys Asp Asp Arg Arg Cys Glu Asp Leu Asp Gly Val Val
435 440 445

Cys Gly Gly His Gly Thr Cys Ser Cys Gly Arg Cys Val Cys Glu Arg
450 455 460

Gly Trp Phe Gly Lys Leu Cys Gln His Pro Arg Lys Cys Asn Met Thr
465 470 475 480

Glu Glu Gln Ser Lys Asn Leu Cys Glu Ser Ala Asp Gly Ile Leu Cys
485 490 495

Ser Gly Lys Gly Ser Cys His Cys Gly Lys Cys Ile Cys Ser Ala Glu
500 505 510

Glu Trp Tyr Ile Ser Gly Glu Phe Cys Asp Cys Asp Asp Arg Asp Cys
515 520 525

Asp Lys His Asp Gly Leu Ile Cys Thr Gly Asn Gly Ile Cys Ser Cys
530 535 540

Gly Asn Cys Glu Cys Trp Asp Gly Trp Asn Gly Asn Ala Cys Glu Ile
545 550 555 560

Trp Leu Gly Ser Glu Tyr Pro
565


<210> 51 

<211> 29 

<212> PRT 

<213> Homo sapiens 

<400> 51 

Gly Lys Cys Asp Cys Gly Lys Cys Lys Cys Asp Gln Gly Trp Tyr Gly
1 5 10 15

Asp Ala Cys Gln Tyr Pro Thr Asn Cys Asp Leu Thr Lys
20 25


<210> 52 

<211> 29 

<212> PRT 

<213> Homo sapiens 

<400> 52 

Gly Gly His Gly Lys Cys Tyr Cys Gly Asn Cys Tyr Cys Lys Ala Gly
1 5 10 15

Trp His Gly Asp Lys Cys Glu Phe Gln Cys Asp Ile Thr
20 25


<210> 53 
<211> 29

<212> PRT

<213> Homo sapiens

<400> 53

His Gly Gln Cys Asn Cys Gly Arg Cys Asp Cys Lys Ala Gly Trp Tyr
1 5 10 15

Gly Lys Lys Cys Glu His Pro Gln Ser Cys Thr Leu Ser
20 25


<210> 54 

<211> 29 

<212> PRT 

<213> Homo sapiens 


<400> 54 

His Gly Thr Cys Ser Cys Gly Arg Cys Val Cys Glu Arg Gly Trp Phe
1 5 10 15

Gly Lys Leu Cys Gln His Pro Arg Lys Cys Asn Met Thr
20 25


<210> 55 
<211> 29 
<212> PRT 
<213> Homo sapiens

<400> 55

Gly Asn Gly Ile Cys Ser Cys Gly Asn Cys Glu Cys Trp Asp Gly Trp
1 5 10 15

Asn Gly Asn Ala Cys Glu Ile Trp Leu Gly Ser Glu Tyr
20 25


<210> 56 

<211> 29 

<212> PRT 

<213> Homo sapiens 


<400> 56 

Gly Asn Gly Ile Cys Ser Cys Gly Asn Cys Glu Cys Trp Asp Gly Trp
1 5 10 15

Asn Gly Asn Ala Cys Glu Ile Trp Leu Gly Ser Glu Tyr
20 25


<210> 57 

<211> 29 

<212> PRT 

<213> Homo sapiens 


<400> 57

Gly Asn Gly Ile Cys Ser Cys Gly Asn Cys Glu Cys Trp Asp Gly Trp
1 5 10 15

Asn Gly Asn Ala Cys Glu Ile Trp Leu Gly Ser Glu Tyr
20 25


<210> 58 

<211> 33 

<212> PRT 

<213> Homo sapiens 

<400> 58 

Glu Thr Tyr Asp Gly Ser Thr Cys Ala Gly His Gly Lys Cys Asp Cys
1 5 10 15

Gly Lys Cys Lys Cys Asp Gln Gly Trp Tyr Gly Asp Ala Cys Gln Tyr
20 25 30

Pro


<210> 59 

<211> 34 

<212> PRT 

<213> Homo sapiens 

<400> 59 

Met Cys Lys Asn Ser Gln Asp Ile Ile Cys Ser Asn Ala Gly Thr Cys
1 5 10 15

His Cys Gly Arg Cys Lys Cys Asp Asn Ser Asp Gly Ser Gly Leu Val
20 25 30

Tyr Gly


<210> 60 

<211> 31 

<212> PRT 

<213> Homo sapiens 

<400> 60 

Ile Asp Asp Glu Thr Glu Glu Ile Cys Gly Gly His Gly Lys Cys Tyr
1 5 10 15

Cys Gly Asn Cys Tyr Cys Lys Ala Gly Trp His Gly Asp Lys Cys
20 25 30


<210> 61 
<211> 34 
<212> PRT 

<213> Homo sapiens 


<400> 61 

Lys Arg Arg Cys Thr Ser Pro Asp Gly Lys Ile Cys Ser Asn Arg Gly
1 5 10 15

Thr Cys Val Cys Gly Glu Cys Thr Cys His Asp Val Asp Pro Thr Gly
20 25 30

Asp Trp 


<210> 62 

<211> 34 

<212> PRT 

<213> Homo sapiens 

<400> 62 

Asp Arg Tyr Ser Asp Asp Phe Cys Ser Gly His Gly Gln Cys Asn Cys
1 5 10 15

Gly Arg Cys Asp Cys Lys Ala Gly Trp Tyr Gly Lys Lys Cys Glu His
20 25 30

Pro Gln


<210> 63 
<211> 33 
<212> PRT 

<213> Homo sapiens 
<400> 63 

Cys Gln Gly Ser Ser Asp Leu Pro Cys Ser Gly Arg Gly Lys Cys Glu
1 5 10 15

Cys Gly Lys Cys Thr Cys Tyr Pro Pro Gly Asp Arg Arg Val Tyr Gly
20 25 30


<210> 64 

<211> 31 

<212> PRT 

<213> Homo sapiens 

<400> 64 

Cys Glu Asp Leu Asp Gly Val Val Cys Gly Gly His Gly Thr Cys Ser
1 5 10 15

Cys Gly Arg Cys Val Cys Glu Arg Gly Trp Phe Gly Lys Leu Cys
20 25 30


<210> 65 

<211> 32 

<212> PRT 

<213> Homo sapiens 


<400> 65

Ser Ala Asp Gly Ile Leu Cys Ser Gly Lys Gly Ser Cys His Cys Gly
1 5 10 15

Lys Cys Ile Cys Ser Ala Glu Glu Trp Tyr Ile Ser Gly Glu Phe Cys
20 25 30


<210> 66 

<211> 33 

<212> PRT 

<213> Homo sapiens 

<400> 66 

Cys Asp Lys His Asp Gly Leu Ile Cys Thr Gly Asn Gly Ile Cys Ser
1 5 10 15

Cys Gly Asn Cys Glu Cys Trp Asp Gly Trp Asn Gly Asn Ala Cys Glu
20 25 30

Ile


<210> 67 

<211> 769 

<212> PRT 

<213> Homo sapiens 

<400> 67 

Met Cys Gly Ser Ala Leu Ala Phe Phe Thr Ala Ala Phe Val Cys Leu
1 5 10 15

Gln Asn Asp Arg Arg Gly Pro Ala Ser Phe Leu Trp Ala Ala Trp Val
20 25 30

Phe Ser Leu Val Leu Gly Leu Gly Gln Gly Glu Asp Asn Arg Cys Ala
35 40 45

Ser Ser Asn Ala Ala Ser Cys Ala Arg Cys Leu Ala Leu Gly Pro Glu
50 55 60

Cys Gly Trp Cys Val Gln Glu Asp Phe Ile Ser Gly Gly Ser Arg Ser
65 70 75 80

Glu Arg Cys Asp Ile Val Ser Asn Leu Ile Ser Lys Gly Cys Ser Val
85 90 95

Asp Ser Ile Glu Tyr Pro Ser Val His Val Ile Ile Pro Thr Glu Asn
100 105 110

Glu Ile Asn Thr Gln Val Thr Pro Gly Glu Val Ser Ile Gln Leu Arg
115 120 125

Pro Gly Ala Glu Ala Asn Phe Met Leu Lys Val His Pro Leu Lys Lys
130 135 140

Tyr Pro Val Asp Leu Tyr Tyr Leu Val Asp Val Ser Ala Ser Met His
145 150 155 160

Asn Asn Ile Glu Lys Leu Asn Ser Val Gly Asn Asp Leu Ser Arg Lys
165 170 175

Met Ala Phe Phe Ser Arg Asp Phe Arg Leu Gly Phe Gly Ser Tyr Val
180 185 190

Asp Lys Thr Val Ser Pro Tyr Ile Ser Ile His Pro Glu Arg Ile His
195 200 205

Asn Gln Cys Ser Asp Tyr Asn Leu Asp Cys Met Pro Pro His Gly Tyr
210 215 220

Ile His Val Leu Ser Leu Thr Glu Asn Ile Thr Glu Phe Glu Lys Ala
225 230 235 240

Val His Arg Gln Lys Ile Ser Gly Asn Ile Asp Thr Pro Glu Gly Gly
245 250 255

Phe Asp Ala Met Leu Gln Ala Ala Val Cys Glu Ser His Ile Gly Trp
260 265 270

Arg Lys Glu Ala Lys Arg Leu Leu Leu Val Met Thr Asp Gln Thr Ser
275 280 285

His Leu Ala Leu Asp Ser Lys Leu Ala Gly Ile Val Val Pro Asn Asp
290 295 300

Gly Asn Cys His Leu Lys Asn Asn Val Tyr Val Lys Ser Thr Thr Met
305 310 315 320

Glu His Pro Ser Leu Gly Gln Leu Ser Glu Lys Leu Ile Asp Asn Asn
325 330 335

Ile Asn Val Ile Phe Ala Val Gln Gly Lys Gln Phe His Trp Tyr Lys
340 345 350

Asp Leu Leu Pro Leu Leu Pro Gly Thr Ile Ala Gly Glu Ile Glu Ser
355 360 365

Lys Ala Ala Asn Leu Asn Asn Leu Val Val Glu Ala Tyr Gln Lys Leu
370 375 380

Ile Ser Glu Val Lys Val Gln Val Glu Asn Gln Val Gln Gly Ile Tyr
385 390 395 400

Phe Asn Ile Thr Ala Ile Cys Pro Asp Gly Ser Arg Lys Pro Gly Met
405 410 415

Glu Gly Cys Arg Asn Val Thr Ser Asn Asp Glu Val Leu Phe Asn Val
420 425 430

Thr Val Thr Met Lys Lys Cys Asp Val Thr Gly Gly Lys Asn Tyr Ala
435 440 445


Ile Ile Lys Pro Ile Gly Phe Asn Glu Thr Ala Lys Ile His Ile His
450 455 460 

Arg Asn Cys Ser Cys Gln Cys Glu Asp Asn Arg Gly Pro Lys Gly Lys
465 470 475 480

Cys Val Asp Glu Thr Phe Leu Asp Ser Lys Cys Phe Gln Cys Asp Glu
485 490 495

Asn Lys Cys His Phe Asp Glu Asp Gln Phe Ser Ser Glu Ser Cys Lys
500 505 510

Ser His Lys Asp Gln Pro Val Cys Ser Gly Arg Gly Val Cys Val Cys
515 520 525

Gly Lys Cys Ser Cys His Lys Ile Lys Leu Gly Lys Val Tyr Gly Lys
530 535 540

Tyr Cys Glu Lys Asp Asp Phe Ser Cys Pro Tyr His His Gly Asn Leu
545 550 555 560

Cys Ala Gly His Gly Glu Cys Glu Ala Gly Arg Cys Gln Cys Phe Ser
565 570 575

Gly Trp Glu Gly Asp Arg Cys Gln Cys Pro Ser Ala Ala Ala Gln His
580 585 590

Cys Val Asn Ser Lys Gly Gln Val Cys Ser Gly Arg Gly Thr Cys Val
595 600 605

Cys Gly Arg Cys Glu Cys Thr Asp Pro Arg Ser Ile Gly Arg Phe Cys
610 615 620

Glu His Cys Pro Thr Cys Tyr Thr Ala Cys Lys Glu Asn Trp Asn Cys
625 630 635 640

Met Gln Cys Leu His Pro His Asn Leu Ser Gln Ala Ile Leu Asp Gln
645 650 655

Cys Lys Thr Ser Cys Ala Leu Met Glu Gln Gln His Tyr Val Asp Gln
660 665 670

Thr Ser Glu Cys Phe Ser Ser Pro Ser Tyr Leu Arg Ile Phe Phe Ile
675 680 685

Ile Phe Ile Val Thr Phe Leu Ile Gly Leu Leu Lys Val Leu Ile Ile
690 695 700

Arg Gln Val Ile Leu Gln Trp Asn Ser Asn Lys Ile Lys Ser Ser Ser
705 710 715 720

Asp Tyr Arg Val Ser Ala Ser Lys Lys Asp Lys Leu Ile Leu Gln Ser
725 730 735

Val Cys Thr Arg Ala Val Thr Tyr Arg Arg Glu Lys Pro Glu Glu Ile
740 745 750


Lys Met Asp Ile Ser Lys Leu Asn Ala His Glu Thr Phe Arg Cys Asn
755 760 765

Phe 


<210> 68 

<211> 550 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (29) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (445) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (460) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (462) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (477) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (492) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (501) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (513) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (523) 

<223> n equals a,t,g, or c


<220> 


<221> SITE
<222> (534) 

<223> n equals a,t,g, or c 
<400> 68 


GGCACGAGTG GAAACTGCTA -L w^j-irivrvxW _L tjljl llataCATG GAGATAAATG TGAATTCCAG 60

TGCGATATCA CCCCCTGGGA AGATGCACGT CTCCAGATGG CAAAATCTGC 120

AATAACAGAG GGACTTGTGT ATGTGGTGAA TGTACCTGTC ACGATGTTGA TCCGACTGGG 180

GACTGGGGAG ATATTCATGG GGACACCTGT GAATGTGATG AGAGGGACTG TAGAGCTGTC 240

TATGACCGAT ATTCTGATGA CTTCTGTTCA GGTCATGGAC AGTGTAATTG CGGAAGATGT 300

GACTGCAAAG CAGGCTGGTT ATGGGAAGAA GTGTGAGCAC CCACAGTCCT GCACGCTGTC 360

AGCTGAGGAG AGCATCAGGA AGTGCCAAGG AAGCTCGGAT CTGCCTTGCT CTGGGAAGGG 420

TAAATGTGAA TGTGGCAAAT GCACNTGGTA TCCTCCAGGN GNTCCGCCGG GTGTATNGGC 480

AAACTTGTTG ANTTGTGATG NTCGCCGCTG TGNAAGACCC CGNGGTGTGG TCTNGGAAGC 540

CACGGCACAT 550


<210> 69 

<211> 495 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (10) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (56)

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (69) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (91) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (258) 

<223> n equals a,t,g, or c 


<220> 


<221> SITE
<222> (302) 

<223> n equals a,t,g, or c
<220> 

<221> SITE 
<222> (321) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (338) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (349) 

<223> n equals a,t,g, or c
<220> 

<221> SITE 
<222> (350)

<223> n equals a,t,g, or c
<220> 

<221> SITE 
<222> (355) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (359) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (362) 

<223> n equals a,t,g, or c


<220> 

<221> SITE 
<222> (363) 
<223> n equals a,t,g, or c


<220> 

<221> SITE 
<222> (369) 

<223> n equals a,t,g, or c
<220> 

<221> SITE 
<222> (378) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (386) 

<223> n equals a,t,g, or c



<220> 

<221> SITE 
<222> (387) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (388) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (395) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (401) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (423) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (448) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (468) 

<223> n equals a,t,g, or c
<220> 

<221> SITE 
<222> (494) 

<223> n equals a,t,g, or c 
<400> 69 


AATTCGGCAN AGGATTTTTT CTGGGCCATT AGAACATATA AATGCGGAGG AAACCNTGTA 60

TATTCACCNC TAGGACAGGT TAAAAAGACC NTTGTATGTT TTTCTATTTC TGAATTACGA 120

ATGAAATCCG AGTACCTATT AGAAATGAAG TTATGCAAAT TTAGATGCAA ATAACATTAG 180

AAAAAAAAGA TTCTTCCATA ATTAACATAA GTGGTTCCTA ACGAGAGCAA TTTTTCCACC 240

CAAAAGTCAT TTGGCAANAT CTACAGACCA TTTTTGATTG TCACACTGGG GTCGGGTAGG 300

ANGTATGCTG CCAGACATTT NGGTGGGGTA GAGGGCCNGG GATGCTGCNN GGGCNTCCNC 360

CNNTTGTTNC AGGCCGGNCC CCCCANNNAA GGGANTTTTT NCCCGGCCCC AAATGGCCCA 420

TTNGGGGTTC AAACTTGGGG AACCCTTNGG GTTTTTTTGG GCTTTTTNGG AATTCCCCAT 480

TTTTTTCCCC AGGNA 495


<210> 70 

<211> 495 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (10) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (126) 

<223> n equals a,t,g, or c
<220> 

<221> SITE 
<222> (182) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (192) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (248) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (350) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (370) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (377) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (380) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (389) 

<223> n equals a,t,g, or c 



<220> 

<221> SITE 
<222> (399) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (406) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (408) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (412) 

<223> n equals a,t,g, or c
<220> 

<221> SITE 
<222> (414) 

<223> n equals a,t,g, or c
<220> 

<221> SITE 
<222> (424) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (441) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (444) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (445) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (447) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (459) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 



<222> (464) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (466) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (467) 

<223> n equals a,t,g, or c
<220> 

<221> SITE 
<222> (468) 

<223> n equals a,t,g, or c
<220> 

<221> SITE 
<222> (473) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (480) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (483) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (484) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (485) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (488) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (492) 

<223> n equals a,t,g, or c 
<400> 70 

AATTCGGCAN AGCCAACATC CGCGGAAGTG TAACATGACG GAAGAACAAA GCAAGAATCT 60

GTGTGAATCA GCAGATGGCA TATTGTGCTC GGGGAAGGGT TCTTGTCATT GTGGGAAGTG 120

CATTTNTTCT GCTGAGGAGT GGTATATTTC TGGGGAGTTC TGTGAACTGT GAATGACAGA 180

GNACTGCGAC ANACATGGAT GGTCTCATTT GTACAGGGAA TGGAATATGT AGCTGTGGGA 240

AACTGTGNAA TGCTGGGGAT GGATGGGAAT GGGAAATGCA TGTGAAAATC TGGGCTTGGG 300

CTCCAGAATA TCCCTTTAAC CATTTACATG AGGAGAGGGT CTTGGATTCN TAATTTTTTC 360

CTGGGGGCCN TTAGGGNCCN TTTAAATGNC GGGGGGAANC CTGTTNTNTT TNCNCCCTGG 420

GGGNCGGTTT AAAAAGCCCC NTGNNTNTTT TTCCCTTTNC GGANTNNNGG GTNAAAACCN 480

GGNNNCCNTT TNGAA 495


<210> 71 

<211> 495 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (36) 

<223> n equals a,t,g, or c
<220> 

<221> SITE 
<222> (38) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (53) 

<223> n equals a,t,g, or c
<220> 

<221> SITE 
<222> (68)

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (69) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (81) 

<223> n equals a,t,g, or c
<220> 

<221> SITE 
<222> (82) 

<223> n equals a,t,g, or c 


<220> 
<221> SITE
<222> (103)

<223> n equals a,t,g, or c
<220> 

<221> SITE 
<222> (105) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (109) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (113) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (123) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (131) 

<223> n equals a,t,g, or c
<220> 

<221> SITE 
<222> (134) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (158) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (224) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (238) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (243) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (258) 

<223> n equals a,t,g, or c 


<220>

<221> SITE 
<222> (268) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (313) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (319) 

<223> n equals a,t,g, or c


<220> 

<221> SITE 
<222> (327) 
<223> n equals a,t,g, or c


<220> 

<221> SITE 
<222> (338)

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (339) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (358) 

<223> n equals a,t,g, or c
<220> 

<221> SITE 
<222> (365) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (370) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (371) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (374) 

<223> n equals a,t,g, or c 


<220> 

<221> SITE 
<222> (377) 


<223> n equals a,t,g, or c
<220> 

<221> SITE 
<222> (381) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (386)

<223> n equals a,t,g, or c
<220> 

<221> SITE 
<222> (391) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (394) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (400) 

<223> n equals a,t,g, or c
<220> 

<221> SITE 
<222> (408) 

<223> n equals a,t,g, or c
<220> 

<221> SITE 
<222> (423) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (428) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (446) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (456) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (460) 

<223> n equals a,t,g, or c 
<220> 



<221> SITE 
<222> (463) 

<223> n equals a,t,g, or c 


<220> 

<221> SITE 
<222> (472) 

<223> n equals a,t,g, or c
<220> 

<221> SITE 
<222> (487) 

<223> n equals a,t,g, or c 


<400> 71 
AATTCGGCAC AGGATTTTTT CTGGGCCATT AGAACNTNTA AATGCGAGGA AANCTTGTAT 60

ATTCACCNNT TGGACAGGTT NNAAAGACCG TTGTATGTTT TTNTNTTTNT GANTTACGGA 120

TGNAAATCCG NGTNACCTAT TAGAAATGGG TTATGCANAT TTAGATGCAA ATAACATTAG 180

AAAAAAAAGA TTCTTCCATA ATTAACATAA GTGGTTCCTA ACGNGAGCAA TTTTTCCNCC 240

CTNAAGTCAT TTGGCAANTC TACAGACNAT TTTGGTTGTC ACACTGGGTC GGGTAGGAAG 300

GTATGCTGGC AGNCATTTNG TGGGTANAGG GCCCTGGNNT GCTGTTGAAG CATCCCCNAG 360

TGTANCAGGN NCGNGCNCCA NACCANGGGG NTTNATCCCN GCCCCAANTG CCCATGGGGG 420

TTNAAACNTG GGGAAACGTT GGGGTNTTTT GGGCTNTTTN GGNAATTCCC CNTTTCTTTC 480

ACCAGGNAAG GCCCC 495


<210> 72 

<211> 317 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (120) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (121) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (185) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (267)
<223> n equals a,t,g, or c
<220> 

<221> SITE 
<222> (288) 

<223> n equals a,t,g, or c
<220> 

<221> SITE 
<222> (295) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (307) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (308) 

<223> n equals a,t,g, or c 
<400> 72 

GGCACAGTCA AATTTTATCG TATTTATTTA GGAATTTGTC TTCTCATTAC TCAGAAGAAC 60

ATTTATATAT TCTTGATAGG AATCCTTTGG CAGGCCATGG TAAGTGTGAC TGTGGCAAGN 120

NGCAAGTGTG ACCAGGGATG GTATGGGGAT GCTTGCCAGT ACCCAACTAA CTGTGACTTG 180

ACAANGAAGA AAAGTAACCA AATGTGGCAA GAATTCACAA GACATCATCT GCTCTAATTC 240

AGGTACATGT CACTGTGGCA GTGTAANTGT GATAGATTCA GATGGAANTG GGACNTGGTG 300

TATGGGNNAA TTTGTGG 317


<210> 73 

<211> 36 

<212> PRT 

<213> Homo sapiens 

<400> 73 

Ile Cys Gly Gly His Gly Lys Cys Tyr Cys Gly Asn Cys Tyr Cys Lys
1 5 10 15

Ala Gly Trp His Gly Asp Lys Cys Glu Phe Gln Cys Asp Ile Thr Pro
20 25 30

Trp Glu Ser Lys
35


<210> 74 

<211> 35 

<212> PRT 

<213> Homo sapiens 


<400> 74

Gly Gln Pro Pro Gly Ala Ala Leu Cys His Gly Arg Gly Arg Cys Asp
1 5 10 15

Cys Gly Val Cys Ile Cys His Val Thr Glu Pro Gly Met Phe Phe Gly
20 25 30

Pro Leu Cys
35


<210> 75 
<211> 148 
<212> PRT 

<213> Homo sapiens 
<400> 75 

Met Ala Pro Gly Leu Arg Gly Leu Pro Arg Cys Gly Leu Trp Leu Leu 
1 5 10 15 

Leu Ala His His Leu Phe Met Val Thr Ala Cys Arg Asp Pro Asp Tyr 
20 25 30 

Gly Thr Leu Ile Gln Glu Leu Cys Leu Ser Arg Phe Lys Glu Asn Met
35 40 45

Glu Thr Ile Gly Lys Thr Leu Trp Cys Asp Trp Gly Lys Thr Ile Gln
50 55 60

Ser Tyr Gly Glu Leu Thr Tyr Cys Thr Lys His Val Ala His Thr Ile
65 70 75 80

Gly Cys Phe Trp Pro Asn Pro Glu Val Asp Arg Phe Phe Ile Ala Val
85 90 95

His His Arg Tyr Phe Ser Lys Cys Pro Ile Ser Gly Arg Ala Leu Arg
100 105 110

Asp Pro Pro Asn Ser Ile Leu Cys Pro Phe Ile Ala Leu Pro Ile Thr
115 120 125

Val Thr Leu Leu Met Thr Ala Leu Val Val Trp Arg Ser Lys Arg Thr
130 135 140

Glu Gly Ile Val
145


<210> 76 

<211> 502 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (10) 

<223> n equals a,t,g, or c



<220> 

<221> SITE 
<222> (170) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (209)

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (284) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (304)

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (338) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (347) 

<223> n equals a,t,g, or c
<220> 

<221> SITE 
<222> (356) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (367) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (379) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (405) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (415) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (420) 



<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (424) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (448) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (454) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (459) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (465) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (472) 

<223> n equals a,t,g, or c
<220> 

<221> SITE 
<222> (485) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (487) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (498) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (501) 

<223> n equals a,t,g, or c
<400> 76 

AATTCGGCAN AGCTGGTCAT GGGCAGACCC CTCCCTTCCT GGGCTGACCT GCTCCCTCGA 60

GGCCAGCCTG CTCCCTGGCT GAGGCTCAGG CTATCCGCCC AAGCTCTTTG CTCATTCTAG 120

GGCCAGTGGA GGAAAATGTG ATAAGGCCAG AGCTTGTGTG CTGGGCAAGN AAATCACCTG 180

CTGCATCCTG TGCTCCGCAG CTGGGCCGNA GCCTCTGCCC GCAGTTTCTA TGCTGTTTCT 240

TAGCACAGAA TCCAGCCTAG CCTTAGCCGC AGTCTAGGCC CTGNTTGGGA CTAGGACTCC 300

TTGNTTGACC CCATCTTTGG TTCCTGCCTG GTTCCTGNAA CAGCCCNAGT TCTGGNTAAA 360

TCCAGGNAGA AAGTTAGGNA AGGGTTTTTG GAAGAAGTTC CGTGNTTTGA ACTTNGGAGN 420

CTTNTTGGTG GGAAGAAAAT TGGAAAGNTT GGGNAGGANA AAGGNGGGTT TNGGGGTTTC 480

AAGGNGNATG GGGGGTTNAA NA 502


<210> 77 

<211> 505 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (11) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (14) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (67) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (208) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (247) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (250) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (267) 

<223> n equals a,t,g, or c 


<220> 

<221> SITE 
<222> (299) 



<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (320) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (321) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (339) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (349) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (364) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (384) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (386) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (406) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (417) 

<223> n equals a,t,g, or c
<220> 

<221> SITE 
<222> (424) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (440) 

<223> n equals a,t,g, or c 


<220> 



<221> SITE 
<222> (446) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (448) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (467) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (474) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (484) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (488) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (490) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (497) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (504) 

<223> n equals a,t,g, or c 
<400> 77 


GAATTCGGCA NAGNCTGCGG CCGGCCAGCC ATGGAGACTG GAGCGCTGCG ACGCCGCAAC 60 

TTCTCCNGTT GCTGCTGCTG CTCTGCGGTG GGTGTCCCAG AGCAGGCGGC TGCAACGAGA 120 

CAGGCATGTT GGAGAGGCTG CCCCTGTGTG GGAAAGGCTT TCGCAGACAT GATGGGCAAG 180 

GTGGACGTCT GGGAATGGTG CAACTGTNCC GAGTTCATCG TGTACTATGA GAAGTTTCAC 240 

CAACTGNCAN CGAGATGGAG GCCAATNTCG TGGGCTGCTT ACTGGCCCAA CCCCCTGGNC 300 

CCAGGGCTTT CATCACCGGN NTCCACAGGC AGTTTTTTNT TCCAACTGNA ACGTGGACAA 360 

GGTNCATTTG GAGGGACCCC CCANANGAGG TTTTAATCCG TTGATNGTTA TTACCGNGGT 420 



TTTNATTTGG GCATGGTTGN CTGGTNGNTT TGGGGAAAAA GGAACGNAAA GTTNTTTTAG 480 
GGTNCCGNTN AATTGGNTTG GGTNA 505 


<210> 78 

<211> 101 

<212> DNA 

<213> Homo sapiens 


<220> 

<221> SITE 
<222> (101) 

<223> n equals a,t,g, or c 


<400> 78 

AATTCGGCAC GAGGTGAGGG GCCCTCTGGA ATGGCATCCC ATGAGCTTGT GGCCTCTATC 
TGCTACCATC TGTGTTTTAT CTGAGTAAAG TTACCTTACT N 


<210> 79 

<211> 322 

<212> PRT 

<213> Homo sapiens 


<400> 79 
Met Gly Met Leu Ala Arg Val Ala Leu Gly Leu Ile Ile Ile Asp Ala 
1 5 10 15 

Val Leu Ala Ala Pro Thr Thr Glu Leu Phe Asn Tyr Asp Ser Glu Val 
20 25 30 

Tyr Asp Ala Ile Leu Glu Asp Thr Gly Thr Phe Tyr Asn Tyr Glu His 
35 40 45 

Ile Pro Asp Asn His Val Glu Asn Glu Lys Val Ser Glu Arg Leu Ser 
50 55 60 

Gly Asn Arg Glu Leu Leu Thr Pro Gly Pro Gln Leu Gly Asp Asn Gln 
65 70 75 80 

Asp Glu Asp Lys Asp Glu Glu Ser Thr Pro Arg Leu Ile Asp Gly Ser 
85 90 95 

Ser Pro Gln Glu Pro Glu Phe Pro Gly Leu Leu Gly Pro His Thr Asn 
100 105 110 

Glu Asp Phe Pro Thr Cys Leu Leu Cys Thr Cys Ile Ser Thr Thr Val 
115 120 125 

Tyr Cys Asp Asp His Glu Leu Asp Ala Ile Pro Pro Leu Pro Lys Lys 
130 135 140 

Thr Thr Tyr Phe Tyr Ser Arg Phe Asn Arg Ile Lys Lys Ile Asn Lys 




145 150 155 160 

Asn Asp Phe Ala Ser Leu Asn Asp Leu Lys Arg Ile Asp Leu Thr Ser 
165 170 175 

Asn Leu Ile Ser Glu Ile Asp Glu Asp Ala Phe Arg Lys Leu Pro His 
180 185 190 

Leu Gln Glu Leu Val Leu Arg Asp Asn Lys Ile Lys Gln Leu Pro Glu 
195 200 205 

Leu Pro Asn Thr Leu Thr Phe Ile Asp Ile Ser Asn Asn Arg Leu Gly 
210 215 220 

Arg Lys Gly Ile Lys Gln Glu Ala Phe Lys Asp Met Tyr Asp Leu His 
225 230 235 240 

His Leu Tyr Ile Thr Asp Asn Ser Leu Asp His Ile Pro Leu Pro Leu 
245 250 255 

Pro Glu Ser Leu Arg Ala Leu His Leu Gln Asn Asn Asp Ile Leu Glu 
260 265 270 

Met His Glu Asp Thr Phe Cys Asn Val Lys Asn Leu Thr Tyr Val Arg 
275 280 285 

Lys Ala Leu Glu Asp Ile Arg Leu Asp Gly Asn Pro Ile Asn Leu Ser 
290 295 300 

Arg Thr Pro Gln Ala Tyr Met Cys Leu Pro Arg Leu Pro Ile Gly Ser 
305 310 315 320 


<210> 80 

<211> 435 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (85) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (248) 

<223> n equals a,t,g, or c 


<400> 80 

GGCAGAGCCA CCTTTCTGGA AGCTGCAGGG CTCTCCATCC AGGATCCAGA AGCATTGAAG 60 
GGGACCAGCC GCTGAAGGGA TTCTNAGTCC CATCTGACTC CCCATGAGGC TCCTGGCTTT 120 
CCTGAGTCTG CTGGCCTTGG TGCTGCAGGA GACAGGGACA GCTTCTCTCC CAAGGAAGGA 180 




GAGGAAGAGG AGAGAGGAGC AGATGCCCAG GGAAGGCGAT TCCTTTGAAG TTCTGCCTCT 240 
GCGGAATNAT GTCCTGAAGC CAGACAACTA TGGTGAAGTC ATTGACCTGA GCAACTATGA 300 
GGAGCTCACA GATTATGGGG ACCAACTCCC CGAGGTTAAG GTGACTAGCC TCGCTCCTGC 360 
AACCAGCATC AGTCCCGCCA AGAGCACTAC GGCTCCAGGG ACAACCTCGT CAAACCCCAC 420 
GGATGACCCA GACCT 435 


<210> 81 

<211> 449 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 

<222> (57) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 

<222> (98) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 

<222> (391) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 

<222> (433) 

<223> n equals a,t,g, or c 


<400> 81 

GACAGCCTCC ACCAGAGTCC CCACCTTTTT GGAAGCTGCA GGGCTCTCCA TCCAGGNTCC 60 

AGAAGCATTG AAGGGGACCA GCCGCTGAAG GGATTCTNAG TCCCATCTGA CTCCCCATGA 120 

GGCTCCTGGC TTTCCTGAGT CTGCTGGCCT TGGTGCTGCA GGAGACAGGG ACAGCTTCTT 180 

TCCCAAGGAA GGAGAGGAAG AGGAGAGAGG AGCAGATGCC CAGGGAAGGC GATTCCTTTG 240 

AAGTTCTGCC TCTGCGGAAT GATGTCCTGA ACCCAGACAA CTATGGTGAA GTCATTGACC 300 

TGAGCAACTA TGAGGAGCTC ACAGATTATG GGGACCAACT CCCCGAGGTT AAGGTGACTA 360 

GCCTCGCTCC TGCAACCAGC ATCAGTCCCG NCAAGAGCAC TACGGGCTTC AGGGGACAAC 420 

CTCGTCAAAC CCNACGGTGA CCAGACCTA 449 


<210> 82 
<211> 331 




<212> DNA 

<213> Homo sapiens 


<220> 

<221> SITE 
<222> (29) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (70) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (152) 

<223> n equals a,t,g, or c 


<400> 82 
CTGGAAGCTG CAGGGCTCTC CATCCAGGNT CCAGAAGCAT TGAAGGGGAC CAGCCGCTGA 60 

AGGGATTCTN AGTCCCATCT GACTCCCCAT GAGGCTCCTG GCTTTCCTGA GTCTGCTGGC 120 

CTTGGTGCTG CAGGAGACAG GGACAGCTTC TNTCCCAAGG AAGGAGAGGA AGAGGAGAGA 180 

GGAGCAGATG CCCAGGGAAG GCGATTCCTT TGAAGTTCTG CCTCTGCGGA ATGATGTCCT 240 

GAACCCAGAC AACTATGGTG AAGTCATTGA CCTGAGCAAC TATGAGGAGC TCACAGATTA 300 

TGGGGACCAA CTCCCCGAGG TTAAGGTGAC T 331 


<210> 83 

<211> 280 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (19) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (58) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (115) 

<223> n equals a,t,g, or c 
<400> 83 

CTGGCTTTCC TGAGTCTGNT GGCCTTGGTG CTGCAGGAGA CAGGGACAGC TTCTTTTNCA 60 

AGGAAGGAGA GGAAGAGGAG AGAGGAGCAG ATGCCCAGGG AAGGCGATTC CTTTNAAGTT 120 



CTGCCTCTGC GGAATGATGT CCTGAACCCA GACAACTATG GTGAAGTCAT TGACCTGAGC 180 

AACTATGAGG AGCTCACAGA TTATGGGGAC CAACTCCCCG AGGTTAAGGT GACTAGCCTC 240 

GCTCCTGCAA CCAGCATCAG TCCCGCCAAG AGCACTACGG 280 


<210> 84 

<211> 211 

<212> DNA 

<213> Homo sapiens 


<220> 

<221> SITE 
<222> (93) 

<223> n equals a,t,g, or c 


<400> 84 
CTTGGACCAG CGGGCATCAC ATTCTCCAGC AGCCGCCATC TCACACGCCT CCCTCCTGTG 60 

GCCGCCGGCA GCATGGACAA AGGTCTCCAT GCNGGGGGAG GAGGCCTGCT TCTTTGCCCA 120 

CAGCTCTCAC GTCTCCCTTC TCCCTGCGGG TGACAAAGAA GCCCAAGGAC CACCTCCTTC 180 

CTGCCTCATT GTAATAAAAT TCCCCACACT G 211 


<210> 85 

<211> 208 

<212> DNA 

<213> Homo sapiens 


<220> 

<221> SITE 
<222> (56) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (117) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (140) 

<223> n equals a,t,g, or c 


<400> 85 

CTGGATGGCA ACCCCATCAA CCTCAGCCTC TTCCCCAGCG CCTACTTCTG CCTGCNTCGG 60 

CTCCCCATCG GCCGCTTCAC GTAGCTCGGA GCCCTTCCAC TCCTCCCAGG TCATCTNTTG 120 

GACCAGCGGG CATCACATTN TCCAGCAGCC GCCATCTCAC ACGCCTCCCT CCTGTGGCCG 180 


CCGGCAGCAT GGACAAAGGT CTCCATGC 208 




<210> 86 

<211> 73 

<212> DNA 

<213> Homo sapiens 


<400> 86 

CGACCCACGC GTCCGCCGCC TTCGGCTTCC CCTTCTGCCA AGAGCCCTGA GCCACTCACA 
GCACGACCAG AGA 


<210> 87 

<211> 305 

<212> DNA 

<213> Homo sapiens 

<400> 87 

GTATGGAATG GGGTGGGAAC CCCTGCCTCT CACACTGGGG AGGGACCCTG GGGACAGCCT 
ATGGGCTGAG CAGAGAGGGC TCTCAGGGAC CCCTGCAGCA CAAGAATCTC CCACCACGGT 
CTCTGTCCCA GCCCTGACTC AGAAGCCTGA TGTCTACATC CCCGAGACCC TGGAGCCCGG 
GCAGCCGGTG ACGGTCATCT GTGTGTTTAA CTGGGCCTTT GAGGAATGTC CACCCCCTTC 
TTTCTCCTGG ACGGGGGCTG CCCTCTCCTC CCAAGGAACC AAACCAACGA CCTCCCACTT 


<210> 88 

<211> 175 

<212> DNA 

<213> Homo sapiens 

<400> 88 

ATCCTCCAGA GAACCTGAGA GTGATGGTTT CCCAAGCAAA CAGGACAGGT AGGAAAGGGG 60 

ACAGAGGAGC CAAGGCCTCT CAGTGCCGAA TTGGGGGCCC AGGAGTCTGG AGGGTCCCCA 120 

CGCAGGAGGG TCCCTGAGCC CTGAGCTGCT CATCGATTCT GCCTCTTCCT TCCCT 175 


<210> 89 

<211> 72 

<212> DNA 

<213> Homo sapiens 


<400> 89 

GTGAGTGGGG GAAAGGGGAC ACCTGGGTCC CAGGAAGGGG ACCCTGCTGA GTCCTGTCCT 
CCCTCCCCTC AG 




<210> 90 

<211> 421 

<212> DNA 

<213> Homo sapiens 

<400> 90 

CTGGCCCCCT GGCTCAGAAG CGGAATCAGA AAGCCACACC AAACAGTCCT CGGACCCCTC 60 

TTCCACCAGG TGCTCCCTCC CCAGAATCAA AGAAGAACCA GAAAAAGCAG TATCAGTTGC 120 

CCAGTTTCCC AGAACCCAAA TCATCCACTC AAGCCCCAGA ATCCCAGGAG AGCCAAGAGG 180 

AGCTCCATTA TGCCACGCTC AACTTCCCAG GCGTCAGACC CAGGCCTGAG GCCCGGATGC 240 

CCAAGGGCAC CCAGGCGGAT TATGCAGAAG TCAAGTTCCA ATGAGGGTCT CTTAGGCTTT 300 

AGGACTGGGA CTTCGGCTAG GGAGGAAGGT AGAGTAAGAG GTTGAAGATA ACAGAGTGCA 360 

AAGTTTCCTT CTCTCCCTCT CTCTCTCTCT TTCTCTCTCT CTCTCTCTTT CTCTCTCTTT 420 

T 421 


<210> 91 

<211> 964 

<212> DNA 

<213> Homo sapiens 

<400> 91 

AAAAAAACAT CTGGCCAGGG CACAGTGGCT CACGCCTGTA ATCCCAGCAC TTTGGGAGGT 60 

TGAGGTGGGC AGATCGCCTG AGGTCGGGAG TTCGAGACCA GCCTGGCCAA CTTGGTGAAA 120 

CCCCGTCTCT ACTAAAAATA CAAAAATTAG CTGGGCATGG TGGCAGGCGC CTGTAATCCT 180 

ACTACTTGGG AAGCTGAGGC AGGAGAATCA CTTGAACCTG GGAGACGGAG GTTGCAGTGA 240 

GCCAAGATCA CACCATTGCA CGCCAGCTTG GGCAACAAAG CGAGACTCCA TCTCAAAAAA 300 

AAAATCCTCC AAATGGGTTG GGTGTCTGTA ATCCCAGCAC TTTGGGAGGC TAAGGTGGGT 360 

GGATTGCTTG AGCCCAGGAG TTCGAGACCA GCCTGGGCAA CATGGTGAAA CCCCATCTCT 420 

ACAAAAAATA CAAAACATAG CTGGGCTTGG TGGTGTGTGC CTGTAGTCCC AGCTGTCAGA 480 

CATTTAAACC AGAGCAACTC CCATCTGGAA TGGGAGCTGA ATAAAATGAG GCTGAGACCT 540 

ACTGGGCTGC CATTCTCAGA CAGTGGAGGC CATTCTAAGT CACAGGATGA GACAGGAGGT 600 

CCGTACAAGA TACAGGTCAT AAAGACTTTG CTGATAAAAC AGATTGCAGT AAAGAAGCCA 660 

ACCAAATCCC ACCAAAACCA AGTTGGCCAC GAGAGTGACC TCTGGTCGTC CTCACTGCTA 720 

CACTCCTGAC AGCACCATGA CAGTTTACAA ATGCCATGGC AACATCAGGA AGTTACCCGA 780 


TATGTCCCAA AAGGGGGAGG AATGAATAAT CCACCCCTTG TTTAGCAAAT AAGCAAGAAA 840 

TAACCATAAA AGTGGGCAAC CAGCAGCTCT AGGCGCTGCT CTTGTCTATG GAGTAGCCAT 900 

TCTTTTGTTC CTTTACTTTC TTAATAAACT TGCTTTCACC TTAAAAAAAA AAAAAAAAAA 960 

AAAA 964 


<210> 92 
<211> 364 
<212> PRT 

<213> Homo sapiens 


<400> 92 
Met Pro Leu Leu Leu Leu Leu Pro Leu Leu Trp Ala Gly Ala Leu Ala 
1 5 10 15 

Met Asp Pro Asn Phe Trp Leu Gln Val Gln Glu Ser Val Thr Val Gln 
20 25 30 

Glu Gly Leu Cys Val Leu Val Pro Cys Thr Phe Phe His Pro Ile Pro 
35 40 45 

Tyr Tyr Asp Lys Asn Ser Pro Val His Gly Tyr Trp Phe Arg Glu Gly 
50 55 60 

Ala Ile Ile Ser Gly Asp Ser Pro Val Ala Thr Asn Lys Leu Asp Gln 
65 70 75 80 

Glu Val Gln Glu Glu Thr Gln Gly Arg Phe Arg Leu Leu Gly Asp Pro 
85 90 95 

Ser Arg Asn Asn Cys Ser Leu Ser Ile Val Asp Ala Arg Arg Arg Asp 
100 105 110 

Asn Gly Ser Tyr Phe Phe Arg Met Glu Arg Gly Ser Thr Lys Tyr Ser 
115 120 125 

Tyr Lys Ser Pro Gln Leu Ser Val His Val Thr Asp Leu Thr His Arg 
130 135 140 

Pro Lys Ile Leu Ile Pro Gly Thr Leu Glu Pro Gly His Ser Lys Asn 
145 150 155 160 

Leu Thr Cys Ser Val Ser Trp Ala Cys Glu Gln Gly Thr Pro Pro Ile 
165 170 175 

Phe Ser Trp Leu Ser Ala Ala Pro Thr Ser Leu Gly Pro Arg Thr Thr 
180 185 190 

His Ser Ser Val Leu Ile Ile Thr Pro Arg Pro Gln Asp His Gly Thr 
195 200 205 

Asn Leu Thr Cys Gln Val Lys Phe Ala Gly Ala Gly Val Thr Thr Glu 
210 215 220 




Arg Thr Ile Gln Leu Asn Val Thr Tyr Val Pro Gln Asn Pro Thr Thr 
225 230 235 240 

Gly Ile Phe Pro Gly Asp Gly Ser Gly Lys Gln Glu Thr Arg Ala Gly 
245 250 255 

Val Val His Gly Ala Ile Gly Gly Ala Gly Val Thr Ala Leu Leu Ala 
260 265 270 

Leu Cys Leu Cys Leu Ile Phe Phe Ile Val Lys Thr His Arg Arg Lys 
275 280 285 

Ala Ala Arg Thr Ala Val Gly Arg Asn Asp Thr His Pro Thr Thr Gly 
290 295 300 

Ser Ala Ser Pro Lys His Gln Lys Lys Ser Lys Leu His Gly Pro Thr 
305 310 315 320 

Glu Thr Ser Ser Cys Ser Gly Ala Ala Pro Thr Val Glu Met Asp Glu 
325 330 335 

Glu Leu His Tyr Ala Ser Leu Asn Phe His Gly Met Asn Pro Ser Lys 
340 345 350 

Asp Thr Ser Thr Glu Tyr Ser Glu Val Arg Thr Gln 
355 360 


<210> 93 

<211> 397 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (19) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (68) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (148) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (262) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (323) 

<223> n equals a,t,g, or c 



<220> 

<221> SITE 
<222> (361) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (386) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (388) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (390) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (392) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (394) 

<223> n equals a,t,g, or c 


<400> 93 

GGCAGAGTGG CCCCCTGGNT CAGAAGCGGA ATCAGAAAGC CACACCAAAC AGTCCTCGGA 60 

CCCCTCTNCC ACCAGGTGCT CCCTCCCCAG AATCAAAGAA GAACCAGAAA AAGCAGTATC 120 

AGTTGCCCAG TTTCCCAGAA CCCAAATNAT CCACTCAAGC CCCAGAATCC CAGGAGAGCC 180 

AAGAGGAGCT CCATTATGCC ACGCTCAACT TCCCAGGCGT CAGACCCAGG CCTGAGGCCG 240 

GTATGCCCAA GGGCACCCAG GNGGATTATG CAGAAGTCAA GTTCCAATGA GGGTCTCTTA 300 

GGCTTTAGGA CTGGGACTTC GGNTAGGGAG GAAGGTAGAG TAAGAGGTTG AAGTTAACAG 360 

NTGCAAATTT CCTTTTTTTC CTTTTNTNTN TNTNTTT 397 


<210> 94 

<211> 276 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (190) 

<223> n equals a,t,g, or c 


<220> 
<221> SITE 
<222> (241) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (262) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (264) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (265) 

<223> n equals a,t,g, or c 
<400> 94 

GCTCCTACCC CCGACAAGAC TGGACAGGGT CTACCCCAGC TTATGGCTAC TGGTTCAAAG 
CAGTGACTGA GACAACCAAG GGTGCTCCTG TGGCCACAAA CCACCAGAGT CGAGAGGTGG 
AAATGAGCAC CCGGGGCCGA TTCCAGCTCA CTGGGGATCC CGCCAAGGGG AACTGCTCCT 
TGGTGATCAN AAGACGCGCA GATGCAGGAT GAGTCACAGT ACTTCTTTCG GGTGGAGAGA 
NGAACTATGT GAGATATAAT GNCNNGAACG ATGGGT 


<210> 95 

<211> 469 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (348) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (375) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (430) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (445) 

<223> n equals a,t,g, or c 


<220> 
<221> SITE 
<222> (457) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (469) 

<223> n equals a,t,g, or c 
<400> 95 


GCAACGACCT CCCACTTCTC AGTGCTCAGC TTCACGCCCA GACCCCAGGA CCACAACACC 60 

GACCTCACCT GCCATGTGGA CTTCTCCAGA AAGGGTGTGA GCGTACAGAG GACCGTCCGA 120 

CTCCGTGTGG CCTATGCCCC CAGAGACCTT GTTATCAGCA TTTCACGTGA CAACACGCCA 180 

GATCCTCCAG AGAACCTGAG AGTGATGGTT TCCCAAGCAA ACAGGACAGT CCTGGAAAAC 240 

CTTGGGAACG GCACGTCTCT CCCAGTACTG GAGGGCCAAA GCCTGTGCCT GGTCTGTGTC 300 

ACACACAGCA GCCCCCCAGC CAGGCTGAGC TGGACCCAGA AGGGACANGT TCTGAGCCCC 360 

TCCCAGCCCT CAGANCCCGG GGTCCTGGAG TGCCTTCGGG TTCAAGTGGA GCAACGAAAG 420 

GAGAGTTCAN CTGGCAAGGT TCGGNAACCA ATTGGGNTTC CAAGAAGGN 469 


<210> 96 

<211> 398 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (160) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (189) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (221) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (248) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (285) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (326) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (339) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (344) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (382) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (392) 

<223> n equals a,t,g, or c 


<400> 96 

ACGGAGCGTT TCTGGGAATC GGCATCACGG CTCTTCTTTT CCTCTGCCTG GCCCTGATCA 60 

TCATGAAGAT TCTACCGAAG AGACGGACTC AGACAGAAAC CCCGTAGGCC CAGGTTCTCC 120 

CGGCACAGCA CGATCCTGGA TTACATCAAT GTGGTCCCGN ACGGACTGGC CCCCTGGCTC 180 

AGAAGCGGNA ATCAGTAAAG CCACACCAAA CAGTCCTCGG NACCCCTCTT GCCACCAGGT 240 

GCTCCCTNCC CCAGGAATGC AAAGAAGAAC CAGAAAAAGG CAGTNATGCA GTTGCCCAGT 300 

TTGCCCAGAA CCCAAATCAT TCCATNCAAG CCCCAGAANC CCANGAGAGC CAAGAGGAGT 360 

TCCATTAAGG CCAGGTTCAA ANTTTCCCAG GNGTTCGA 398 


<210> 97 

<211> 256 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 

<222> (18) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 

<222> (40) 

<223> n equals a,t,g, or c 


<220> 
<221> SITE 
<222> (52) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (68) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (73) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (85) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (91) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (126) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (127) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (162) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (164) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (184) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (251) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (256) 

<223> n equals a,t,g, or c 
<400> 97 

TCGACCCACG CATCCGCNGC CTTCAGCTTC CCCTTCTGCN AAGAGCCCTG ANCCACTCAC 60 

AGCACGANCA GANAACAGGC CTGTNTCAAG NAGGCCCTGC GCCTCCTATG CGGAGATGCT 120 

ACTGCNNCTG CTGCTGTCCT CGCTGCTGGG CGGGTCCCAG GNTNTGGATG GGAGATTCTG 180 

GATNCGAGTG CAGGAGTCAG TGATGGTGCC GGAAGGCCTG TGCATCTCTG TGCCCTGCTC 240 

TTTCTCCTAC NCCCGN 256 


<210> 98 

<211> 299 

<212> DNA 

<213> Homo sapiens 


<220> 

<221> SITE 
<222> (239) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (264) 

<223> n equals a,t,g, or c 


<400> 98 

CCAACAGCTC CCTGAGCCTC CATGGAGGGC TCAGCTCCGG CCTCAGGCTC CGCTGTAAGG 60 

CCTGGAACGT CCACGGGGCC CAGAGTGGCT CTGTCTTCCA GCTGCTACCA GGGTGAAGAT 120 

CTGCAGGAAG GAAGCTCGCA AGAGGGCAGC AGCTGAGCAG GACGTGCCCT CCACCCTGGG 180 

ACCCATCTCC CAGGGTCACC AGCATGAATG CTCGGCAGGC AGCTCCCAAG ACCACCCGNC 240 

CCCAGGTGCA GCCACCTACA CCCNGGGGAA GGGGGAAGAG CAGGAGCTCC ACTATGCCT 299 


<210> 99 

<211> 442 

<212> PRT 

<213> Homo sapiens 


<400> 99 

Met Leu Pro Leu Leu Leu Pro Leu Leu Trp Ala Gly Ala Leu Ala Gln 
1 5 10 15 

Glu Arg Arg Phe Gln Leu Glu Gly Pro Glu Ser Leu Thr Val Gln Glu 
20 25 30 

Gly Leu Cys Val Leu Val Pro Cys Arg Leu Pro Thr Thr Leu Pro Ala 
35 40 45 

Ser Tyr Tyr Gly Tyr Gly Tyr Trp Phe Leu Glu Gly Ala Asp Val Pro 
50 55 60 

Val Ala Thr Asn Asp Pro Asp Glu Glu Val Gln Glu Glu Thr Arg Gly 
65 70 75 80 

Arg Phe His Leu Leu Trp Asp Pro Arg Arg Lys Asn Cys Ser Leu Ser 
85 90 95 

Ile Arg Asp Ala Arg Arg Arg Asp Asn Ala Ala Tyr Phe Phe Arg Leu 
100 105 110 

Lys Ser Lys Trp Met Lys Tyr Gly Tyr Thr Ser Ser Lys Leu Ser Val 
115 120 125 

Arg Val Met Ala Leu Thr His Arg Pro Asn Ile Ser Ile Pro Gly Thr 
130 135 140 

Leu Glu Ser Gly His Pro Ser Asn Leu Thr Cys Ser Val Pro Trp Val 
145 150 155 160 

Cys Glu Gln Gly Thr Pro Pro Ile Phe Ser Trp Met Ser Ala Ala Pro 
165 170 175 

Thr Ser Leu Gly Pro Arg Thr Thr Gln Ser Ser Val Leu Thr Ile Thr 
180 185 190 

Pro Arg Pro Gln Asp His Ser Thr Asn Leu Thr Cys Gln Val Thr Phe 
195 200 205 

Pro Gly Ala Gly Val Thr Met Glu Arg Thr Ile Gln Leu Asn Val Ser 
210 215 220 

Tyr Ala Pro Gln Lys Val Ala Ile Ser Ile Phe Gln Gly Asn Ser Ala 
225 230 235 240 

Ala Phe Lys Ile Leu Gln Asn Thr Ser Ser Leu Pro Val Leu Glu Gly 
245 250 255 

Gln Ala Leu Arg Leu Leu Cys Asp Ala Asp Gly Asn Pro Pro Ala His 
260 265 270 

Leu Ser Trp Phe Gln Gly Phe Pro Ala Leu Asn Ala Thr Pro Ile Ser 
275 280 285 

Asn Thr Gly Val Leu Glu Leu Pro Gln Val Gly Ser Ala Glu Glu Gly 
290 295 300 

Asp Phe Thr Cys Arg Ala Gln His Pro Leu Gly Ser Leu Gln Ile Ser 
305 310 315 320 

Leu Ser Leu Phe Val His Trp Lys Pro Glu Gly Arg Ala Gly Gly Val 
325 330 335 

Leu Gly Ala Val Trp Gly Ala Ser Ile Thr Thr Leu Val Phe Leu Cys 
340 345 350 

Val Cys Phe Ile Phe Arg Val Lys Thr Arg Arg Lys Lys Ala Ala Gln 
355 360 365 

Pro Val Gln Asn Thr Asp Asp Val Asn Pro Val Met Val Ser Gly Ser 
370 375 380 

Arg Gly His Gln His Gln Phe Gln Thr Gly Ile Val Ser Asp His Pro 
385 390 395 400 

Ala Glu Ala Gly Pro Ile Ser Glu Asp Glu Gln Glu Leu His Tyr Ala 
405 410 415 

Val Leu His Phe His Lys Val Gln Pro Gln Glu Pro Lys Val Thr Asp 
420 425 430 

Thr Glu Tyr Ser Glu Ile Lys Ile His Lys 
435 440 


<210> 100 
<211> 258 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (111) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (193) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (196) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (230) 

<223> n equals a,t,g, or c 
<400> 100 

AAACTCGGGA CCGATTCCAC CTCCTTGGGG ACCCACAGAC CAAAAATTGC ACCCTGAGCA 60 

TCAGAGATCC AGAATGAGTG ATGCGGGGAG ATACTTCTTT CGTATGGAGA NAGGAAATAT 120 
AAAATGGAAT TATAAATATG ACCAGCTCTC TGTGAACGTG ACAGACCCTC CTCAGAACTT 180 
GACTGTGACT GTNTTNCAAG GAGAAGGCAC AGCATCCACA GCTCTGGGGN ACAGCTCATC 240 
TCTTTCAGTC CTAGAGGG 258 


<210> 101 
<211> 139 
<212> DNA 
<213> Homo sapiens 
<220> 

<221> SITE 
<222> (21) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (26) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (34) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (37) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (92) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (110) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (123) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (131) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (133) 

<223> n equals a,t,g, or c 
<400> 101 

CTCGCTGACG ATGCAGAGTT NCGTGNCCGT GCANGANGGC ATGTGTGTCC ATGTGCGCTG 
CTCCTTCTCC TACCCAGTGG ACGCCAGACT GNATCTGCAC ACGTTCTGGN TATGGTTCGG 
GCNGGAATGA NTNAGCTGG 


<210> 102 
<211> 33 
<212> PRT 

<213> Homo sapiens 
<400> 102 

Thr Asn Gly Tyr Gln Lys Thr Gly Asp Val Tyr Lys Cys Pro Val Ile 
1 5 10 15 

His Gly Asn Cys Thr Lys Leu Asn Leu Gly Arg Val Thr Leu Ser Asn 
20 25 30 


<210> 103 

<211> 1151 

<212> PRT 

<213> Homo sapiens 

<400> 103 

Phe Asn Val Asp Val Lys Asn Ser Met Thr Phe Ser Gly Pro Val Glu 
1 5 10 15 

Asp Met Phe Gly Tyr Thr Val Gln Gln Tyr Glu Asn Glu Glu Gly Lys 
20 25 30 

Trp Val Leu Ile Gly Ser Pro Leu Val Gly Gln Pro Lys Asn Arg Thr 
35 40 45 

Gly Asp Val Tyr Lys Cys Pro Val Gly Arg Gly Glu Ser Leu Pro Cys 
50 55 60 

Val Lys Leu Asp Leu Pro Val Asn Thr Ser Ile Pro Asn Val Thr Glu 
65 70 75 80 

Val Lys Glu Asn Met Thr Phe Gly Ser Thr Leu Val Thr Asn Pro Asn 
85 90 95 

Gly Gly Phe Leu Ala Cys Gly Pro Leu Tyr Ala Tyr Arg Cys Gly His 
100 105 110 

Leu His Tyr Thr Thr Gly Ile Cys Ser Asp Val Ser Pro Thr Phe Gln 
115 120 125 

Val Val Asn Ser Ile Ala Pro Val Gln Glu Cys Ser Thr Gln Leu Asp 
130 135 140 

Ile Val Ile Val Leu Asp Gly Ser Asn Ser Ile Tyr Pro Trp Asp Ser 
145 150 155 160 

Val Thr Ala Phe Leu Asn Asp Leu Leu Lys Arg Met Asp Ile Gly Pro 
165 170 175 

Lys Gln Thr Gln Val Gly Ile Val Gln Tyr Gly Glu Asn Val Thr His 
180 185 190 

Glu Phe Asn Leu Asn Lys Tyr Ser Ser Thr Glu Glu Val Leu Val Ala 
195 200 205 

Ala Lys Lys Ile Val Gln Arg Gly Gly Arg Gln Thr Met Thr Ala Leu 
210 215 220 

Gly Thr Asp Thr Ala Arg Lys Glu Ala Phe Thr Glu Ala Arg Gly Ala 
225 230 235 240 

Arg Arg Gly Val Lys Lys Val Met Val Ile Val Thr Asp Gly Glu Ser 
245 250 255 

His Asp Asn His Arg Leu Lys Lys Val Ile Gln Asp Cys Glu Asp Glu 
260 265 270 

Asn Ile Gln Arg Phe Ser Ile Ala Ile Leu Gly Ser Tyr Asn Arg Gly 
275 280 285 

Asn Leu Ser Thr Glu Lys Phe Val Glu Glu Ile Lys Ser Ile Ala Ser 
290 295 300 

Glu Pro Thr Glu Lys His Phe Phe Asn Val Ser Asp Glu Leu Ala Leu 
305 310 315 320 

Val Thr Ile Val Lys Thr Leu Gly Glu Arg Ile Phe Ala Leu Glu Ala 
325 330 335 

Thr Ala Asp Gln Ser Ala Ala Ser Phe Glu Met Glu Met Ser Gln Thr 
340 345 350 

Gly Phe Ser Ala His Tyr Ser Gln Asp Trp Val Met Leu Gly Ala Val 
355 360 365 

Gly Ala Tyr Asp Trp Asn Gly Thr Val Val Met Gln Lys Ala Ser Gln 
370 375 380 

Ile Ile Ile Pro Arg Asn Thr Thr Phe Asn Val Glu Ser Thr Lys Lys 
385 390 395 400 

Asn Glu Pro Leu Ala Ser Tyr Leu Gly Tyr Thr Val Asn Ser Ala Thr 
405 410 415 

Ala Ser Ser Gly Asp Val Leu Tyr Ile Ala Gly Gln Pro Arg Tyr Asn 
420 425 430 

His Thr Gly Gln Val Ile Ile Tyr Arg Met Glu Asp Gly Asn Ile Lys 
435 440 445 

Ile Leu Gln Thr Leu Ser Gly Glu Gln Ile Gly Ser Tyr Phe Gly Ser 
450 455 460 

Ile Leu Thr Thr Thr Asp Ile Asp Lys Asp Ser Asn Thr Asp Ile Leu 
465 470 475 480 

Leu Val Gly Ala Pro Met Tyr Met Gly Thr Glu Lys Glu Glu Gln Gly 
485 490 495 

Lys Val Tyr Val Tyr Ala Leu Asn Gln Thr Arg Phe Glu Tyr Gln Met 
500 505 510 

Ser Leu Glu Pro Ile Lys Gln Thr Cys Cys Ser Ser Arg Gln His Asn 
515 520 525 

Ser Cys Thr Thr Glu Asn Lys Asn Glu Pro Cys Gly Ala Arg Phe Gly 
530 535 540 

Thr Ala Ile Ala Ala Val Lys Asp Leu Asn Leu Asp Gly Phe Asn Asp 
545 550 555 560 

Ile Val Ile Gly Ala Pro Leu Glu Asp Asp His Gly Gly Ala Val Tyr 
565 570 575 

Ile Tyr His Gly Ser Gly Lys Thr Ile Arg Lys Glu Tyr Ala Gln Arg 
580 585 590 

Ile Pro Ser Gly Gly Asp Gly Lys Thr Leu Lys Phe Phe Gly Gln Ser 
595 600 605 

Ile His Gly Glu Met Asp Leu Asn Gly Asp Gly Leu Thr Asp Val Thr 
610 615 620 

Ile Gly Gly Leu Gly Gly Ala Ala Leu Phe Trp Ser Arg Asp Val Ala 
625 630 635 640 

Val Val Lys Val Thr Met Asn Phe Glu Pro Asn Lys Val Asn Ile Gln 
645 650 655 

Lys Lys Asn Cys His Met Glu Gly Lys Glu Thr Val Cys Ile Asn Ala 
660 665 670 

Thr Val Cys Phe Glu Val Lys Leu Lys Ser Lys Glu Asp Thr Ile Tyr 
675 680 685 

Glu Ala Asp Leu Gln Tyr Arg Val Thr Leu Asp Ser Leu Arg Gln Ile 
690 695 700 

Ser Arg Ser Phe Phe Ser Gly Thr Gln Glu Arg Lys Val Gln Arg Asn 
705 710 715 720 

Ile Thr Val Arg Lys Ser Glu Cys Thr Lys His Ser Phe Tyr Met Leu 
725 730 735 

Asp Lys His Asp Phe Gln Asp Ser Val Arg Ile Thr Leu Asp Phe Asn 
740 745 750 

Leu Thr Asp Pro Glu Asn Gly Pro Val Leu Asp Asp Ser Leu Pro Asn 
755 760 765 

Ser Val His Glu Tyr Ile Pro Phe Ala Lys Asp Cys Gly Asn Lys Glu 
770 775 780 

Lys Cys Ile Ser Asp Leu Ser Leu His Val Ala Thr Thr Glu Lys Asp 
785 790 795 800 

Leu Leu Ile Val Arg Ser Gln Asn Asp Lys Phe Asn Val Ser Leu Thr 
805 810 815 

Val Lys Asn Thr Lys Asp Ser Ala Tyr Asn Thr Arg Thr Ile Val His 
820 825 830 

Tyr Ser Pro Asn Leu Val Phe Ser Gly Ile Glu Ala Ile Gln Lys Asp 
835 840 845 

Ser Cys Glu Ser Asn His Asn Ile Thr Cys Lys Val Gly Tyr Pro Phe 
850 855 860 

Leu Arg Arg Gly Glu Met Val Thr Phe Lys Ile Leu Phe Gln Phe Asn 
865 870 875 880 

Thr Ser Tyr Leu Met Glu Asn Val Thr Ile Tyr Leu Ser Ala Thr Ser 
885 890 895 

Asp Ser Glu Glu Pro Pro Glu Thr Leu Ser Asp Asn Val Val Asn Ile 
900 905 910 

Ser Ile Pro Val Lys Tyr Glu Val Gly Leu Gln Phe Tyr Ser Ser Ala 
915 920 925 

Ser Glu Tyr His Ile Ser Ile Ala Ala Asn Glu Thr Val Pro Glu Val 
930 935 940 

Ile Asn Ser Thr Glu Asp Ile Gly Asn Glu Ile Asn Ile Phe Tyr Leu 
945 950 955 960 

Ile Arg Lys Ser Gly Ser Phe Pro Met Pro Glu Leu Lys Leu Ser Ile 
965 970 975 

Ser Phe Pro Asn Met Thr Ser Asn Gly Tyr Pro Val Leu Tyr Pro Thr 
980 985 990 

Gly Leu Ser Ser Ser Glu Asn Ala Asn Cys Arg Pro His Ile Phe Glu 
995 1000 1005 

Asp Pro Phe Ser Ile Asn Ser Gly Lys Lys Met Thr Thr Ser Thr Asp 
1010 1015 1020 

His Leu Lys Arg Gly Thr Ile Leu Asp Cys Asn Thr Cys Lys Phe Ala 
1025 1030 1035 1040 

Thr Ile Thr Cys Asn Leu Thr Ser Ser Asp Ile Ser Gln Val Asn Val 
1045 1050 1055 

Ser Leu Ile Leu Trp Lys Pro Thr Phe Ile Lys Ser Tyr Phe Ser Ser 
1060 1065 1070 

Leu Asn Leu Thr Ile Arg Gly Glu Leu Arg Ser Glu Asn Ala Ser Leu 
1075 1080 1085 

Val Leu Ser Ser Ser Asn Gln Lys Arg Glu Leu Ala Ile Gln Ile Ser 
1090 1095 1100 

Lys Asp Gly Leu Pro Gly Arg Val Pro Leu Trp Val Ile Leu Leu Ser 
1105 1110 1115 1120 

Ala Phe Ala Gly Leu Leu Leu Leu Met Leu Leu Ile Leu Ala Leu Trp 
1125 1130 1135 

Lys Ile Gly Phe Phe Lys Arg Pro Leu Lys Lys Lys Met Glu Lys 
1140 1145 1150 


<210> 104 
<211> 389 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (256) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (279) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (302) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (310) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (347) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (349) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (382) 

<223> n equals a,t,g, or c 
<400> 104 

GCCTCCAGTA TTTTGGCTGC AGCATCCACG GGCAATTGGT ACCTCAATGA GGGATGGGCT 
CATCGACCTG GCAGTGGGAG CCCTTGGCAA CGCTGTGATT CTGTGGTCCC GCCCAGTGGT 
TCAGATCAAT GCCAGCCTCC ACTTTGAGCC ATCCAAGATC AACATCTTCC ACAGAGACTG 
CAAGCGCAGT GGCAGGGATG CCACCTGCCT GGCCGCCTTC CTCTGCTTCA CGCCCATCTT 
CCTGGCACCC CATTTNCAAA CAACAACTGT TGGCATCANA TACAACGCCA CCATGGGATG 
ANAGGCGGTN TACACCGAGG GCCCACCTGG ACAAGGCGGG GACCGANTNA CAACAGAACC 
GTACTGGTTT TCTTCGGCCA GNAACTTGT 389 


<210> 105 

<211> 491 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (11) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (212) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (268) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (274) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (290) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (297) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (317) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (355) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (381) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (384) 

<223> n equals a,t,g, or c 




<220> 

<221> SITE 
<222> (398) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (400) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (404) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (405) 

<223> n equals a,t,g, or c 


<220> 

<221> SITE 
<222> (407) 
<223> n equals a,t,g, or c 


<220> 

<221> SITE 
<222> (413) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (414) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (415) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (434) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (438) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (443) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 



<222> (462) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (465) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (466) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (467) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (482) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (483) 

<223> n equals a,t,g, or c 
<400> 105 


GAATTCGGCA NAGGCAGACA CTGCTCCTTG GGTAGCCATA ACTGTGTCCC CACTTTCCAG 60 

GGTGCCAGAC CTAGATGGAC ATCGTCATTG TCCTGGATGG GTCCAACAGC ATCTACCCCT 120 

GGGTGGGGGT TCAGCACTTC CTCATCAACA TCCTGAAAAA GTTTTACATT GGCCCAGGGC 180 

AGATCCAGGT TGGAGTTGTG CAGTATGGCG ANGATGTGGT GCATGAGTTT CACCTCAATG 240 

ACTACAGGTC TGTAAAAGAT GTGGTGGNAG CTGNCAGCCA CATTGAGCCN GAGAGGNGGG 300 

ACAGAGACCC GGACGGNATT TGGCATTGGA ATTTGGCACG GTTAAAAAAA AAGTNGGGCC 360 

AAAAAAATTT TTTTTGGGTC NCANATGCTT TTGTAGTNAN TCCNNTNGGC TTNNNAACAA 420 

ATTTCCCCAA ATTNGGTNCC CTNGGGCCAT ATTTTTTTAT TNCCNNNAAT TTTGGGGTTT 480 

GNNTGGGTCA A 491 


<210> 106 
<211> 340 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 

<222> (5) 

<223> n equals a,t,g, or c 



<220> 

<221> SITE 
<222> (188) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (251) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (292) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (308) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (312) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (327) 

<223> n equals a,t,g, or c 


<400> 106 
GGCANAGGAG TAAATGAGCG GGACAGCACC AAGGAAGACA ACGTGGCCCC CTTACGCTTC 60 

CACCTCAAAT ACGAGGCTGA CGTCCTCTTC ACCAGGAGCA GCAGCCTGAG CCACTACGAA 120 

GGTCAAGCTC AACAGCTCGC TGGAGAGATA CGATGGTATC GGGCCTCCCT TCAGCTGCAT 180 

CTTCAGGNTC CAGAACTTGG GCTTGTTCCC CATCCACGGG ATGATGGATG GAAGATCACC 240 

ATTCCCATCG NCACCAGGAG CGGCAACCGC CTTACTGAAG TTGAGGGACT TNCTCAAGGA 300 

CGAGGCGNAC ANGTCCTGTA AACATTNGGG GCAATAGCAT 340 


<210> 107 
<211> 356 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (5) 

<223> n equals a,t,g, or c 


<220> 

<221> SITE 



<222> (9) 

<223> n equals a,t,g, or c 


<220> 

<221> SITE 
<222> (84) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (298) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (311) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (312) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (336) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (339) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (342) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (343) 

<223> n equals a,t,g, or c 


<400> 107 

GGCANAGGNG AAATCAATTT CCATCTACTG GGGAACCTGT GGTTGAGGTC CCTAAAAGCA 60 

CTCAAGTACA GATCCATGAA AATNATGGTC AACGCAGCCT TGCAGAGGCA GGTTGCACAG 120 

CCCCTTCATC TTCCGTGAAG GAGGTATCCC AGCCGCCAGA TCGTGTTTGA GAATCTCCAA 180 

GCAAGAGGAC TGGCAGGTCC CCATCTGGAT CATTGTAGGC AGCACCCTGG GGGGCCTCCT 240 

ACTGCTGGCC CTGCTGGTTC CTGGGCACTG TGGGAAGCTT CGGTTTCTTT AAGAAGTNTC 300 

CAGGTCGCAG NNGGGGTAGC CTGGTTCTGG GACCCNAANC CNNAAAATTG CTGGGT 356 



<210> 108 
<211> 262 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (76) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (216) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (234) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (248) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (257) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (259) 

<223> n equals a,t,g, or c 


<400> 108 
CTGCACACCG GACCCAGCCG CCGTGCCGCG GGCCATGGTA CCTGCCCAGG GGCCTGGTGG 60 

TGGCCTGGGC GCTCANCCTG TGGCCAGGGT TCACGGACAC CTTCAACATG GTACACCAGG 120 

AAGCCCCGGG TCATCCCTGG CTCCAGGACC GCCTTCTTTG GCTACACAGT GCAGCAGCAC 180 

GACATCAGTG GCAATAAGTG GCTGGTCCTT GGGGCNCCCC ACTGGAAACC AATNGCTACA 240 

AAAAAAANGG AAAACTNTNC AA 262 


<210> 109 
<211> 327 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (7) 

<223> n equals a,t,g, or c 




<220> 

<221> SITE 
<222> (33) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (47) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (55) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (67) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (98) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (153) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (186) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (259) 

<223> n equals a,t,g, or c 


<220> 

<221> SITE 
<222> (283) 
<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (293) 

<223> n equals a,t,g, or c 


<220> 

<221> SITE 

<222> (316) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 

<222> (326) 



<223> n equals a,t,g, or c 
<400> 109 

GGCAGANCTA CAGCACGGTC CTAAATATCT CGNAGTCAGC AAACCTNCAG TTTCNAGCTT 60 

AATCCANAAG GAGGACTCAA ACGGTAGCAT TGAGTGTNTG AACGAGGAGA GGAGGCTCCA 120 

GAAGCAAGTT TGCAACGTCA GCTATCCCTT CTNCCGGGCC AAGGCCAAGG TGGCTTTCCG 180 

TCTTGNATTT TAAGTTCAGC AAATCCATCT TCCTACACCA CCTGGAGATC GAGCCTCGCT 240 

GCAGCAGTGA CAGTAATGNG ATGAGGGGTT AGCCGGGGTT TGNACCCAAG TGNATTTCAC 300 

ATTGGCCGGA AATTTNAAAA TTTCCNC 327 


<210> 110 
<211> 301 
<212> DNA 
<213> Homo sapiens 


<220> 

<221> SITE 
<222> (71) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (72) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (90) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (194) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (212) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (241) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (245) 

<223> n equals a,t,g, or c 


<220> 



<221> SITE 
<222> (268) 

<223> n equals a,t,g, or c 


<220> 

<221> SITE 

<222> (275) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 

<222> (290) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 

<222> (296) 

<223> n equals a,t,g, or c 


<400> 110 

CCCATAGATA GGCCCCTGGG GCTCCTGAAA GAATGAACCC AAGAGCAAGG GCTTAATGGT 60 

AACAGCTGCA NNAAGGGAAT GAAGAAAGAN TCTAAGAATG TGGAGACTGA TGGCCAGGCA 120 

AGTGGGACCA GGATACTGAA CGCTGTCCTG AAGAATGAGA AGGTAGCCGG GCTCTGCACC 180 

CACGTGCATT GCANATTGAA CCGCAACTGA AWACATTCCC CCACCAGCTG CAGCCCCTTG 240 

NTCTNCAGTT GCCAACCCTC CCGGGTGNAA TTTTNTTCCC AGGTACCTTN ATGGGNAAGC 300 

A 301 


<210> 111 
<211> 478 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (96) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (285) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (359) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (387) 

<223> n equals a,t,g, or c 



<220> 

<221> SITE 
<222> (407) 

<223> n equals a,t,g, or c 


<220> 

<221> SITE 
<222> (433) 

<223> n equals a,t,g, or c 


<220> 

<221> SITE 
<222> (452) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (469) 

<223> n equals a,t,g, or c 


<400> 111 

GGCACAGTGG CCGTGGGCAG GTCACTCCGG GCATCCAACA CAAGGTCAGG GACACAGTGC 60 

TCATCCTCAT TGCAGCCGTT CCAGAAGGGC ACCTGNGCCA GGGAGAGCCA GGTGTGGGCA 120 

GCTGGGTAGG GACCCGCAGC CCCTCGCCCT CAATGTACAC CAGCTCTGTC TCCACCACAC 180 

TAGACATGGG CTGGCTTTCC TGCACTGTCC CCAGACACCA CACTGCTCTG TCTTGTGCTT 240 

TTCCATAGAT GCTTCCCTCT TTAAAACGAT GCTCAAAGCT TCAGNTCCTC CTGGCTCCCC 300 

TCCAGTTTCA TGAATGGAGC TGATGGCACA GAACCCCCAA CCCCATTCAA CCAGCAGANG 360 

GTTCTGGTTC AACATTTATT GATCAANAAT GTGTGTGGGG CAAGGGNTTG GTAATGGGGG 420 

TTACAAAACA AGNAAGTTGG CAGTTTTGGC CNTCAGTTCA AGGGGAAANT GGTGTTTG 478 


<210> 112 
<211> 224 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 

<222> (12) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 

<222> (21) 

<223> n equals a,t,g, or c 


<220> 

<221> SITE 
<222> (49) 



<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (66) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (176) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (184) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (197) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (212) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (221) 

<223> n equals a,t,g, or c 
<400> 112 

CTATTGCCCC ANATGTTACA NGACGTGTTC GCCTCGTCCG TGTAGGAANT CCCTCAGCTT 60 
CAGTANGCGG TTGCCGCTCC TGGTGGCGAG GGAATGGTGA ATCTTCATCA TCATCCCGTG 120 
GTATGGGGAA CAAACCCAAG TTCTGGATCC TGAAAAATCA ACTGAAAGGA AGGCCNATAC 180 
ATCNTATCTC TCCAACNAAC TGTTGGCTTG ANCTCCTAAT NGTT 224 


<210> 113 
<211> 126 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (72) 

<223> n equals a,t,g, or c 
<400> 113 

GGCACGAGCT GACATCTCTC TGTCACTCTG TTGCAGGCAC CAACAAGAAC GAGACCTCCT 60 

TTGGGCTGTG GNAGTCACAG ACGGGCTTTT CCTCGCACGT GGTGGAGGTA TGTGGTGGAG 120 

GTACGG 126 


<210> 114 
<211> 150 
<212> PRT 
<213> Homo sapiens 

<400> 114 

Met Leu Phe Ala Cys Ala Leu Leu Ala Leu Leu Gly Leu Ala Thr Ser 
1 5 10 15 

Cys Ser Phe Ile Val Pro Arg Ser Glu Trp Arg Ala Leu Pro Ser Glu 
20 25 30 

Cys Ser Ser Arg Leu Gly His Pro Val Arg Tyr Val Val Ile Ser His 
35 40 45 

Thr Ala Gly Ser Phe Cys Asn Ser Pro Asp Ser Cys Glu Gln Gln Ala 
50 55 60 

Arg Asn Val Gln His Tyr His Lys Asn Glu Leu Gly Trp Cys Asp Val 
65 70 75 80 

Ala Tyr Asn Phe Leu Ile Gly Glu Asp Gly His Val Tyr Glu Gly Arg 
85 90 95 

Gly Trp Asn Ile Lys Gly Asp His Thr Gly Pro Ile Trp Asn Pro Met 
100 105 110 

Ser Ile Gly Ile Thr Phe Met Gly Asn Phe Met Asp Arg Val Arg Lys 
115 120 125 

Ala Gly Pro Pro Cys Cys Pro Lys Ser Ser Gly Ile Trp Gly Val Ser 
130 135 140 

Gly Leu Pro Glu Ile Gln 
145 150 


<210> 115 
<211> 285 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (18) 

<223> n equals a,t,g, or c 


<220> 

<221> SITE 
<222> (263) 
<223> n equals a,t,g, or c 

<220> 

<221> SITE 



<222> (270) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (273) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (276) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (283) 

<223> n equals a,t,g, or c 
<400> 115 


ACCCCAGGCC ATCCGGGNAG CCCAGGGTCT ACTGGCCTGC GGTGTGGCTC AGGGAGCCTG 60 

AGGTCCAACT ATGTGCTCAA AGGACACCGG GATGTGCAGC GTACACTCTC TCCAGGCAAC 120 

CAGCTCTACC ACCTCATCCA GAATTGGCCA CACTACCGCT CCCCCTGAGG CCCTGCTGAT 180 

CCGCACCCCA TTCCTCCCCT CCCATGGCCA AAAACCCCAC TGTCTCCTTC TCCAATAAAG 240 

ATGTAGCTCA AAAAAAAAAA AANAAAAAAN CCNGGNGGGG GGNCC 285 


<210> 116 

<211> 457 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (5) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (9) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (27) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (57) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 




<222> (70) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 

<222> (75) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 

<222> (80) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 

<222> (110) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 

<222> (173) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 

<222> (176) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 

<222> (212) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 

<222> (237) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 

<222> (300) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 

<222> (310) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 

<222> (318) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 

<222> (320) 

<223> n equals a,t,g, or c 



<220> 

<221> SITE 
<222> (326) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (337) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (348) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (396) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (399) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (436) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (439) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (444) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (451) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (453) 

<223> n equals a,t,g, or c 
<400> 116 

GGCANAGCNG GACCCTGCCG CCCTGCNACT ATGTCCCGCC GCTCTATGCT GCTTGCNTGG 60 
GCTCTCCCCN GCCTNCTTCN ACTCGAAGCG GCTCAGGAGA CAGAAGACCN GGCCTGCTGC 120 
AGCCCCATAG TGCCCCGGAA ACAGTGGAAG GCCCTGGTAT CAAAGTGCGC CCNGCNCTGA 180 


AGACCTGCCC TTACGCTATG TGGTGGTATC GNACACGGCG GGCAGCAGCT GCAACANCCC 240 
CGATTTGTTG CCAGCAGCAA GCCCGGAATG TGCAGCACTA CCCACATGAA GACACTGGGN 300 
TGGTGCGAAN GTGGGGTNAN AAATTNCTGA TTTGGGNGAA AGAAGGGNTT CTTATAAGAG 360 
GGGGCCTTTG GTTTGGGAAT TTAAAGGGTT GCCCCNTTNA GGTGAATTTT TGGGAAACCC 420 

TTGTTCCATT GGGATNAANT TTCNTGGGCA NTNACAT 457 


<210> 117 

<211> 1833 

<212> DNA 

<213> Homo sapiens 

<400> 117 

ACATCCATGG CTCTAATGCT CAGTTTGGTT CTGAGTCTCC TCAAGCTGGG ATCAGGGCAG 60 

TGGCAGGTGT TTGGGCCAGA CAAGCCTGTC CAGGCCTTGG TGGGGGAGGA CGCAGCATTC 120 

TCCTGTTTCC TGTCTCCTAA GACCAATGCA GAGGCCATGG AAGTGCGGTT CTTCAGGGGC 180 

CAGTTCTCTA GCGTGGTCCA CCTCTACAGG GACGGGAAGG ACCAGCCATT TATGCAGATG 240 

CCACAGTATC AAGGCAGGAC AAAACTGGTG AAGGATTCTA TTGCGGAGGG GCGCATCTCT 300 

CTGAGGCTGG AAAACATTAC TGTGTTGGAT GCTGGCCTCT ATGGGTGCAG GATTAGTTCC 360 

CAGTCTTACT ACCAGAAGGC CATCTGGGAG CTACAGGTGT CAGCACTGGG CTCAGTTCCT 420 

CTCATTTCCA TCACGGGATA TGTTGATAGA GACATCCAGC TACTCTGTCA GTCCTCGGGC 480 

TGGTTCCCCC GGCCCACAGC GAAGTGGAAA GGTCCACAAG GACAGGATTT GTCCACAGAC 540 

TCCAGGACAA ACAGAGACAT GCATGGCCTG TTTGATGTGG AGATCTCTCT GACCGTCCAA 600 

GAGAACGCCG GGAGCATATC CTGTTCCATG CGGCATGCTC ATCTGAGCCG AGAGGTGGAA 660 

TCCAGGGTAC AGATAGGAGA TACCTTTTTC GAGCCTATAT CGTGGCACCT GGCTACCAAA 720 

GTACTGGGAA TACTCTGCTG TGGCCTATTT TTTGGCATTG TTGGACTGAA GATTTTCTTC 780 

TCCAAATTCC AGTGGAAAAT CCAGGCGGAA CTGGACTGGA GAAGAAAGCA CGGACAGGCA 840 

GAATTGAGAG ACGCCCGGAA ACACGCAGTG GAGGTGACTC TGGATCCAGA GACGGCTCAC 900 

CCGAAGCTCT GCGTTTCTGA TCTGAAAACT GTAACCCATA GAAAAGCTCC CCAGGAGGTG 960 

CCTCACTCTG AGAAGAGATT TACAAGGAAG AGTGTGGTGG CTTCTCAGAG TTTCCAAGCA 1020 

GGGAAACATT ACTGGGAGGT GGACGGAGGA CACAATAAAA GGTGGCGCGT GGGAGTGTGC 1080 

CGGGATGATG TGGACAGGAG GAAGGAGTAC GTGACTTTGT CTCCCGATCA TGGGTACTGG 1140 

GTCCTCAGAC TGAATGGAGA ACATTTGTAT TTCACATTAA ATCCCCGTTT TATCAGCGTC 1200 

TTCCCCAGGA CCCCACCTAC AAAAATAGGG GTCTTCCTGG ACTATGAGTG TGGGACCATC 1260 


TCCTTCTTCA ACATAAATGA CCAGTCCCTT ATTTATACCC TGACATGTCG GTTTGAAGGC 1320 

TTATTGAGGC CCTACATTGA GTATCCGTCC TATAATGAGC AAAATGGAAC TCCCAGAGAC 1380 

AAGCAACAGT GAGTCCTCCT CACAGGCAAC CACGCCCTTC CTCCCCAGGG GTGAAATGTA 1440 

GGATGAATCA CATCCCACAT TCTTCTTTAG GGATATTAAG GTCTCTCTCC CAGATCCAAA 1500 

GTCCCGCAGC AGCCGGCCAA GGTGGCTTCC AGATGAAGGG GGACTGGCCT GTCCACATGG 1560 

GAGTCAGGTG TCATGGCTGC CCTGAGCTGG GAGGGAAGAA GGCTGACATT ACATTTAGTT 1620 

TGCTCTCACT CCATCTGGCT AAGTGATCTT GAAATACCAC CTCTCAGGTG AAGAACCGTC 1680 

AGGAATTCCC ATCTCACAGG CTGTGGTGTA GATTAAGTAG ACAAGGAATG TGAATAATGC 1740 

TTAGATCTTA TTGATGACAG AGTGTATCCT AATGGTTTGT TCATTATATT ACACTTTCAG 1800 

TAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAA 1833 


<210> 118 

<211> 1386 

<212> DNA 

<213> Homo sapiens 

<400> 118 

ATGGCTCTAA TGCTGAGTTT GGTTCTGAGT CTCCTCAAGC TGGGATCAGG GCAGTGGCAG 60 

GTGTTTGGGC CAGACAAGCC TGTCCAGGCC TTGGTGGGGG AGGACGCAGC ATTCTCCTGT 120 

TTCCTGTCTC CTAAGACCAA TGCAGAGGCC ATGGAAGTGC GGTTCTTCAG GGGCCAGTTC 180 

TCTAGCGTGG TCCACCTCTA CAGGGACGGG AAGGACCAGC CATTTATGCA GATGCCACAG 240 

TATCAAGGCA GGACAAAACT GGTGAAGGAT TCTATTGCGG AGGGGCGCAT CTCTCTGAGG 300 

CTGGAAAACA TTACTGTGTT GGATGCTGGC CTCTATGGGT GCAGGATTAG TTCCCAGTCT 360 

TACTACCAGA AGGCCATCTG GGAGCTACAG GTGTCAGCAC TGGGCTCAGT TCCTCTCATT 420 

TCCATCACGG GATATGTTGA TAGAGACATC CAGCTACTCT GTCAGTCCTC GGGCTGGTTC 480 

CCCCGGCCCA CAGCGAAGTG GAAAGGTCCA CAAGGACAGG ATTTGTCCAC AGACTCCAGG 540 

ACAAACAGAG ACATGCATGG CCTGTTTGAT GTGGAGATCT CTCTGACCGT CCAAGAGAAC 600 

GCCGGGAGCA TATCCTGTTC CATGCGGCAT GCTCATCTGA GCCGAGAGGT GGAATCCAGG 660 

GTACAGATAG GAGATACCTT TTTCGAGCCT ATATCGTGGC ACCTGGCTAG CAAAGTACTG 720 

GGAATACTCT GCTGTGGCCT ATTTTTTGGC ATTGTTGGAC TGAAGATTTT CTTCTCCAAA 780 

TTCCAGTGGA AAATCCAGGC GGAACTGGAC TGGAGAAGAA AGCACGGACA GGCAGAATTG 840 

AGAGACGCCC GGAAACACGC AGTGGAGGTG ACTCTGGATC CAGAGACGGC TCACCCGAAG 900 


CTCTGCGTTT CTGATCTGAA AACTGTAACC CATAGAAAAG CTCCCCAGGA GGTGCCTCAC 960 

TCTGAGAAGA GATTTACAAG GAAGAGTGTG GTGGCTTCTC AGAGTTTCCA AGCAGGGAAA 1020 

CATTACTGGG AGGTGGACGG AGGACACAAT AAAAGGTGGC GCGTGGGAGT GTGCCGGGAT 1080 

GATGTGGACA GGAGGAAGGA GTACGTGACT TTGTCTCCCG ATCATGGGTA CTGGGTCCTC 1140 

AGACTGAATG GAGAACATTT GTATTTCACA TTAAATCCCC GTTTTATCAG CGTCTTCCCC 1200 

AGGACCCCAC CTACAAAAAT AGGGGTCTTC CTGGACTATG AGTGTGGGAG CATCTCCTTC 1260 

TTCAACATAA ATGACCAGTC CCTTATTTAT ACCCTGACAT GTCGGTTTGA AGGCTTATTG 1320 

AGGCCCTACA TTGAGTATCC GTCCTATAAT GAGCAAAATG GAACTCCCAG AGACAAGCAA 1380 

CAGTGA 1386 


<210> 119 
<211> 461 
<212> PRT 
<213> Homo sapiens 

<400> 119 

Met Ala Leu Met Leu Ser Leu Val Leu Ser Leu Leu Lys Leu Gly Ser 
1 5 10 15 

Gly Gln Trp Gln Val Phe Gly Pro Asp Lys Pro Val Gln Ala Leu Val 
20 25 30 

Gly Glu Asp Ala Ala Phe Ser Cys Phe Leu Ser Pro Lys Thr Asn Ala 
35 40 45 

Glu Ala Met Glu Val Arg Phe Phe Arg Gly Gln Phe Ser Ser Val Val 
50 55 60 

His Leu Tyr Arg Asp Gly Lys Asp Gln Pro Phe Met Gln Met Pro Gln 
65 70 75 80 

Tyr Gln Gly Arg Thr Lys Leu Val Lys Asp Ser Ile Ala Glu Gly Arg 
85 90 95 

Ile Ser Leu Arg Leu Glu Asn Ile Thr Val Leu Asp Ala Gly Leu Tyr 
100 105 110 

Gly Cys Arg Ile Ser Ser Gln Ser Tyr Tyr Gln Lys Ala Ile Trp Glu 
115 120 125 

Leu Gln Val Ser Ala Leu Gly Ser Val Pro Leu Ile Ser Ile Thr Gly 
130 135 140 

Tyr Val Asp Arg Asp Ile Gln Leu Leu Cys Gln Ser Ser Gly Trp Phe 
145 150 155 160 

Pro Arg Pro Thr Ala Lys Trp Lys Gly Pro Gln Gly Gln Asp Leu Ser 
165 170 175 

Thr Asp Ser Arg Thr Asn Arg Asp Met His Gly Leu Phe Asp Val Glu 
180 185 190 

Ile Ser Leu Thr Val Gln Glu Asn Ala Gly Ser Ile Ser Cys Ser Met 
195 200 205 

Arg His Ala His Leu Ser Arg Glu Val Glu Ser Arg Val Gln Ile Gly 
210 215 220 

Asp Thr Phe Phe Glu Pro Ile Ser Trp His Leu Ala Thr Lys Val Leu 
225 230 235 240 

Gly Ile Leu Cys Cys Gly Leu Phe Phe Gly Ile Val Gly Leu Lys Ile 
245 250 255 

Phe Phe Ser Lys Phe Gln Trp Lys Ile Gln Ala Glu Leu Asp Trp Arg 
260 265 270 

Arg Lys His Gly Gln Ala Glu Leu Arg Asp Ala Arg Lys His Ala Val 
275 280 285 

Glu Val Thr Leu Asp Pro Glu Thr Ala His Pro Lys Leu Cys Val Ser 
290 295 300 

Asp Leu Lys Thr Val Thr His Arg Lys Ala Pro Gln Glu Val Pro His 
305 310 315 320 

Ser Glu Lys Arg Phe Thr Arg Lys Ser Val Val Ala Ser Gln Ser Phe 
325 330 335 

Gln Ala Gly Lys His Tyr Trp Glu Val Asp Gly Gly His Asn Lys Arg 
340 345 350 

Trp Arg Val Gly Val Cys Arg Asp Asp Val Asp Arg Arg Lys Glu Tyr 
355 360 365 

Val Thr Leu Ser Pro Asp His Gly Tyr Trp Val Leu Arg Leu Asn Gly 
370 375 380 

Glu His Leu Tyr Phe Thr Leu Asn Pro Arg Phe Ile Ser Val Phe Pro 
385 390 395 400 

Arg Thr Pro Pro Thr Lys Ile Gly Val Phe Leu Asp Tyr Glu Cys Gly 
405 410 415 

Thr Ile Ser Phe Phe Asn Ile Asn Asp Gln Ser Leu Ile Tyr Thr Leu 
420 425 430 

Thr Cys Arg Phe Glu Gly Leu Leu Arg Pro Tyr Ile Glu Tyr Pro Ser 
435 440 445 

Tyr Asn Glu Gln Asn Gly Thr Pro Arg Asp Lys Gln Gln 
450 455 460 


<210> 120 



<211> 1152 

<212> DNA 

<213> Homo sapiens 

<400> 120 

ACCTTTTTCG AGCCTATATC GTGGCACCTG GCTACCAAAG TACTGGGAAT ACTCTGCTGT 60 

GGCCTATTTT TTGGCATTGT TGGACTGAAG ATTTTCTTCT CCAAATTCCA GTGGAAAATC 120 

CAGGCGGAAC TGGACTGGAG AAGAAAGCAC GGACAGGCAG AATTGAGAGA CGCCCGGAAA 180 

CACGCAGTGG AGGTGACTCT GGATCCAGAG ACGGCTCACC CGAAGCTCTG CGTTTCTGAT 240 

CTGAAAACTG TAACCCATAG AAAAGCTCCC CAGGAGGTGC CTCACTCTGA GAAGAGATTT 300 

ACAAGGAAGA GTGTGGTGGC TTCTCAGAGT TTCCAAGCAG GGAAACATTA CTGGGAGGTG 360 

GACGGAGGAC ACAATAAAAG GTGGCGCGTG GGAGTGTGCC GGGATGATGT GGACAGGAGG 420 

AAGGAGTACG TGACTTTGTC TCCCGATCAT GGGTACTGGG TCCTCAGACT GAATGGAGAA 480 

CATTTGTATT TCACATTAAA TCCCCGTTTT ATCAGCGTCT TCCCCAGGAC CCCACCTACA 540 

AAAATAGGGG TCTTCCTGGA CTATGAGTGT GGGACCATCT CCTTCTTCAA CATAAATGAC 600 

CAGTCCCTTA TTTATACCCT GACATGTCGG TTTGAAGGCT TATTGAGGCC CTACATTGAG 660 

TATCCGTCCT ATAATGAGCA AAATGGAACT CCCAGAGACA AGCAACAGTG AGTCCTCCTC 720 

ACAGGCAACC ACGCCCTTCC TCCCCAGGGG TGAAATGTAG GATGAATCAC ATCCCACATT 780 

CTTCTTTAGG GATATTAAGG TCTCTCTCCC AGATCCAAAG TCCCGCAGCA GCCGGCCAAG 840 

GTGGCTTCCA GATGAAGGGG GACTGGCCTG TCCACATGGG AGTGAGGTGT CATGGCTGCC 900 

CTGAGCTGGG AGGGAAGAAG GCTGACATTA CATTTAGTTT GCTCTCACTC CATCTGGCTA 960 

AGTGATCTTG AAATACCACC TCTCAGGTGA AGAACCGTCA GGAATTCCCA TCTCACAGGC 1020 

TGTGGTGTAG ATTAAGTAGA CAAGGAATGT GAATAATGCT TAGATCTTAT TGATGACAGA 1080 

GTGTATCCTA ATGGTTTGTT CATTATATTA CACTTTCAGT AAAAAAAAAA AAAAAAAAAA 1140 

AAAAAAAAAA AA 1152 


<210> 121 
<211> 526 
<212> PRT 
<213> Homo sapiens 

<400> 121 

Met Ala Val Phe Pro Asn Ser Cys Leu Ala Gly Cys Leu Leu Ile Phe 
1 5 10 15 

Ile Leu Leu Gln Leu Pro Lys Leu Asp Ser Ala Pro Phe Asp Val Ile 
20 25 30 

Gly Pro Gln Glu Pro Ile Leu Ala Val Val Gly Glu Asp Ala Glu Leu 
35 40 45 

Pro Cys Arg Leu Ser Pro Asn Val Ser Ala Lys Gly Met Glu Leu Arg 
50 55 60 

Trp Phe Arg Glu Lys Val Ser Pro Ala Val Phe Val Ser Arg Glu Gly 
65 70 75 80 

Gln Glu Gln Glu Gly Glu Glu Met Ala Glu Tyr Arg Gly Arg Val Ser 
85 90 95 

Leu Val Glu Asp His Ile Ala Glu Gly Ser Val Ala Val Arg Ile Gln 
100 105 110 

Glu Val Lys Ala Ser Asp Asp Gly Glu Tyr Arg Cys Phe Phe Arg Gln 
115 120 125 

Asp Glu Asn Tyr Glu Glu Ala Ile Val His Leu Lys Val Ala Ala Leu 
130 135 140 

Gly Ser Asp Pro His Ile Ser Met Lys Val Gln Glu Ser Gly Glu Ile 
145 150 155 160 

Gln Leu Glu Cys Thr Ser Val Gly Trp Tyr Pro Glu Pro Gln Val Gln 
165 170 175 

Trp Arg Thr His Arg Gly Glu Glu Phe Pro Ser Met Ser Glu Ser Arg 
180 185 190 

Asn Pro Asp Glu Glu Gly Leu Phe Thr Val Arg Ala Ser Val Ile Ile 
195 200 205 

Arg Asp Ser Ser Met Lys Asn Val Ser Cys Cys Ile Arg Asn Leu Leu 
210 215 220 

Leu Gly Gln Glu Lys Asp Val Glu Val Ser Ile Pro Ala Ser Phe Phe 
225 230 235 240 

Pro Arg Leu Thr Pro Trp Met Val Ala Val Ala Val Ile Leu Val Val 
245 250 255 

Leu Gly Leu Leu Thr Ile Gly Ser Ile Phe Phe Thr Trp Arg Leu Tyr 
260 265 270 

Lys Glu Arg Ser Arg Gln Arg Arg Asn Glu Phe Ser Ser Lys Glu Lys 
275 280 285 

Leu Leu Glu Glu Leu Lys Trp Lys Arg Ala Thr Leu His Ala Val Asp 
290 295 300 

Val Thr Leu Asp Pro Asp Thr Ala His Pro His Leu Phe Leu Tyr Glu 
305 310 315 320 

Asp Ser Lys Ser Val Arg Leu Glu Asp Ser Arg Gln Lys Leu Pro Glu 
325 330 335 

Lys Pro Glu Arg Phe Asp Ser Trp Pro Cys Val Met Gly Arg Glu Ala 
340 345 350 

Phe Thr Ser Gly Arg His Tyr Trp Glu Val Glu Val Gly Asp Arg Thr 
355 360 365 

Asp Trp Ala Ile Gly Val Cys Arg Glu Asn Val Met Lys Lys Gly Phe 
370 375 380 

Asp Pro Met Thr Pro Glu Asn Gly Phe Trp Ala Val Glu Leu Tyr Gly 
385 390 395 400 

Asn Gly Tyr Trp Ala Leu Thr Pro Leu Arg Thr Pro Leu Pro Leu Ala 
405 410 415 

Gly Pro Pro Arg Arg Val Gly Val Phe Leu Asp Tyr Glu Ser Gly Asp 
420 425 430 

Ile Phe Phe Tyr Asn Met Thr Asp Gly Ser His Ile Tyr Thr Phe Ser 
435 440 445 

Lys Ala Ser Phe Ser Gly Pro Leu Arg Pro Phe Phe Cys Leu Trp Ser 
450 455 460 

Cys Gly Lys Lys Pro Leu Thr Ile Cys Pro Val Thr Asp Gly Leu Glu 
465 470 475 480 

Gly Val Met Val Val Ala Asp Ala Lys Asp Ile Ser Lys Glu Ile Pro 
485 490 495 

Leu Ser Pro Met Gly Glu Asp Ser Ala Ser Gly Asp Ile Glu Thr Leu 
500 505 510 

His Ser Lys Leu Ile Pro Leu Gln Pro Ser Gln Gly Val Pro 
515 520 525 


<210> 122 
<211> 469 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (63) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (289) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (338) 

<223> n equals a,t,g, or c 


<220> 



<221> SITE 
<222> (357) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (358) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (362) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (364) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (381) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (389) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (391) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (437) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (447) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (454) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (458) 

<223> n equals a,t,g, or c 
<400> 122 

GGCAGAGCCT GTCATCCGTT TCCATGCCGT GAGGTCCATT CACAGAACAC ATCCATGGCT 60 


CTNATGCTCA GTTTGGTTCT GAGTCTCCTC AAGCTGGGAT CAGGGCAGTG GCAGGTGTTT 120 

GGGCCAGACA AGCCTGTCCA GGCCTTGGTG GGGGAGGACG CAGCATTCTC CTGTTTCCTG 180 

TCTCCTAAGA ACCAATGCAG AGGCCATGGT AAGTGCGGTT CTTCAGGGGC CAGTTTCTCT 240 

AGCGTGGTCC ACCTCTTACA GGGACGGGAA GGACCAGCCC ATTTTATGNC AGATGGCCAC 300 

AGTATTCAAG GCCAGGACAA AAAATGGGTG GAAGGTTNCT ATTTGCGGAG GGGCGCNNTT 360 

TNTNTTGAGG GTGGGAAAAA NATTTACTNT NTTTGGATTG CTGGGCCCCT TATGGGTGCC 420 

AGGGTTAAGT TTCCCANTTT TTAATTNACC CGANAGGNCC ATTTGGGGG 469 


<210> 123 
<211> 368 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 

<222> (81) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 

<222> (109) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 

<222> (133) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 

<222> (210) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 

<222> (228) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 

<222> (242) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 

<222> (245) 

<223> n equals a,t,g, or c 


<220> 

<221> SITE 
<222> (249) 



<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (251) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (268) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (329) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (335) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (339) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (353) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (362) 

<223> n equals a,t,g, or c 
<400> 123 


GGCAGAGAGG ATTTGTCCAC AGACTCCAGG ACAAACAGAG ACATGCATGG CCTGTTTGAT 60 

GTGGAGATCT CTCTGACCGT NCAAGAGAAC GCCGGGAGCA TATCCTGTNC CATGCGGCAT 120 

GCTCATCTAA GCNCGAGAAG GTGGGAATCC AGGGTACAGA TAGGAGATAC CTTTTCGAAG 180 

CCTATATCGT GGCACCTGGC TACCAAAGTN CTGGGAAATA CTCTGCTNGT GGCCTATTTT 240 

TNGGNATTNT NGGACTAAAG ATATTTTNCT CCAAATTCCA GTGTAAGCAA GGGAGAAGGG 300 

GCATGGGCCC GTGCCTTATT CCATGGTTNC CAGCNGGGNC AGGATCAGAG ATNGCTCCCA 360 

CNTCCAGT 368 


<210> 124 

<211> 27 

<212> PRT 

<213> Homo sapiens 




<400> 124 

Gly Thr Leu Val Ala Glu Lys His Val Leu Thr Ala Ala His Cys Ile 
1 5 10 15 

His Asp Gly Lys Thr Tyr Val Lys Gly Thr Gln 
20 25 


<210> 125 
<211> 413 
<212> PRT 

<213> Homo sapiens 


<400> 125 

Gly Thr Arg Gly Gln Ala Trp Glu Pro Arg Ala Leu Ser Arg Arg Pro 
1 5 10 15 

His Leu Ser Glu Arg Arg Ser Glu Pro Arg Pro Gly Arg Ala Ala Arg 
20 25 30 

Arg Gly Thr Val Leu Gly Met Ala Gly Ile Pro Gly Leu Leu Phe Leu 
35 40 45 

Leu Phe Phe Leu Leu Cys Ala Val Gly Gln Val Ser Pro Tyr Ser Ala 
50 55 60 

Pro Trp Lys Pro Thr Trp Pro Ala Tyr Arg Leu Pro Val Val Leu Pro 
65 70 75 80 

Gln Ser Thr Leu Asn Leu Ala Lys Pro Asp Phe Gly Ala Glu Ala Lys 
85 90 95 

Leu Glu Val Ser Ser Ser Cys Gly Pro Gln Cys His Lys Gly Thr Pro 
100 105 110 

Leu Pro Thr Tyr Glu Glu Ala Lys Gln Tyr Leu Ser Tyr Glu Thr Leu 
115 120 125 

Tyr Ala Asn Gly Ser Arg Thr Glu Thr Gln Val Gly Ile Tyr Ile Leu 
130 135 140 

Ser Ser Ser Gly Asp Gly Ala Gln His Arg Asp Ser Gly Ser Ser Gly 
145 150 155 160 

Lys Ser Arg Arg Lys Arg Gln Ile Tyr Gly Tyr Asp Ser Arg Phe Ser 
165 170 175 

Ile Phe Gly Lys Asp Phe Leu Leu Asn Tyr Pro Phe Ser Thr Ser Val 
180 185 190 

Lys Leu Ser Thr Gly Cys Thr Gly Thr Leu Val Ala Glu Lys His Val 
195 200 205 

Leu Thr Ala Ala His Cys Ile His Asp Gly Lys Thr Tyr Val Lys Gly 
210 215 220 

Thr Gln Lys Leu Arg Val Gly Phe Leu Lys Pro Lys Phe Lys Asp Gly 
225 230 235 240 

Gly Arg Gly Ala Asn Asp Ser Thr Ser Ala Met Pro Glu Gln Met Lys 
245 250 255 

Phe Gln Trp Ile Arg Val Lys Arg Thr His Val Pro Lys Gly Trp Ile 
260 265 270 

Lys Gly Asn Ala Asn Asp Ile Gly Met Asp Tyr Asp Tyr Ala Leu Leu 
275 280 285 

Glu Leu Lys Lys Pro His Lys Arg Lys Phe Met Lys Ile Gly Val Ser 
290 295 300 

Pro Pro Ala Lys Gln Leu Pro Gly Gly Arg Ile His Phe Ser Gly Tyr 
305 310 315 320 

Asp Asn Asp Arg Pro Gly Asn Leu Val Tyr Arg Phe Cys Asp Val Lys 
325 330 335 

Asp Glu Thr Tyr Asp Leu Leu Tyr Gln Gln Cys Asp Ser Gln Pro Gly 
340 345 350 

Ala Ser Gly Ser Gly Val Tyr Val Arg Met Trp Lys Arg Gln His Gln 
355 360 365 

Lys Trp Glu Arg Lys Ile Ile Gly Met Ile Ser Gly His Gln Trp Val 
370 375 380 

Asp Met Asp Gly Ser Pro Gln Glu Phe Thr Arg Gly Cys Ser Glu Ile 
385 390 395 400 

Thr Pro Leu Gln Tyr Ile Pro Asp Ile Ser Ile Gly Val 
405 410 


<210> 126 
<211> 383 
<212> PRT 
<213> Homo sapiens 

<400> 126 

Met Ala Gly Ile Pro Gly Leu Leu Phe Leu Leu Phe Phe Leu Leu Cys 
1 5 10 15 

Ala Val Gly Gln Val Ser Pro Tyr Ser Ala Pro Trp Lys Pro Thr Trp 
20 25 30 

Pro Ala Tyr Arg Leu Pro Val Val Leu Pro Gln Ser Thr Leu Asn Leu 
35 40 45 

Ala Lys Pro Asp Phe Gly Ala Glu Ala Lys Leu Glu Val Ser Ser Ser 
50 55 60 

Cys Gly Pro Gln Cys His Lys Gly Thr Pro Leu Pro Thr Tyr Glu Glu 
65 70 75 80 

Ala Lys Gln Tyr Leu Ser Tyr Glu Thr Leu Tyr Ala Asn Gly Ser Arg 
85 90 95 

Thr Glu Thr Gln Val Gly Ile Tyr Ile Leu Ser Ser Ser Gly Asp Gly 
100 105 110 

Ala Gln His Arg Asp Ser Gly Ser Ser Gly Lys Ser Arg Arg Lys Arg 
115 120 125 

Gln Ile Tyr Gly Tyr Asp Ser Arg Phe Ser Ile Phe Gly Lys Asp Phe 
130 135 140 

Leu Leu Asn Tyr Pro Phe Ser Thr Ser Val Lys Leu Ser Thr Gly Cys 
145 150 155 160 

Thr Gly Thr Leu Val Ala Glu Lys His Val Leu Thr Ala Ala His Cys 
165 170 175 

Ile His Asp Gly Lys Thr Tyr Val Lys Gly Thr Gln Lys Leu Arg Val 
180 185 190 

Gly Phe Leu Lys Pro Lys Phe Lys Asp Gly Gly Arg Gly Ala Asn Asp 
195 200 205 

Ser Thr Ser Ala Met Pro Glu Gln Met Lys Phe Gln Trp Ile Arg Val 
210 215 220 

Lys Arg Thr His Val Pro Lys Gly Trp Ile Lys Gly Asn Ala Asn Asp 
225 230 235 240 

Ile Gly Met Asp Tyr Asp Tyr Ala Leu Leu Glu Leu Lys Lys Pro His 
245 250 255 

Lys Arg Lys Phe Met Lys Ile Gly Val Ser Pro Pro Ala Lys Gln Leu 
260 265 270 

Pro Gly Gly Arg Ile His Phe Ser Gly Tyr Asp Asn Asp Arg Pro Gly 
275 280 285 

Asn Leu Val Tyr Arg Phe Cys Asp Val Lys Asp Glu Thr Tyr Asp Leu 
290 295 300 

Leu Tyr Gln Gln Cys Asp Ala Gln Pro Gly Ala Ser Gly Ser Gly Val 
305 310 315 320 

Tyr Val Arg Met Trp Lys Arg Gln Gln Gln Lys Trp Glu Arg Lys Ile 
325 330 335 

Ile Gly Ile Phe Ser Gly His Gln Trp Val Asp Met Asn Gly Ser Pro 
340 345 350 

Gln Asp Phe Asn Val Ala Val Arg Ile Thr Pro Leu Lys Tyr Ala Gln 
355 360 365 

Ile Cys Tyr Trp Ile Lys Gly Asn Tyr Leu Asp Cys Arg Glu Gly 
370 375 380 


<210> 127 



<211> 269 
<212> PRT 
<213> Homo sapiens 


<400> 127 

Met Ile Arg Thr Leu Leu Leu Ser Thr Leu Val Ala Gly Ala Leu Ser 
1 5 10 15 

Cys Gly Asp Pro Thr Tyr Pro Pro Tyr Val Thr Arg Val Val Gly Gly 
20 25 30 

Glu Glu Ala Arg Pro Asn Ser Trp Pro Trp Gln Val Ser Leu Gln Tyr 
35 40 45 

Ser Ser Asn Gly Lys Trp Tyr His Thr Cys Gly Gly Ser Leu Ile Ala 
50 55 60 

Asn Ser Trp Val Leu Thr Ala Ala His Cys Ile Ser Ser Ser Arg Thr 
65 70 75 80 

Tyr Arg Val Gly Leu Gly Arg His Asn Leu Tyr Val Ala Glu Ser Gly 
85 90 95 

Ser Leu Ala Val Ser Val Ser Lys Ile Val Val His Lys Asp Trp Asn 
100 105 110 

Ser Asn Gln Ile Ser Lys Gly Asn Asp Ile Ala Leu Leu Lys Leu Ala 
115 120 125 

Asn Pro Val Ser Leu Thr Asp Lys Ile Gln Leu Ala Cys Leu Pro Pro 
130 135 140 

Ala Gly Thr Ile Leu Pro Asn Asn Tyr Pro Cys Tyr Val Thr Gly Trp 
145 150 155 160 

Gly Arg Leu Gln Thr Asn Gly Ala Val Pro Asp Val Leu Gln Gln Gly 
165 170 175 

Arg Leu Leu Val Val Asp Tyr Ala Thr Cys Ser Ser Ser Ala Trp Trp 
180 185 190 

Gly Ser Ser Val Lys Thr Ser Met Ile Val Ala Gly Gly Asp Gly Val 
195 200 205 

Ile Ser Ser Cys Asn Gly Asp Ser Gly Gly Pro Leu Asn Cys Gln Ala 
210 215 220 

Ser Asp Gly Arg Trp Gln Val His Gly Ile Val Ser Phe Gly Ser Arg 
225 230 235 240 

Leu Gly Cys Asn Tyr Tyr His Lys Pro Ser Val Phe Thr Arg Val Ser 
245 250 255 

Asn Tyr Ile Asp Trp Ile Asn Ser Val Ile Ala Asn Asn 
260 265 


<210> 128 



<211> 328 
<212> DNA 
<213> Homo sapiens 


<220> 

<221> SITE 

<222> (30) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 

<222> (216) 

<223> n equals a,t,g, or c 


<400> 128 
TGAAACGCAC CCATGTGCCC AAGGGTTGGN TCAAGGGCAA TGCCAATGAC ATCGGCATGG 60 

ATTATGATTA TGCCCTCCTG GAACTCAAAA AGCCCCACAA GAGAAAATTT ATGAAGATTG 120 

GGGTGAGCCC TCCTGCTAAG CAGCTGCCAG GGGGCAGAAT TCACTTCTCT GGTTATGACA 180 

ATGACCGACC AGGCAATTTG GTGTATCGCT TCTGTNACGT CAAAGACGAG ACCTATGACT 240 

TGCTCTACCA GCAATGCGAT GCCCAGCCAG GGGCCAGCGG GTCTGGGGTC TATGTTAGGA 300 

TGTGGGAGAG ACAGCAGCAT GAAGTGGG 328 


<210> 129 
<211> 495 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (23) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (260) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (291) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (295) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (305) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (319) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (331) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (352) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (364) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (390) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (400) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (402) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (406) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (428) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (429) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (448) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (461) 

<223> n equals a,t,g, or c 


<220> 

<221> SITE 
<222> (463) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (465) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (469) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (470) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (476) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (482) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (489) 

<223> n equals a,t,g, or c 


<400> 129 
AATTCGGCAC AGGCTCGGCC CGNAGGACTC AGGGGCTTCA GGAAAGTCTC GAAGGAAGCG 60 

GCAGATTTAT GGCTATGACA GCAGGTTCAG CATTTTTGGG AAGGACTTCC TGCTCAACTA 120 

CCCTTTCTCA ACATCAGTGA AGTTATCCAC GGGCTGCACC GGCACCCTGG TGGGCAGAGA 180 

AGCATGTCCT CACAGCTGCC CACTGCATAC ACGATGGAAA AACCTATGTG AAAGGAACCC 240 

AGAAGCTTCG AGTGGGCTTW CTAAAGCCCA AGTTTAAAGA TGGTGGTCGA NGGGNCAACG 300 

ACTTWCACTT CAGCCATGNC CGAGCAGATG NATTTTCAGT GGGTCCGGTG GNAAGGAACC 360 

CTGNTGCCCA AGGGTTTGGG TTCAGGGGGN ATGGCATGAN CNAGGNATGG GTTTATGATT 420 

ATGCCTCNNG GGAATTCAAA AGGCCCANAA GGGGAAATTT NTNANGGTNN GGGTGNGGCC 480 

TNCTGTTANG AAATT 495 

<210> 130 
<211> 288 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (12) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (24) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (31) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (69) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (102) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (200) 

<223> n equals a,t,g, or c 
<400> 130 


CCCACACCTG TNTGAGCGGC GCANGAGCCG NGGCCCGGGC GGGCTGCTCG GCGCGGAACA 60 

GTGCTCGGNA TGGCAGGGAT TCCAGGGCTC CTCTTCCTTC TNTTCTTTCT GCTCTGTGCT 120 

GTTGGGCAAG TAAGCCCTTA CAGTGCCCCC TGGAAACCCA CTTGGCCTGC ATACCGCCTC 180 

CCTGTCGTCT TGCCCCAGTN TACCCTCAAT TTAGCCAAGC CAGACTTTGG AGCCGAAGCC 240 

AAATTAGAAG TATCTTCTTC ATGTGGACCC CAGTGTCATA AGGGAACT 288 


<210> 131 
<211> 423 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (5) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (49) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (61) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (79) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (90) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (224) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (251) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (272) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (300) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (308) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (315) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (363) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (375) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 

<222> (392) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 

<222> (396) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 

<222> (405) 

<223> n equals a,t,g, or c 

<400> 131 


GGCANAGGGT GGTCGAGGGG CCAACGACTC CACTTCAGCC ATGCCCGANC AGATGAAATT 60 

NCAGTGGATC CGGGTGAANG CACCCATGTN GCCAAGGGTT GGATCAAGGG CAATGCCAAT 120 

GAACATCGGC ATGGATTATG AATTATGCCC CTCCTGGGAA CTGCAAAAAG CCCCACAAGA 180 

GAAAATTTAT GAAAGATTGG GGTGGAGCCC TCCTGCTTAA GCANCTGCCA GGGGGCAGAA 240 

TTGCACTTCT NTGGTTATGA ACAATGAACC GNCCAGGGCA ATTTGGTGTG ATCGCTTCTN 300 

TGAACGTNCA AAGANGAGGA CCTATGGACT TGGTTCTTAC CCAGCAATGG GGATTGCCCC 360 

AGNCCAGGGG GCCANCGGGG TTTGGGGGGT TNTTTNTTTT AAGGNTTTTT GGGAAGGGGG 420 

CCA 423 


<210> 132 

<211> 145 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (66) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (76) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (104) 

<223> n equals a,t,g, or c 


<220> 
<221> SITE 
<222> (112) 

<223> n equals a,t,g, or c 


<220> 

<221> SITE 
<222> (134) 

<223> n equals a,t,g, or c 


<400> 132 

TTGCATAGAA ATAAAAAAAA TACTGATTTG GGGCAATGAG GAATATTTGA CAATTAAGTT 60 

AATCTNCACG TTTTTNCAAA CTTTGGATTT TTATTTCATC TGANCTTGTT TNAAAGATTT 120 

ATATTAAATA TTTNGCATAC AAGAG 145 


<210> 133 

<211> 340 

<212> DNA 

<213> Homo sapiens 


<220> 

<221> SITE 
<222> (260) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (295) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (313) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (314) 

<223> n equals a,t,g, or c 


<400> 133 
CAGAAGTGGG AGCGAAAAAT TATTGGCATT TTTTCAGGGC ACCAGTGGGT GGACATGAAT 60 

GGTTCCCCAC AGGATTTCAA CGTGGCTGTC AGAATCACTC CTCTCAAATA TGCCCAGATT 120 

TGCTATTGGA TTAAAGGAAA CTACCTGGAT TGTAGGGAGG GGTGACACAG TGTTCCCTCC 180 

TGGCAGCAAT TAAGGGTCTT CATGTTCTTA TTTTAGGAGA GGCCAAATTG TTTTTTGTCA 240 

TTGGCGTGCA CACGTGTGTN TGGTGTGTGT GGTGTGTGTG TAAAGGTGTC TTATWAATCT 300 

TTTTACCCAT TTNNTTACAA TTGCAAGATG ACTGGCTTTA 340 

<210> 134 
<211> 99 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (51) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (56) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (57) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (59) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (79) 

<223> n equals a,t,g, or c 
<400> 134 

GGAGCCGCGC GCTCTCTCCC GGCGCCCACA CCTGTCTGAG CGGCGCACGA NGCCGNNGNC 60 

CCGGGCGGGC TGCTCGGCNC GGAACAGTGC TCGGCATGG 99 


<210> 135 
<211> 161 
<212> DNA 
<213> Homo sapiens 


<220> 
<221> SITE 
<222> (4) 
<223> n equals a,t,g, or c 
<220> 
<221> SITE 
<222> (12) 
<223> n equals a,t,g, or c 
<220> 
<221> SITE 
<222> (28) 
<223> n equals a,t,g, or c 


<220> 
<221> SITE 
<222> (58) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (101) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (109) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (152) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (156) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (157) 

<223> n equals a,t,g, or c 
<400> 135 

GCANCGAGCC GNGGCCCGGG CGGGCTGNTC GGCGCGGAAC AGTGCTGCGG CATGGCANTA 60 

TATTCCAGGG CTCCTCTTCC TTCTGTTCTT TCTGCTTATG NGCTGTTGNG CAAGTAGAGC 120 

CCTTACAGTT GCCCACTGTA AACCCACTTG GNCTGNNTAC C 161 


<210> 136 

<211> 442 

<212> PRT 

<213> Homo sapiens 

<400> 136 

Met Ala Ser Val Val Leu Pro Ser Gly Ser Gln Cys Ala Ala Ala Ala 
1 5 10 15 

Ala Ala Ala Ala Pro Pro Gly Leu Arg Leu Arg Leu Leu Leu Leu Leu 
20 25 30 

Phe Ser Ala Ala Ala Leu Ile Pro Thr Gly Asp Gly Gln Asn Leu Phe 
35 40 45 

Thr Lys Asp Val Thr Val Ile Glu Gly Glu Val Ala Thr Ile Ser Cys 
50 55 60 

Gln Val Asn Lys Ser Asp Asp Ser Val Ile Gln Leu Leu Asn Pro Asn 
65 70 75 80 

Arg Gln Thr Ile Tyr Phe Arg Asp Phe Arg Pro Leu Lys Asp Ser Arg 
85 90 95 

Phe Gln Leu Leu Asn Phe Ser Ser Ser Glu Leu Lys Val Ser Leu Thr 
100 105 110 

Asn Val Ser Ile Ser Asp Glu Gly Arg Tyr Phe Cys Gln Leu Tyr Thr 
115 120 125 

Asp Pro Pro Gln Glu Ser Tyr Thr Thr Ile Thr Val Leu Val Pro Pro 
130 135 140 

Arg Asn Leu Met Ile Asp Ile Gln Lys Asp Thr Ala Val Glu Gly Glu 
145 150 155 160 

Glu Ile Glu Val Asn Cys Thr Ala Met Ala Ser Lys Pro Ala Thr Thr 
165 170 175 

Ile Arg Trp Phe Lys Gly Asn Thr Glu Leu Lys Gly Lys Ser Glu Val 
180 185 190 

Glu Glu Trp Ser Asp Met Tyr Thr Val Thr Ser Gln Leu Met Leu Lys 
195 200 205 

Val His Lys Glu Asp Asp Gly Val Pro Val Ile Cys Gln Val Glu His 
210 215 220 

Pro Ala Val Thr Gly Asn Leu Gln Thr Gln Arg Tyr Leu Glu Val Gln 
225 230 235 240 

Tyr Lys Pro Gln Val His Ile Gln Met Thr Tyr Pro Leu Gln Gly Leu 
245 250 255 

Thr Arg Glu Gly Asp Ala Leu Glu Leu Thr Cys Glu Ala Ile Gly Lys 
260 265 270 

Pro Gln Pro Val Met Val Thr Trp Val Arg Val Asp Asp Glu Met Pro 
275 280 285 

Gln His Ala Val Leu Ser Gly Pro Asn Leu Phe Ile Asn Asn Leu Asn 
290 295 300 

Lys Thr Asp Asn Gly Thr Tyr Arg Cys Glu Ala Ser Asn Ile Val Gly 
305 310 315 320 

Lys Ala His Ser Asp Tyr Met Leu Tyr Val Tyr Asp Pro Pro Thr Thr 
325 330 335 

Ile Pro Pro Pro Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr Thr 
340 345 350 

Thr Ile Leu Thr Ile Ile Thr Asp Ser Arg Ala Gly Glu Glu Gly Ser 
355 360 365 

Ile Arg Ala Val Asp His Ala Val Ile Gly Gly Val Val Ala Val Val 
370 375 380 

Val Phe Ala Met Leu Cys Leu Leu Ile Ile Leu Gly Arg Tyr Phe Ala 
385 390 395 400 

Arg His Lys Gly Thr Tyr Phe Thr His Glu Ala Lys Gly Ala Asp Asp 
405 410 415 

Ala Ala Asp Ala Asp Thr Ala Ile Ile Asn Ala Glu Gly Gly Gln Asn 
420 425 430 

Asn Ser Glu Glu Lys Lys Glu Tyr Phe Ile 
435 440 


<210> 137 

<211> 1329 

<212> DNA 

<213> Homo sapiens 

<400> 137 

ATGGCGAGTG TAGTGCTGCC GAGCGGATCC CAGTGTGCGG CGGCAGCGGC GGCGGCGGCG 60 

CCTCCCGGGC TCCGGCTCCG GCTTCTGCTG TTGCTCTTCT CCGCCGCGGC ACTGATCCCC 120 

ACAGGTGATG GGCAGAATCT GTTTACGAAA GACGTGACAG TGATCGAGGG AGAGGTTGCG 180 

ACCATCAGTT GCCAAGTCAA TAAGAGTGAC GACTCTGTGA TTCAGCTACT GAATCCCAAC 240 

AGGCAGACCA TTTATTTCAG GGACTTCAGG CCTTTGAAGG ACAGCAGGTT TCAGTTGCTG 300 

AATTTTTCTA GCAGTGAACT CAAAGTATCA TTGACAAACG TCTCAATTTC TGATGAAGGA 360 

AGATACTTTT GCCAGCTCTA TACCGATCCC CCACAGGAAA GTTACACCAC CATCACAGTC 420 

CTGGTCCCAC CACGTAATCT GATGATCGAT ATCCAGAAAG ACACTGCGGT GGAAGGTGAG 480 

GAGATTGAAG TCAACTGCAC TGCTATGGCC AGCAAGCCAG CCACGACTAT CAGGTGGTTC 540 

AAAGGGAACA CAGAGCTAAA AGGCAAATCG GAGGTGGAAG AGTGGTCAGA CATGTACACT 600 

GTGACCAGTC AGCTGATGCT GAAGGTGCAC AAGGAGGACG ATGGGGTCCC AGTGATCTGC 660 

CAGGTGGAGC ACCCTGCGGT CACTGGAAAC CTGCAGACCC AGCGGTATCT AGAAGTACAG 720 

TATAAGCCTC AAGTGCACAT TCAGATGACT TATCCTCTAC AAGGCTTAAC CCGGGAAGGG 780 

GACGCGCTTG AGTTAACATG TGAAGCCATC GGGAAGCCCC AGCCTGTGAT GGTAACTTGG 840 

GTGAGAGTCG ATGATGAAAT GCCTCAACAC GCCGTACTGT CTGGGCCCAA CCTGTTCATC 900 

AATAACCTAA ACAAAACAGA TAATGGTACA TACCGCTGTG AAGCTTCAAA CATAGTGGGG 960 

AAAGCTCACT CGGATTATAT GCTGTATGTA TACGATCCCC CCACAACTAT CCCTCCTCCC 1020 

ACAACAACCA CCACCACCAC CACCACCACC ACCACCACCA TCCTTACCAT CATCACAGAT 1080 

TCCCGAGCAG GTGAAGAAGG CTCGATCAGG GCAGTGGATC ATGCCGTGAT CGGTGGCGTC 1140 

GTGGCGGTGG TGGTGTTCGC CATGCTGTGC TTGCTCATCA TTCTGGGGCG CTATTTTGCC 1200 

AGACATAAAG GTACATACTT CACTCATGAA GCCAAAGGAG CCGATGACGC AGCAGACGCA 1260 

GACACAGCTA TAATCAATGC AGAAGGAGGA CAGAACAACT CCGAAGAAAA GAAAGAGTAC 1320 

TTCATCTAG 1329 


<210> 138 
<211> 538 
<212> PRT 
<213> Homo sapiens 


<400> 138 
Met Ala Arg Ala Ala Ala Leu Leu Pro Ser Arg Ser Pro Pro Thr Pro 
1 5 10 15 

Leu Leu Trp Pro Leu Leu Leu Leu Leu Leu Leu Glu Thr Gly Ala Gln 
20 25 30 

Asp Val Arg Val Gln Val Leu Pro Glu Val Arg Gly Gln Leu Gly Gly 
35 40 45 

Thr Val Glu Leu Pro Cys His Leu Leu Pro Pro Val Pro Gly Leu Tyr 
50 55 60 

Ile Ser Leu Val Thr Trp Gln Arg Pro Asp Ala Pro Ala Asn His Gln 
65 70 75 80 

Asn Val Ala Ala Phe His Pro Lys Met Gly Pro Ser Phe Pro Ser Pro 
85 90 95 

Lys Pro Gly Ser Glu Arg Leu Ser Phe Val Ser Ala Lys Gln Ser Thr 
100 105 110 

Gly Gln Asp Thr Glu Ala Glu Leu Gln Asp Ala Thr Leu Ala Leu His 
115 120 125 

Gly Leu Thr Val Glu Asp Glu Gly Asn Tyr Thr Cys Glu Phe Ala Thr 
130 135 140 

Phe Pro Lys Gly Ser Val Arg Gly Met Thr Trp Leu Arg Val Ile Ala 
145 150 155 160 

Lys Pro Lys Asn Gln Ala Glu Ala Gln Lys Val Thr Phe Ser Gln Asp 
165 170 175 

Pro Thr Thr Val Ala Leu Cys Ile Ser Lys Glu Gly Arg Pro Pro Ala 
180 185 190 

Arg Ile Ser Trp Leu Ser Ser Leu Asp Trp Glu Ala Lys Glu Thr Gln 
195 200 205 

Val Ser Gly Thr Leu Ala Gly Thr Val Thr Val Thr Ser Arg Phe Thr 
210 215 220 


Leu Val Pro Ser Gly Arg Ala Asp Gly Val Thr Val Thr Cys Lys Val 
225 230 235 240 

Glu His Glu Ser Phe Glu Glu Pro Ala Leu Ile Pro Val Thr Leu Ser 
245 250 255 

Val Arg Tyr Pro Pro Glu Val Ser Ile Ser Gly Tyr Asp Asp Asn Trp 
260 265 270 

Tyr Leu Gly Arg Thr Asp Ala Thr Leu Ser Cys Asp Val Arg Ser Asn 
275 280 285 

Pro Glu Pro Thr Gly Tyr Asp Trp Ser Thr Thr Ser Gly Thr Phe Pro 
290 295 300 

Thr Ser Ala Val Ala Gln Gly Ser Gln Leu Val Ile His Ala Val Asp 
305 310 315 320 

Ser Leu Phe Asn Thr Thr Phe Val Cys Thr Val Thr Asn Ala Val Gly 
325 330 335 

Met Gly Arg Ala Glu Gln Val Ile Phe Val Arg Glu Thr Pro Asn Thr 
340 345 350 

Ala Gly Ala Gly Ala Thr Gly Gly Ile Ile Gly Gly Ile Ile Ala Ala 
355 360 365 

Ile Ile Ala Thr Ala Val Ala Ala Thr Gly Ile Leu Ile Cys Arg Gln 
370 375 380 

Gln Arg Lys Glu Gln Thr Leu Gln Gly Ala Glu Glu Asp Glu Asp Leu 
385 390 395 400 

Glu Gly Pro Pro Ser Tyr Lys Pro Pro Thr Pro Lys Ala Lys Leu Glu 
405 410 415 

Ala Gln Glu Met Pro Ser Gln Leu Phe Thr Leu Gly Ala Ser Glu His 
420 425 430 

Ser Pro Leu Lys Thr Pro Tyr Phe Asp Ala Gly Ala Ser Cys Thr Glu 
435 440 445 

Gln Glu Met Pro Arg Tyr His Glu Leu Pro Thr Leu Glu Glu Arg Ser 
450 455 460 

Gly Pro Leu His Pro Gly Ala Thr Ser Leu Gly Ser Pro Ile Pro Val 
465 470 475 480 

Pro Pro Gly Pro Pro Ala Val Glu Asp Val Ser Leu Asp Leu Glu Asp 
485 490 495 

Glu Glu Gly Glu Glu Glu Glu Glu Tyr Leu Asp Lys Ile Asn Pro Ile 
500 505 510 

Tyr Asp Ala Leu Ser Tyr Ser Ser Pro Ser Asp Ser Tyr Gln Gly Lys 
515 520 525 


Gly Phe Val Met Ser Arg Ala Met Tyr Val 
530 535 
<210> 139 

<211> 499 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (76) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (188) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (203) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (238) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (342) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (347) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (359) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (382) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (393) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (395) 

<223> n equals a,t,g, or c 
<220> 
<221> SITE 
<222> (405) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (406) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (431) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (440) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (444) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (446) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (449) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (469) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (471) 

<223> n equals a,t,g, or c 
<400> 139 


GACACGAGGC AAATCGGAGG TGGAAGAGTG GTCAGACATG TACACTGTGA CCAGTCAGCT 60 

GATGCTGAAG GTGCANAAGG AGGACGATGG GGTCCCAGTG ATCTGCCAGG TGGAGCACCC 120 

TGCGGTCACT GGAAACCTGC AGACCCAGCG GTATCTAGAA GTACAGTATA AGCCTCAAGT 180 

GCACATTNAG ATGACTTATC CTNTACAAGG CTTAACCCGG GAAGGGGACG CGCTTGANTT 240 

AACATGTGAA GCCATCGGGA AGCCCCAGCC TGTGAATGGT AAACTTGGGT GAGAAGTGCG 300 

ATGATGAAAT GCCTCAACAC GCCGTACTGT TTGGGCCCAA CNTGTTNCAT TCATTAACNT 360 

AAACAAAACA GTTAATGGTA CNTACCGTTG TGNANGTTTC AAACNNAGTG GGGGAAAGTT 420 

CATTCGGGTT NTATGCTGTN TGTNTNAGNT TCCCCCCACA ATTTTCCTNC NTCCCACAAC 480 

AACCACCACC ACCACCACC 499 


<210> 140 
<211> 245 
<212> DNA 
<213> Homo sapiens 


<220> 

<221> SITE 
<222> (44) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (46) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (152) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (154) 

<223> n equals a,t,g, or c 


<400> 140 

CAAGTTCCAA TATCACTGTC TCTTTATCAT CTAAATAGGG CCANTNGGAC ACCTCATTGA 60 

AACAAAAAGG CTGATCTAGA TGAAGTACTC TTTCTTTTCT TCGGAGTTGT TCTGTCCTCC 120 

TTCTGCATTG ATTATAGCTG TGTCTGCGTC TNCNGCGTCA TCGGCTCCTT TGGCTTCATG 180 

AGTGAAGTAT GTACCTTTAT GTCTGGCAAA ATAGCGCCCC AGAATGATGA GCAAGCACAG 240 

CATGG 245 


<210> 141 
<211> 324 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (5) 

<223> n equals a,t,g, or c 


<220> 

<221> SITE 
<222> (94) 
<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (148) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (256) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (316) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (317) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (322) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (324) 

<223> n equals a,t,g, or c 
<400> 141 

GGCANAGGGA ACACAGAGCT AAAAGGCAAA TCGGAGGTGG AAGAGTGGTC AGACATGTAC 60 
ACTGTGAACC AGTCAGCTGA TGCTGAAGGT GCANAAGGAG GACGATGGGG TCCCAGTGAA 120 
TCTGCCAGGT GGAGCACCCT GCGGTCANTG GAAACCTGCA GAGCCAGCGG TATCTAGAAG 180 
TACAGTATAA GCCTCAAGTG CACATTTCAG ATGACTTATC CTGTACAAGG CTTAACCCGG 240 
AAGGGGACGC GCTTTNAGTT TAACATGTGA AGCCATCGGG GAAGCCCCCA GCCTGTGAAT 300 

GGTAAATTTG GGTTANNAGT CNGN 324 


<210> 142 
<211> 468 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (89) 

<223> n equals a,t,g, or c 


<220> 
<221> SITE 
<222> (226) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (242) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (245) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (298) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (314) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (322) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (343) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (348) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (377) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (385) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (408) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (414) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (421) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (423) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (426) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (428) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (432) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (439) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (442) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (461) 

<223> n equals a,t,g, or c 


<400> 142 
GAATTCGGCA GAGCAGGTGC CCGACATGGC GAGTGTACTG CTGCCGAGCG GATCCCAGTG 60 

TGCGGCGGCA CGGCGGCGGC GGCGCTGCNC GGGCTCCGGC TCCGGCTTCT GCTGTTGCTC 120 

TTCTCCGCCG CGGCACTGAT CCCCACAGGT GATGGGCAGA ATCTGTTTAC GAAAGACGTG 180 

ACAGTGATCG AGGGAGAGGT TGCGACCATC AGTTGCCAAG TCAATNAAGA GTGACGACTC 240 

TNTGNATTCA GCTACTGAAT CCCAACAGGC AGACCATTTA TTTCAGGGGA CTTCAGGNCT 300 

TTGAAGGACA GCANGTTTTC ANTTGCTTGA AATTTTTCTA GCNATTGNAA CTCAAAAGTG 360 

TTCATTTGAC AAACGTNTCA ATTTNTGATG AAGGGAGGTT AATTTTTNGC CAANTTTTTT 420 

NANCCNTNCC CNAACAGGNA ANTTACAACC ACCTTTAAAA NTCCTGGT 468 

<210> 143 
<211> 279 
<212> DNA 
<213> Homo sapiens 


<220> 

<221> SITE 
<222> (23) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (118) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (190) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (201) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (223) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (238) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (260) 

<223> n equals a,t,g, or c 


<400> 143 
AAATGCCTCA ACACGCCGTA CTNTGTGGGC CCAACCTGTT CATCAATAAC CTAAACAAAA 60 

CAGATAATGG TACATACCGC TGTGAAGCTT CAAACATAGT GGGGAAAGCT CACTCGGNTT 120 

ATATGCTGTA TGTATACGAT CCCCCCACAA CTATCCCTCC TCCCACAACA ACCACCACCA 180 

CCACCACCAN CACCACCACC NCCATCCTTA CCATCATCAC AGNTTCCCGA GCAGGTGNAG 240 

AAGGCTCGAT CAGGGCAGTN GATCATGCCG TGATCGGTG 279 


<210> 144 
<211> 393 
<212> PRT 
<213> Homo sapiens 
<400> 144 

Met Trp Trp Arg Val Leu Ser Leu Leu Ala Trp Phe Pro Leu Gln Glu 
1 5 10 15 

Ala Ser Leu Thr Asn His Thr Glu Thr Ile Thr Val Glu Glu Gly Gln 
20 25 30 

Thr Leu Thr Leu Lys Cys Val Thr Ser Leu Arg Lys Asn Ser Ser Leu 
35 40 45 

Gln Trp Leu Thr Pro Ser Gly Phe Thr Ile Phe Leu Asn Glu Tyr Pro 
50 55 60 

Ala Leu Lys Asn Ser Lys Tyr Gln Leu Leu His His Ser Ala Asn Gln 
65 70 75 80 

Leu Ser Ile Thr Val Pro Asn Val Thr Leu Gln Asp Glu Gly Val Tyr 
85 90 95 

Lys Cys Leu His Tyr Ser Asp Ser Val Ser Thr Lys Glu Val Lys Val 
100 105 110 

Ile Val Leu Ala Thr Pro Phe Lys Pro Ile Leu Glu Ala Ser Val Ile 
115 120 125 

Arg Lys Gln Asn Gly Glu Glu His Val Val Leu Met Cys Ser Thr Met 
130 135 140 

Arg Ser Lys Pro Pro Pro Gln Ile Thr Trp Leu Leu Gly Asn Ser Met 
145 150 155 160 

Glu Val Ser Gly Gly Thr Leu His Glu Phe Glu Thr Asp Gly Lys Lys 
165 170 175 

Cys Asn Thr Thr Ser Thr Leu Ile Ile Leu Ser Tyr Gly Lys Asn Ser 
180 185 190 

Thr Val Asp Cys Ile Ile Arg His Arg Gly Leu Gln Gly Arg Lys Leu 
195 200 205 

Val Ala Pro Phe Arg Phe Glu Asp Leu Val Thr Asp Glu Glu Thr Ala 
210 215 220 

Ser Asp Ala Leu Glu Arg Asn Ser Leu Ser Thr Gln Asp Pro Gln Gln 
225 230 235 240 

Pro Thr Ser Thr Val Ser Val Thr Glu Asp Ser Ser Thr Ser Glu Ile 
245 250 255 

Asp Lys Glu Glu Lys Glu Gln Thr Thr Gln Asp Pro Asp Leu Thr Thr 
260 265 270 

Glu Ala Asn Pro Gln Tyr Leu Gly Leu Ala Arg Lys Lys Ser Gly Ile 
275 280 285 

Leu Leu Leu Thr Leu Val Ser Phe Leu Ile Phe Ile Leu Phe Ile Ile 
290 295 300 

Val Gln Leu Phe Ile Met Lys Leu Arg Lys Ala His Val Ile Trp Lys 
305 310 315 320 

Arg Glu Asn Glu Val Ser Glu His Thr Leu Glu Ser Tyr Arg Ser Arg 
325 330 335 


Ser Asn Asn Glu Glu Thr Ser Ser Glu Glu Lys Asn Gly Gln Ser Ser 
340 345 350 

Leu Pro Met Arg Cys Met Asn Tyr Ile Thr Lys Leu Tyr Ser Glu Ala 
355 360 365 

Lys Thr Lys Arg Lys Glu Asn Val Gln His Ser Lys Leu Glu Glu Lys 
370 375 380 

His Ile Gln Val Pro Glu Ser Ile Val 
385 390 


<210> 145 
<211> 439 
<212> PRT 
<213> Homo sapiens 


<400> 145 

Met Thr Phe Thr Thr Val Leu His Val Ala Ala Leu Leu Leu Leu Gln 
1 5 10 15 

Gly Asp Phe Pro Gly Ala Gly Ser Glu Thr Ile Thr Val Gln Glu Gly 
20 25 30 

Glu Asp Leu Asn Leu Arg Cys Thr Phe Thr Gly Asp Ser Arg Ala Thr 
35 40 45 

Lys Gln Trp Leu Asn Pro Arg Gly Phe Thr Ile Phe Leu Asp Asn His 
50 55 60 

Trp Gly Leu Lys Asp Gln Arg Tyr Arg Leu Ile His Tyr Ser Glu Asp 
65 70 75 80 

Glu Leu Ser Ile Arg Leu Ser Asn Ile Thr Val His Asp Glu Gly Val 
85 90 95 

Tyr Lys Cys Tyr Tyr Tyr Ser Thr Pro Phe Arg Ser Lys Met Thr Thr 
100 105 110 

Val Glu Val Leu Ala Ala Pro Ser Lys Pro Val Leu Gln Val Ser Arg 
115 120 125 

Asp Thr Glu Gly Arg Val Thr Leu Ser Cys Tyr Thr Gln Gly Cys Lys 
130 135 140 

Pro Gln Pro Gln Ile Thr Trp Leu Leu Asp Asn Gly Ile Gln Leu Pro 
145 150 155 160 
160 


Gly Asp Thr Arg His Lys Leu Glu Ala Asp Gly Lys Lys Trp Thr Thr 
165 170 175 

Thr Ser Thr Leu Thr Val Leu Ala Tyr Gly Pro Asn Ser Thr Ala Thr 
180 185 190 

Cys Leu Val His His Lys Ala Leu Gly Gly Gly Lys Leu Thr Glu Pro 
195 200 205 

Phe Gln Phe Glu Asp Val Ala Arg Thr Val Ala Asn Thr Thr Pro Val 
210 215 220 

Ser Thr Thr Leu Glu Val Asp Thr Tyr Val Ser Glu Tyr Val Gln Pro 
225 230 235 240 

Thr Val Thr Thr Ala Glu Ser Asp Leu Asn Ser Asn Thr Asp Phe Ser 
245 250 255 

Pro Ser Tyr Pro Gln His Asn Gly Ser Gly Ala Thr Ser Val Ala Ala 
260 265 270 

Gly Glu Leu Ser Gly Thr Ser Ala His His Ile Pro Glu Gly Thr Glu 
275 280 285 

Thr Ala Leu Asn Gly Thr Val Thr Glu Glu Leu Phe Arg Thr Glu Ala 
290 295 300 

Ser Phe Pro Ser Glu Asn Val Thr Leu Ile Ser Ile Val Thr Phe Glu 
305 310 315 320 

Gln Asp Val Lys Ser Glu Gly Met Ser Lys Lys Glu Lys Asp Phe Leu 
325 330 335 

Leu Pro Leu Leu Val Ala Val Leu Ile Val Met Leu Leu Ile Ile Val 
340 345 350 

Val Leu Phe Ser Lys Lys Leu Ile Lys Pro His Gly Val Trp Lys Arg 
355 360 365 

Glu Asn Asp Thr Ser Asp Gln Thr Leu Glu Ser Tyr Lys Ser Lys Ser 
370 375 380 

Asn Glu Glu Ser Pro Gly His Glu Lys Asn Gly Gln Ala Val Ser Gln 
385 390 395 400 

Lys Pro Asn Met Gln Tyr Val Thr Glu Gly Tyr Val Glu Ala Thr Gln 
405 410 415 

Lys Asn Pro Ser Glu Lys Asn Thr Lys Leu Pro Glu Glu Gln Phe Ala 
420 425 430 

Arg Gly Lys Glu Thr Asp Val 
435 


<210> 146 
<211> 252 
<212> PRT 
<213> Homo sapiens 
<400> 146 

Ser Arg Asp Gly Phe His Arg Val Ser Gln Asp Gly Leu Asp Leu Leu 
1 5 10 15 

Thr Pro Cys Ser Ser Pro Leu Gly Leu Pro Lys Cys Trp Asp Tyr Arg 
20 25 30 

Gly Asp Pro Pro Arg Pro Val Leu Glu Asp Gly Ser Glu Ser Leu Glu 
35 40 45 

Tyr Leu Ser Ser Ser Asn Leu Lys Glu Val Leu Ala Cys Arg Gly Ser 
50 55 60 

Leu His Gly Trp Ala Gln Leu Val His Leu Pro Phe Ser Ala Tyr Ala 
65 70 75 80 

Gly Tyr Ser Ser Glu Pro Gly Thr Leu Leu Ser Ala Glu Leu Lys Leu 
85 90 95 

His Thr Met Val Leu Trp Pro Gln Phe Tyr Arg Ser Ile Leu Tyr Leu 
100 105 110 

Leu Tyr Trp Leu Leu Arg Gly Arg Arg Asn Asn Thr Lys Pro Lys Pro 
115 120 125 

Phe Cys Cys Asp His Pro Pro Ser Tyr Pro Leu His Phe Arg Leu Tyr 
130 135 140 

Gln Met Glu Lys Thr Leu Ser Gly Asp Val His His Gln Tyr Tyr His 
145 150 155 160 

Gln Asp Phe Ser Arg Lys Tyr Tyr His Pro Gly Ile Cys Trp Leu Gln 
165 170 175 

Leu Leu Leu Leu Ser Ala Pro Phe His Ser Ile Asn Met Leu Arg Glu 
180 185 190 

Phe Ala Ile Leu Ser Asn Ile Leu Met His Ser His Lys Leu Gln Cys 
195 200 205 

Asn Ser Leu Leu Phe Met Tyr Lys Val Arg Asn Leu Cys Leu Leu Pro 
210 215 220 

Cys Trp His Leu Pro Leu Lys Ser Lys Ser Lys Cys Ser Tyr Ala Arg 
225 230 235 240 

Glu Ser Arg Val Thr Leu Phe Tyr Gly Ser Cys Ser 
245 250 


<210> 147 
<211> 369 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (5) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (19) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (80) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (152) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (251) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (255) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (286) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (301) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (346) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (350) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (368) 

<223> n equals a,t,g, or c 
<400> 147 

GGCANAGTGA CACCAGGGNA GGGAATTAAT TCTCCCAAGG AAAACGCATT GAGTTACATG 60 

AAAAGGGGTC TAGGATTGAN CACTGCACTG AGAAACAACA ACATCTAGAT GATGGATAGG 120 

GGAGGAAGAA AGATTTCAGT TAAGGAGAAT GNAAGTTTCT CCTGAAATGA AAGGAAGAAA 180 

ACCAGGATAG TGCCATGCAA AGGAAGGTGT TTCAAAGAGG AGGAAGAGAT TAACAGTGCC 240 

AAATGCGACT NCGANTGTCA AAGAAGTTTG AAAGAAATAA GGGTTNTCCT GGGTAGCCTT 300 

NGTTAGAGTA GTTCTGGTGG AATGGTAGAC ATGGGAGTTA CCTTTNTATN GGTTGGGAAG 360 

GAAAGGGNG 369 